Publication | Open Access
A deep learning based graph-transformer for whole slide image classification
22
Citations
21
References
2021
Year
Unknown Venue
Convolutional Neural NetworkMedical Image SegmentationStain NormalizationMachine LearningGraph Representation LearningEngineeringDigital PathologyPathologyGraph ProcessingImage AnalysisData ScienceBiopsy SlidesAbstract Deep LearningPredictive BiomarkersVideo TransformerMedical ImagingComputational PathologyMedical Image ComputingDeep LearningLung CancerComputer VisionComputer-aided DiagnosisGraph Neural NetworkMedicineSpatial Information
Abstract Deep learning is a powerful tool for assessing pathology data obtained from digitized biopsy slides. In the context of supervised learning, most methods typically divide a whole slide image (WSI) into patches, aggregate convolutional neural network outcomes on them and estimate overall disease grade. However, patch-based methods introduce label noise in training by assuming that each patch is independent with the same label as the WSI and neglect the important contextual information that is significant in disease grading. Here we present a Graph-Transformer (GT) based framework for processing pathology data, called GTP, that interprets morphological and spatial information at the WSI-level to predict disease grade. To demonstrate the applicability of our approach, we selected 3,024 hematoxylin and eosin WSIs of lung tumors and with normal histology from the Clinical Proteomic Tumor Analysis Consortium, the National Lung Screening Trial, and The Cancer Genome Atlas, and used GTP to distinguish adenocarcinoma (LUAD) and squamous cell carcinoma (LSCC) from those that have normal histology. Our model achieved consistently high performance on binary (tumor versus normal: mean overall accuracy = 0.975 ± 0.013) as well as three-label (normal versus LUAD versus LSCC: mean accuracy = 0.932 ± 0.019) classification on held-out test data, underscoring the power of GT-based deep learning for WSI-level classification. We also introduced a graphbased saliency mapping technique, called GraphCAM, that captures regional as well as contextual information and allows our model to highlight WSI regions that are highly associated with the class label. Taken together, our findings demonstrate GTP as a novel interpretable and effective deep learning framework for WSI-level classification.
| Year | Citations | |
|---|---|---|
Page 1
Page 1