Dataset Information

Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features.


ABSTRACT: Lung cancer is the most prevalent cancer worldwide, and histopathological assessment is indispensable for its diagnosis. However, human evaluation of pathology slides cannot accurately predict patients' prognoses. In this study, we obtain 2,186 haematoxylin and eosin stained histopathology whole-slide images of lung adenocarcinoma and squamous cell carcinoma patients from The Cancer Genome Atlas (TCGA), and 294 additional images from Stanford Tissue Microarray (TMA) Database. We extract 9,879 quantitative image features and use regularized machine-learning methods to select the top features and to distinguish shorter-term survivors from longer-term survivors with stage I adenocarcinoma (P<0.003) or squamous cell carcinoma (P=0.023) in the TCGA data set. We validate the survival prediction framework with the TMA cohort (P<0.036 for both tumour types). Our results suggest that automatically derived image features can predict the prognosis of lung cancer patients and thereby contribute to precision oncology. Our methods are extensible to histopathology images of other organs.
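The feature-selection step described in the abstract can be illustrated with a minimal sketch. The abstract says only that "regularized machine-learning methods" were used, so the specific technique here (L1-penalized logistic regression fitted by proximal gradient descent) is an assumption chosen for illustration, not the authors' pipeline. The data are synthetic, and every name (`make_synthetic_cohort`, `fit_l1_logistic`, the penalty and step values) is hypothetical.

```python
import math
import random


def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (proximal step for an L1 penalty)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(weights, bias, row):
    """Probability of the positive class for one feature vector."""
    return sigmoid(sum(w * x for w, x in zip(weights, row)) + bias)


def fit_l1_logistic(X, y, penalty=0.05, step=0.1, iterations=300):
    """Fit L1-penalized logistic regression by proximal gradient descent."""
    n, d = len(X), len(X[0])
    weights, bias = [0.0] * d, 0.0
    for _ in range(iterations):
        grad_w, grad_b = [0.0] * d, 0.0
        for row, label in zip(X, y):
            err = predict_proba(weights, bias, row) - label
            for j in range(d):
                grad_w[j] += err * row[j] / n
            grad_b += err / n
        # Gradient step on the log-loss, then shrinkage from the L1 penalty.
        weights = [soft_threshold(w - step * g, step * penalty)
                   for w, g in zip(weights, grad_w)]
        bias -= step * grad_b
    return weights, bias


def make_synthetic_cohort(n_patients=200, n_features=20, seed=0):
    """Synthetic stand-in for image features; only features 0 and 1 carry signal."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n_patients):
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        score = 2.0 * row[0] - 2.0 * row[1] + rng.gauss(0.0, 0.5)
        X.append(row)
        y.append(1 if score > 0 else 0)  # 1 = longer-term, 0 = shorter-term
    return X, y


X, y = make_synthetic_cohort()
weights, bias = fit_l1_logistic(X, y)

# Features with non-zero weight are the ones the penalty kept.
selected = [j for j, w in enumerate(weights) if w != 0.0]
predictions = [1 if predict_proba(weights, bias, row) >= 0.5 else 0 for row in X]
accuracy = sum(p == t for p, t in zip(predictions, y)) / len(y)

print("selected feature indices:", selected)
print("training accuracy:", round(accuracy, 3))
```

The L1 penalty drives the weights of uninformative features toward exactly zero, which is why it serves as a selection step as well as a classifier. A real analysis would evaluate on held-out data (the abstract reports validation on a separate TMA cohort) and would work with survival times, not a synthetic binary label.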

SUBMITTER: Yu KH 

PROVIDER: S-EPMC4990706 | biostudies-other | 2016 Aug

REPOSITORIES: biostudies-other

Publications

Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features.

Yu Kun-Hsing, Zhang Ce, Berry Gerald J, Altman Russ B, Ré Christopher, Rubin Daniel L, Snyder Michael

Nature Communications, 2016-08-16



Similar Datasets

S-EPMC4941525 | biostudies-literature
S-EPMC8346561 | biostudies-literature
S-EPMC6894590 | biostudies-literature
S-EPMC7471726 | biostudies-literature
S-EPMC5856260 | biostudies-literature
S-EPMC6138894 | biostudies-other
S-EPMC7403709 | biostudies-literature
S-EPMC8931011 | biostudies-literature
S-EPMC6345704 | biostudies-other
S-EPMC6977110 | biostudies-literature