
Dataset Information


Mass Spectrometry Imaging for Reliable and Fast Classification of Non-Small Cell Lung Cancer Subtypes.


ABSTRACT: Subtyping of non-small cell lung cancer (NSCLC) is paramount for therapy stratification. In this study, we analyzed the largest NSCLC cohort by mass spectrometry imaging (MSI) to date. We sought to test different classification algorithms and to validate results obtained in smaller patient cohorts. Tissue microarrays (TMAs) including adenocarcinoma (ADC, n = 499) and squamous cell carcinoma (SqCC, n = 440) were analyzed. Linear discriminant analysis, support vector machine, and random forest (RF) were applied using samples randomly assigned for training (66%) and validation (33%). The m/z species most relevant for the classification were identified by on-tissue tandem mass spectrometry and validated by immunohistochemistry (IHC). Measurements from multiple TMAs were comparable when standardized protocols were used. RF yielded the best classification results. The classification accuracy decreased when fewer than six of the most relevant m/z species were included. The sensitivity and specificity of MSI in the validation cohort were 92.9% and 89.3%, comparable to IHC. The most important protein for discriminating between the two tumor types was cytokeratin 5. We investigated the largest NSCLC cohort by MSI to date and found that the classification of NSCLC into ADC and SqCC is possible with high accuracy using a limited set of m/z species.
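
The evaluation scheme described in the abstract (random assignment of samples to training and validation sets, then reporting sensitivity and specificity) can be illustrated with a minimal Python sketch. This is not the authors' code. The function names, the fixed seed, the two-thirds training fraction, and the choice of ADC as the "positive" class are all assumptions made for illustration, and the labels in the usage lines are toy values, not study data.

```python
import random


def split_train_validation(sample_ids, train_fraction=2 / 3, seed=0):
    """Randomly assign samples to a training and a validation set.

    train_fraction and seed are illustrative assumptions; the abstract
    only states an approximate 66% / 33% split.
    """
    rng = random.Random(seed)
    shuffled = list(sample_ids)
    rng.shuffle(shuffled)
    n_train = round(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]


def sensitivity_specificity(true_labels, predicted_labels, positive="ADC"):
    """Compute sensitivity and specificity for a two-class problem.

    Treating ADC as the positive class is an assumption; the abstract
    does not say which subtype was used as the reference class.
    """
    tp = fn = tn = fp = 0
    for truth, pred in zip(true_labels, predicted_labels):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Toy usage with hypothetical sample identifiers and labels.
train_ids, validation_ids = split_train_validation(range(939))
toy_truth = ["ADC", "ADC", "SqCC", "SqCC"]
toy_pred = ["ADC", "SqCC", "SqCC", "SqCC"]
sens, spec = sensitivity_specificity(toy_truth, toy_pred)
```

In practice the classifier itself (for example a random forest) would be fitted on the training set only, and the two metrics would be computed on the held-out validation set.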

SUBMITTER: Kriegsmann M 

PROVIDER: S-EPMC7564257 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature



Similar Datasets

S-EPMC7608526 | biostudies-literature
S-EPMC6722907 | biostudies-literature
S-EPMC5054336 | biostudies-literature
S-EPMC7183755 | biostudies-literature
GSE89818 | GEO | 2016-11-15
S-EPMC7540554 | biostudies-literature
S-EPMC6096711 | biostudies-literature
S-EPMC3125788 | biostudies-literature
S-EPMC8456075 | biostudies-literature
S-EPMC3755500 | biostudies-literature