Copy number variations distinguish lung adenocarcinomas from squamous cell carcinomas
Ontology highlight
ABSTRACT: For a number of clinical and biological reasons, the accurate classification of non-small cell lung carcinoma (NSCLC) into adenocarcinoma (ADC) and squamous cell carcinoma (SCC) is essential. DNA-based tests, which are not currently used, are more robust when applied to formalin-fixed paraffin-embedded tissues. To develop a molecular-based classification of NSCLC based on genome wide copy number variations (CNVs), the corresponding TCGA, SPORE and CANARY patient datasets were used as training and independent validation data. The signature genes were selected by advanced supervised classification algorithms and restricted to known important oncogenes/tumor suppressors, resulting in a final 27-gene signature that was able to classify ADC from SCC with 0.85-0.87 accuracies of SPORE validation sets and 0.96-0.98 accuracy of CANARY validation sets. Even by using the top 7 genes in this signature, the accuracies of the validation sets were still as high as 0.80 and 0.97, respectively. These signature genes also classified adenocarcinoma and squamous cell carcinomas from the non-malignant lung samples with accuracies of 91-97%.
ORGANISM(S): Homo sapiens
PROVIDER: GSE74948 | GEO | 2015/11/13
SECONDARY ACCESSION(S): PRJNA301991
REPOSITORIES: GEO
ACCESS DATA