Unknown

Dataset Information

0

Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration.


ABSTRACT: Early diagnosis of lung cancer to increase the survival rate, which is currently at a low range of mid-30%, remains a critical need. Despite this, multi-omics data have rarely been applied to non-small-cell lung cancer (NSCLC) diagnosis. We developed a multi-omics data-affinitive artificial intelligence algorithm based on the graph convolutional network that integrates mRNA expression, DNA methylation, and DNA sequencing data. This NSCLC prediction model achieved a 93.7% macro F1-score, indicating that values for false positives and negatives were substantially low, which is desirable for accurate classification. Gene ontology enrichment and pathway analysis of features revealed that two major subtypes of NSCLC, lung adenocarcinoma and lung squamous cell carcinoma, have both specific and common GO biological processes. Numerous biomarkers (i.e., microRNA, long non-coding RNA, differentially methylated regions) were newly identified, whereas some biomarkers were consistent with previous findings in NSCLC (e.g., SPRR1B). Thus, using multi-omics data integration, we developed a promising cancer prediction algorithm.

SUBMITTER: Park MK 

PROVIDER: S-EPMC9775093 | biostudies-literature | 2022 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration.

Park Min-Koo MK   Lim Jin-Muk JM   Jeong Jinwoo J   Jang Yeongjae Y   Lee Ji-Won JW   Lee Jeong-Chan JC   Kim Hyungyu H   Koh Euiyul E   Hwang Sung-Joo SJ   Kim Hong-Gee HG   Kim Keun-Cheol KC  

Biomolecules 20221208 12


Early diagnosis of lung cancer to increase the survival rate, which is currently at a low range of mid-30%, remains a critical need. Despite this, multi-omics data have rarely been applied to non-small-cell lung cancer (NSCLC) diagnosis. We developed a multi-omics data-affinitive artificial intelligence algorithm based on the graph convolutional network that integrates mRNA expression, DNA methylation, and DNA sequencing data. This NSCLC prediction model achieved a 93.7% macro F1-score, indicati  ...[more]

Similar Datasets

| S-EPMC7753845 | biostudies-literature
| S-EPMC7992339 | biostudies-literature
| S-EPMC10019780 | biostudies-literature
| S-EPMC9259796 | biostudies-literature
| S-EPMC6050171 | biostudies-literature
| S-EPMC6612815 | biostudies-literature
| S-EPMC6022810 | biostudies-literature
| S-EPMC10328436 | biostudies-literature
| S-EPMC5907722 | biostudies-literature
| S-EPMC6201709 | biostudies-literature