Dataset Information

Exploration of predictive and prognostic alternative splicing signatures in lung adenocarcinoma using machine learning methods.


ABSTRACT:

Background

Alternative splicing (AS) plays critical roles in generating protein diversity and complexity. Dysregulation of AS underlies the initiation and progression of tumors. Machine learning approaches have emerged as efficient tools to identify promising biomarkers. Applying machine learning algorithms to pivotal AS events (ASEs) could therefore deepen understanding and improve prognostic assessment of lung adenocarcinoma (LUAD).

Method

RNA sequencing data and AS data were extracted from The Cancer Genome Atlas (TCGA) database and the TCGA SpliceSeq database. Using several machine learning methods, we identified 24 pairs of LUAD-related ASEs implicated in splicing switches and built a random forest-based classifier of 12 ASEs for identifying lymph node metastasis (LNM). Furthermore, we identified key prognosis-related ASEs and established a 16-ASE-based prognostic model to predict overall survival for LUAD patients using Cox regression, random survival forest analysis, and forward selection. Bioinformatics analyses were also applied to identify underlying mechanisms and associated upstream splicing factors (SFs).
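As a rough illustration of how an ASE-based prognostic model of this kind is typically applied, the sketch below computes a Cox-style linear risk score per patient and splits patients at the median. The abstract does not give the model's form, so the linear score, the median cutoff, the event names, the coefficients, and the PSI values are all hypothetical assumptions, not the authors' 16-ASE model.

```python
import statistics

def risk_score(psi, coefs):
    """Linear predictor: sum of coefficient * PSI over the model's events."""
    return sum(coefs[event] * psi[event] for event in coefs)

# Hypothetical coefficients for three made-up events (not from the paper).
coefs = {"ASE_1": 1.2, "ASE_2": -0.8, "ASE_3": 0.5}

# Hypothetical percent-spliced-in (PSI) values per patient, scaled to 0..1.
patients = {
    "P1": {"ASE_1": 0.9, "ASE_2": 0.1, "ASE_3": 0.5},
    "P2": {"ASE_1": 0.2, "ASE_2": 0.8, "ASE_3": 0.4},
    "P3": {"ASE_1": 0.5, "ASE_2": 0.5, "ASE_3": 0.5},
    "P4": {"ASE_1": 0.7, "ASE_2": 0.3, "ASE_3": 0.9},
}

scores = {pid: risk_score(psi, coefs) for pid, psi in patients.items()}

# Assumed stratification rule: above the cohort median means high risk.
cutoff = statistics.median(scores.values())
groups = {pid: ("high" if s > cutoff else "low") for pid, s in scores.items()}

for pid in sorted(scores):
    print(pid, round(scores[pid], 2), groups[pid])
```

In a real analysis the coefficients would come from a fitted Cox model and the cutoff would be fixed on the training cohort before being applied to the test cohort.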

Results

Each pair of ASEs was spliced from the same parent gene and exhibited perfect inverse intrapair correlation (correlation coefficient = -1). The 12-ASE-based classifier showed robust ability to evaluate the LNM status of LUAD patients, with an area under the receiver operating characteristic (ROC) curve (AUC) greater than 0.7 in fivefold cross-validation. The prognostic model performed well at 1, 3, 5, and 10 years in both the training cohort and the internal test cohort. Univariate and multivariate Cox regression indicated that the prognostic model could be used as an independent prognostic factor for patients with LUAD. Further analysis revealed correlations between the prognostic model and American Joint Committee on Cancer stage, T stage, N stage, and living status. The splicing network constructed from survival-related SFs and ASEs depicts the regulatory relationships between them.
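One way a correlation coefficient of exactly -1 can arise is when the two events in a pair are complementary, so that their percent-spliced-in (PSI) values sum to a constant. The abstract does not state this; the sketch below assumes it, and the PSI values are made up for illustration.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)

# Hypothetical PSI values (0..1) for one event across five samples.
psi_a = [0.10, 0.35, 0.60, 0.80, 0.95]

# Assumption: the paired event is the complementary isoform, PSI_b = 1 - PSI_a.
psi_b = [1.0 - p for p in psi_a]

r = pearson(psi_a, psi_b)
print(round(r, 6))
```

Because `psi_b` is an exact decreasing linear function of `psi_a`, the coefficient equals -1 up to floating-point rounding, whatever the individual PSI values are.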

Conclusion

In summary, our study provides insight into LUAD research and management based on these AS biomarkers.

SUBMITTER: Cai Q 

PROVIDER: S-EPMC7720605 | biostudies-literature | 2020 Dec

REPOSITORIES: biostudies-literature

Publications

Exploration of predictive and prognostic alternative splicing signatures in lung adenocarcinoma using machine learning methods.

Qidong Cai, Boxue He, Pengfei Zhang, Zhenyu Zhao, Xiong Peng, Yuqian Zhang, Hui Xie, Xiang Wang

Journal of Translational Medicine, 2020-12-07, issue 1


Similar Datasets

| S-EPMC10665256 | biostudies-literature
| S-EPMC6591313 | biostudies-literature
| S-EPMC10229769 | biostudies-literature
2021-06-04 | GSE166865 | GEO
| S-EPMC7272712 | biostudies-literature
| S-EPMC8873598 | biostudies-literature
| S-EPMC6684923 | biostudies-literature
| S-EPMC8141138 | biostudies-literature
| S-EPMC10548142 | biostudies-literature
| S-EPMC7693837 | biostudies-literature