Unknown

Dataset Information

0

Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model.


ABSTRACT: Lung adenocarcinoma (LAD) is difficult to diagnose as it tends to be small in size and metastasize early. The aim of the present study was to investigate prognostic factors for patients with LAD and establish a prognosis risk model. A training set consisting of clinical and RNA sequencing data from 503 patients with LAD, as well as expression data from a further 59 LAD and adjacent tissues, was obtained from The Cancer Genome Atlas. Additionally, a validation dataset was acquired from the Gene Expression Omnibus database (GSE26939), which included clinical and gene expression data from 115 patients. Using the DESeq2 package to compare expression between LAD and adjacent tissues, differentially expressed genes (DEGs) were identified. On the basis of survival and the random forests for survival, regression and classification package, genes for constructing the prognosis risk model were selected. The prognosis risk model was constructed and validated using the survival package. Subsequently, high- and low-risk groups were compared using the Limma package to identify DEGs, and enrichment analysis was performed using the web-based gene set analysis toolkit. A protein-protein interaction network was visualized using Cytoscape software. There were 18,567 DEGs between the LAD samples and the adjacent tissues, and 363 DEGs between the high- and low-risk groups. Of these, four genes were selected for constructing the prognosis risk model, myosin IE (MYO1E), endoplasmic reticulum oxidoreductase 1? (ERO1L), C1q and tumor necrosis factor-related protein 6 (C1QTNF6) and family with sequence similarity 83, member A (FAM83A). The survival time of high- and low-risk groups in the validation set were significantly different. Functional enrichment revealed that the genes that interacted with MYO1E, ERO1L, C1QTNF6 and FAM83A separately were enriched in 'cell cycle regulation', 'synthesis and assembly of nucleic acids', 'histone modification and cell cycle progression' and 'cell secretion process'. The four-gene prognosis risk model could potentially be used for predicting the survival of patients with LAD.

SUBMITTER: Zhang W 

PROVIDER: S-EPMC6539490 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model.

Zhang Wei W   Shen Yang Y   Feng Ganzhu G  

Oncology letters 20190517 1


Lung adenocarcinoma (LAD) is difficult to diagnose as it tends to be small in size and metastasize early. The aim of the present study was to investigate prognostic factors for patients with LAD and establish a prognosis risk model. A training set consisting of clinical and RNA sequencing data from 503 patients with LAD, as well as expression data from a further 59 LAD and adjacent tissues, was obtained from The Cancer Genome Atlas. Additionally, a validation dataset was acquired from the Gene E  ...[more]

Similar Datasets

| S-EPMC7720456 | biostudies-literature
| S-EPMC8465999 | biostudies-literature
| S-EPMC8034940 | biostudies-literature
| S-EPMC9243036 | biostudies-literature
| S-EPMC8504460 | biostudies-literature
| S-EPMC7439715 | biostudies-literature
| S-EPMC7779611 | biostudies-literature
| S-EPMC6399972 | biostudies-literature
| S-EPMC8283194 | biostudies-literature
| S-EPMC8766344 | biostudies-literature