
Dataset Information


Identification of lncRNA biomarkers for lung cancer through integrative cross-platform data analyses.


ABSTRACT: This study was designed to identify lncRNA biomarker candidates using lung cancer data from RNA-Seq and microarray platforms separately. Lung cancer datasets were obtained from the Gene Expression Omnibus (GEO, n = 287) and The Cancer Genome Atlas (TCGA, n = 216) repositories, and only lncRNAs common to the datasets were used. Differentially expressed (DE) lncRNAs in tumors relative to normal samples were selected from the Affymetrix and TCGA datasets. A model trained on the top 20 DE Affymetrix lncRNAs was validated in the TCGA and Agilent datasets. A second, similar model was trained on the TCGA dataset. The model trained on the top 20 DE lncRNAs from Affymetrix achieved high prediction accuracy in both training (98.5% AUC for Affymetrix) and validation (99.2% AUC for TCGA and 92.8% AUC for Agilent). The model trained on the top 20 DE lncRNAs from TCGA and validated on Affymetrix and Agilent also achieved high prediction accuracy in both training (97.7% AUC for TCGA) and validation (96.5% AUC for Affymetrix and 80.9% AUC for Agilent). Eight lncRNAs were shared between the two top-20 lists.
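The workflow described in the abstract (rank DE lncRNAs on one platform, build a signature from the top-ranked ones, score a second platform, report AUC, then intersect the two top lists) can be sketched as follows. This is a minimal illustration on simulated data, not the authors' pipeline: the record does not say which DE test or classifier was used, so the Welch t statistic, the signed z-score sum, the per-dataset z-scoring, the `simulate` helper, and all names such as `lnc0` are assumptions made for the example.

```python
import math
import random
from statistics import mean, variance


def welch_t(tumor, normal):
    """Welch's t statistic for one lncRNA (tumor vs. normal expression)."""
    se = math.sqrt(variance(tumor) / len(tumor) + variance(normal) / len(normal))
    return (mean(tumor) - mean(normal)) / se if se > 0 else 0.0


def top_de(expr, labels, k):
    """Return {lncRNA: direction} for the k lncRNAs with the largest |t|."""
    stats = {}
    for name, values in expr.items():
        tumor = [v for v, y in zip(values, labels) if y == 1]
        normal = [v for v, y in zip(values, labels) if y == 0]
        stats[name] = welch_t(tumor, normal)
    ranked = sorted(stats, key=lambda n: abs(stats[n]), reverse=True)[:k]
    return {n: (1 if stats[n] > 0 else -1) for n in ranked}


def zscores(values):
    """Standardize within a dataset so platforms with different scales are comparable."""
    m = mean(values)
    s = math.sqrt(variance(values)) or 1.0
    return [(v - m) / s for v in values]


def signature_scores(expr, signature):
    """Per-sample score: sum of z-scores, signed by the direction seen in training."""
    n_samples = len(next(iter(expr.values())))
    scores = [0.0] * n_samples
    for name, sign in signature.items():
        for i, z in enumerate(zscores(expr[name])):
            scores[i] += sign * z
    return scores


def auc(scores, labels):
    """AUC as the fraction of (tumor, normal) pairs ranked correctly; ties count half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def simulate(seed, n_samples, scale):
    """Hypothetical platform: 30 lncRNAs, the first 5 shifted upward in tumors."""
    rng = random.Random(seed)
    labels = [i % 2 for i in range(n_samples)]  # 1 = tumor, 0 = normal
    expr = {}
    for g in range(30):
        effect = 3.0 if g < 5 else 0.0
        expr[f"lnc{g}"] = [scale * (rng.gauss(0, 1) + effect * y) for y in labels]
    return expr, labels


# Two simulated "platforms" measuring the same lncRNAs on different scales.
train_expr, train_y = simulate(seed=1, n_samples=60, scale=1.0)
valid_expr, valid_y = simulate(seed=2, n_samples=40, scale=3.0)

# Train on platform A, validate on platform B.
sig_a = top_de(train_expr, train_y, k=5)
train_auc = auc(signature_scores(train_expr, sig_a), train_y)
valid_auc = auc(signature_scores(valid_expr, sig_a), valid_y)

# Repeat the selection on platform B and intersect the two top lists.
sig_b = top_de(valid_expr, valid_y, k=5)
shared = sorted(set(sig_a) & set(sig_b))

print(f"training AUC = {train_auc:.3f}, validation AUC = {valid_auc:.3f}")
print("shared lncRNAs:", shared)
```

Standardizing each dataset separately is one simple way to apply a signature across platforms whose raw intensities are not directly comparable; a real analysis would more likely use an established DE package and a fitted classifier.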

SUBMITTER: Zhao T 

PROVIDER: S-EPMC7425463 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature


Publications

Identification of lncRNA biomarkers for lung cancer through integrative cross-platform data analyses.

Tianying Zhao, Vedbar Singh Khadka, Youping Deng

Aging, 2020 Jul 16, issue 14



Similar Datasets

S-EPMC11233682 | biostudies-literature
S-EPMC4820141 | biostudies-literature
S-EPMC6380409 | biostudies-literature
EGAS00001000711 | EGA
S-EPMC7906147 | biostudies-literature
S-EPMC8131310 | biostudies-literature
S-EPMC7294636 | biostudies-literature
S-EPMC7295234 | biostudies-literature
S-EPMC2691408 | biostudies-literature
S-EPMC8962515 | biostudies-literature