
Dataset Information

A robust data-driven genomic signature for idiopathic pulmonary fibrosis with applications for translational model selection.


ABSTRACT: Idiopathic pulmonary fibrosis (IPF) is a chronic and progressive lung disease affecting ~5 million people globally. We have constructed an accurate model of IPF disease status using elastic net regularized regression on clinical gene expression data. Leveraging whole transcriptome microarray data from 230 IPF and 89 control samples from Yang et al. (2013), sourced from the Lung Tissue Research Consortium (LTRC) and National Jewish Health (NJH) cohorts, we identify an IPF gene expression signature. We performed optimal feature selection to reduce the number of transcripts required by our model to a parsimonious set of 15. This signature enables our model to accurately separate IPF patients from controls. Our model outperforms existing published models when tested with multiple independent clinical cohorts. Our study underscores the utility of elastic nets for gene signature/panel selection which can be used for the construction of a multianalyte biomarker of disease. We also filter the gene sets used for model input to construct a model reliant on secreted proteins. Using this approach, we identify the preclinical bleomycin rat model that is most congruent with human disease at day 21 post-bleomycin administration, contrasting with earlier timepoints suggested by other studies.
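The elastic net approach described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: it uses synthetic data rather than the Yang et al. (2013) cohorts, a hand-written proximal gradient solver rather than whatever software the study used, and every name and parameter value here (`fit_elastic_net_logistic`, `lam`, `alpha`, the sample sizes, the three "informative" features) is an assumption chosen for illustration. It shows only the general idea: an L1 plus L2 penalty on a logistic model drives most coefficients to exactly zero, leaving a small signature of features.

```python
import math
import random


def soft_threshold(value, threshold):
    """Proximal operator of the L1 penalty: shrink toward zero, clip at zero."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_elastic_net_logistic(X, y, lam=0.1, alpha=0.9, lr=0.1, n_iter=200):
    """Logistic regression with penalty lam * (alpha*|w|_1 + (1-alpha)/2*|w|^2),
    fitted by proximal gradient descent. All defaults are illustrative."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(n_iter):
        grad_w, grad_b = [0.0] * p, 0.0
        for row, label in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, row))) - label
            grad_b += err / n
            for j in range(p):
                grad_w[j] += err * row[j] / n
        b -= lr * grad_b  # the intercept is not penalized
        for j in range(p):
            # Gradient step on the smooth part (log-loss + ridge term),
            # then soft-thresholding for the L1 term.
            step = w[j] - lr * (grad_w[j] + lam * (1.0 - alpha) * w[j])
            w[j] = soft_threshold(step, lr * lam * alpha)
    return w, b


def make_samples(n, p, informative, shift, rng):
    """Synthetic 'expression' data: only `informative` columns differ by class."""
    X, y = [], []
    for i in range(n):
        label = i % 2  # balanced cases (1) and controls (0)
        sign = 1.0 if label else -1.0
        X.append([rng.gauss(0.0, 1.0) + (shift * sign if j in informative else 0.0)
                  for j in range(p)])
        y.append(label)
    return X, y


rng = random.Random(0)
informative = {0, 1, 2}  # hypothetical signature features
X_train, y_train = make_samples(200, 30, informative, 1.5, rng)
X_test, y_test = make_samples(100, 30, informative, 1.5, rng)

weights, intercept = fit_elastic_net_logistic(X_train, y_train)

# Features with non-zero coefficients form the selected signature.
selected = [j for j, wj in enumerate(weights) if wj != 0.0]

# Classify held-out samples at a 0.5 probability cutoff.
correct = 0
for row, label in zip(X_test, y_test):
    prob = sigmoid(intercept + sum(wj * xj for wj, xj in zip(weights, row)))
    correct += int((prob >= 0.5) == bool(label))
accuracy = correct / len(y_test)

print("selected features:", selected)
print("held-out accuracy:", accuracy)
```

In this parameterization `alpha` sets the balance between the L1 and L2 terms and `lam` sets the overall penalty strength; a larger `lam` yields a smaller selected set. In practice both would be tuned by cross-validation rather than fixed as they are here.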

SUBMITTER: Ammar R 

PROVIDER: S-EPMC6472794 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

Publications

A robust data-driven genomic signature for idiopathic pulmonary fibrosis with applications for translational model selection.

Ammar R, Sivakumar P, Jarai G, Thompson JR

PLoS One, 2019-04-18, issue 4

Similar Datasets

| S-EPMC4370242 | biostudies-literature
| S-EPMC4720265 | biostudies-literature
| S-EPMC7985078 | biostudies-literature
| S-EPMC3229869 | biostudies-literature
2014-03-04 | E-GEOD-45686 | biostudies-arrayexpress
2023-01-01 | GSE195770 | GEO
| S-EPMC4654815 | biostudies-literature
2020-05-13 | PXD010965 | Pride
2019-05-29 | GSE116086 | GEO