Transcriptomics

Dataset Information

0

Complex Disease Subtypes Identified by Network-Based Clustering of Gene Expression Data: Application to COPD


ABSTRACT: One of the most common smoking-related diseases, chronic obstructive pulmonary disease (COPD), results from a dysregulated, multi-tissue inflammatory response to cigarette smoke. We hypothesized that systemic inflammatory signals in genome-wide blood gene expression can identify clinically important COPD-related disease subtypes, and we leveraged pre-existing gene interaction networks to guide unsupervised clustering of blood microarray expression data. Using network-informed non-negative matrix factorization, we analyzed genome-wide blood gene expression from 229 former smokers in the ECLIPSE Study, and we identified novel, clinically relevant molecular subtypes of COPD. These network-informed clusters were more stable and more strongly associated with measures of lung structure and function than clusters derived from a network-naïve approach, and they were associated with subtype-specific enrichment for inflammatory and protein catabolic pathways. These clusters were successfully reproduced in an independent sample of 135 smokers from the COPDGene Study. Briefly, gene expression was derived from whole blood samples in ECLIPSE subjects and peripheral blood mononuclear cells (PBMCs) for the COPDGene subjects. Gene expression profiling was performed using the Affymetrix Human U133 Plus2 array. Gene expression data were log-transformed, and background correction and normalization were performed for the merged ECLIPSE and COPDGene samples using robust multi-array averaging and quantile normalization as implemented in the affy Bioconductor package[27]. Of the 136 COPDGene subjects reported in a previous publication[13], one self-reported African-American subject was removed from analysis, which was conducted on the remaining 135 non-Hispanic white subjects. To identify a set of genes associated with COPD, we performed differential expression analysis for 38,519 probesets in ECLIPSE that passed quality control measures. Normalized probeset intensities were related to measures indicative of two primary dimensions of pulmonary impairment in COPD airway obstruction as indicated by two measures of spirometric lung function (FEV1 (% of predicted) and FEV1/FVC) and lung parenchymal destruction, i.e., emphysema (as quantified by the percentage of low attenuation area less than -950 Hounsfield units on lung computed tomography, %LAA-950). The analysis was conducted using the limma Bioconductor package, and the false discovery rate was controlled at 5%. The following covariates were included in the differential expression analysis age, pack-years of cigarette smoke exposure, and gender.

ORGANISM(S): Homo sapiens

PROVIDER: GSE76705 | GEO | 2016/01/12

SECONDARY ACCESSION(S): PRJNA308432

REPOSITORIES: GEO

Dataset's files

Source:
Action DRS
Other
Items per page:
1 - 1 of 1

Similar Datasets

2016-01-12 | E-GEOD-76705 | biostudies-arrayexpress
2008-05-31 | GSE8581 | GEO
2008-06-18 | E-GEOD-8581 | biostudies-arrayexpress
| phs001252 | dbGaP
2023-10-01 | GSE244036 | GEO
2015-07-23 | E-GEOD-71220 | biostudies-arrayexpress
2019-04-09 | GSE124180 | GEO
2017-04-21 | ST000625 | MetabolomicsWorkbench
2017-04-21 | ST000626 | MetabolomicsWorkbench
2017-04-21 | ST000627 | MetabolomicsWorkbench