Unknown

Dataset Information

0

Biasogram: visualization of confounding technical bias in gene expression data.


ABSTRACT: Gene expression profiles of clinical cohorts can be used to identify genes that are correlated with a clinical variable of interest such as patient outcome or response to a particular drug. However, expression measurements are susceptible to technical bias caused by variation in extraneous factors such as RNA quality and array hybridization conditions. If such technical bias is correlated with the clinical variable of interest, the likelihood of identifying false positive genes is increased. Here we describe a method to visualize an expression matrix as a projection of all genes onto a plane defined by a clinical variable and a technical nuisance variable. The resulting plot indicates the extent to which each gene is correlated with the clinical variable or the technical variable. We demonstrate this method by applying it to three clinical trial microarray data sets, one of which identified genes that may have been driven by a confounding technical variable. This approach can be used as a quality control step to identify data sets that are likely to yield false positive results.

SUBMITTER: Krzystanek M 

PROVIDER: S-EPMC3628873 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Biasogram: visualization of confounding technical bias in gene expression data.

Krzystanek Marcin M   Szallasi Zoltan Z   Eklund Aron C AC  

PloS one 20130416 4


Gene expression profiles of clinical cohorts can be used to identify genes that are correlated with a clinical variable of interest such as patient outcome or response to a particular drug. However, expression measurements are susceptible to technical bias caused by variation in extraneous factors such as RNA quality and array hybridization conditions. If such technical bias is correlated with the clinical variable of interest, the likelihood of identifying false positive genes is increased. Her  ...[more]

Similar Datasets

| S-EPMC1124898 | biostudies-literature
2020-03-23 | GSE130450 | GEO
| S-EPMC2711113 | biostudies-literature
| S-EPMC5541427 | biostudies-literature
| S-EPMC7243731 | biostudies-literature
| S-EPMC6004614 | biostudies-literature
| S-EPMC2374720 | biostudies-literature
| S-EPMC329133 | biostudies-literature
| S-EPMC7176054 | biostudies-literature