Dataset Information

Correction for hidden confounders in the genetic analysis of gene expression.

ABSTRACT: Understanding the genetic underpinnings of disease is important for screening, treatment, drug development, and basic biological insight. One way of getting at such an understanding is to find out which parts of our DNA, such as single-nucleotide polymorphisms, affect particular intermediary processes such as gene expression. Naively, such associations can be identified using a simple statistical test on all paired combinations of genetic variants and gene transcripts. However, a wide variety of confounders lie hidden in the data, leading to both spurious associations and missed associations if not properly addressed. We present a statistical model that jointly corrects for two particular kinds of hidden structure--population structure (e.g., race, family-relatedness), and microarray expression artifacts (e.g., batch effects), when these confounders are unknown. Applying our method to both real and synthetic, human and mouse data, we demonstrate the need for such a joint correction of confounders, and also the disadvantages of other possible approaches based on those in the current literature. In particular, we show that our class of models has maximum power to detect eQTL on synthetic data, and has the best performance on a bronze standard applied to real data. Lastly, our software and the associations we found with it are available at http://www.microsoft.com/science.

SUBMITTER: Listgarten J

PROVIDER: S-EPMC2944732 | biostudies-literature | 2010 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Correction for hidden confounders in the genetic analysis of gene expression.

Listgarten Jennifer J Kadie Carl C Schadt Eric E EE Heckerman David D

Proceedings of the National Academy of Sciences of the United States of America 20100901 38

Understanding the genetic underpinnings of disease is important for screening, treatment, drug development, and basic biological insight. One way of getting at such an understanding is to find out which parts of our DNA, such as single-nucleotide polymorphisms, affect particular intermediary processes such as gene expression. Naively, such associations can be identified using a simple statistical test on all paired combinations of genetic variants and gene transcripts. However, a wide variety of ...[more]

PMID: 20810919

Dataset Information

Correction for hidden confounders in the genetic analysis of gene expression.

Publications

Correction for hidden confounders in the genetic analysis of gene expression.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

On the problem of confounders in modeling gene expression.
| S-EPMC6530814 | biostudies-literature

Correction: Evaluation of Schistosome Promoter Expression for Transgenesis and Genetic Analysis.
| S-EPMC6377129 | biostudies-other

Efficient and accurate causal inference with hidden confounders from genome-transcriptome variation data.
| S-EPMC5576763 | biostudies-literature

A powerful and efficient set test for genetic markers that handles confounders.
| S-EPMC3673214 | biostudies-other

Meta-analysis of gene expression signatures reveals hidden links among diverse biological processes in Arabidopsis.
| S-EPMC4232243 | biostudies-literature

Liver transcriptome profiling and functional analysis of intrauterine growth restriction (IUGR) piglets reveals a genetic correction and sexual-dimorphic gene expression during postnatal development.
| S-EPMC7545842 | biostudies-literature

Correction: Metagenomic and Metatranscriptomic Analysis of Microbial Community Structure and Gene Expression of Activated Sludge.
| S-EPMC7703942 | biostudies-literature

Overcoming the Undesirable CRISPR-Cas9 Expression in Gene Correction.
| S-EPMC6278715 | biostudies-literature

Correction of indels in the atpB gene by genetic recoding in the chloroplast of Oenothera and tobacco
2022-02-15 | PXD020246 | Pride

Genetic analysis of radiation-induced changes in human gene expression.
| S-EPMC3005325 | biostudies-literature