Unknown

Dataset Information

0

A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies.


ABSTRACT: Intra-sample cellular heterogeneity presents numerous challenges to the identification of biomarkers in large Epigenome-Wide Association Studies (EWAS). While a number of reference-based deconvolution algorithms have emerged, their potential remains underexplored and a comparative evaluation of these algorithms beyond tissues such as blood is still lacking.Here we present a novel framework for reference-based inference, which leverages cell-type specific DNAse Hypersensitive Site (DHS) information from the NIH Epigenomics Roadmap to construct an improved reference DNA methylation database. We show that this leads to a marginal but statistically significant improvement of cell-count estimates in whole blood as well as in mixtures involving epithelial cell-types. Using this framework we compare a widely used state-of-the-art reference-based algorithm (called constrained projection) to two non-constrained approaches including CIBERSORT and a method based on robust partial correlations. We conclude that the widely-used constrained projection technique may not always be optimal. Instead, we find that the method based on robust partial correlations is generally more robust across a range of different tissue types and for realistic noise levels. We call the combined algorithm which uses DHS data and robust partial correlations for inference, EpiDISH (Epigenetic Dissection of Intra-Sample Heterogeneity). Finally, we demonstrate the added value of EpiDISH in an EWAS of smoking.Estimating cell-type fractions and subsequent inference in EWAS may benefit from the use of non-constrained reference-based cell-type deconvolution methods.

SUBMITTER: Teschendorff AE 

PROVIDER: S-EPMC5307731 | biostudies-literature | 2017 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies.

Teschendorff Andrew E AE   Breeze Charles E CE   Zheng Shijie C SC   Beck Stephan S  

BMC bioinformatics 20170213 1


<h4>Background</h4>Intra-sample cellular heterogeneity presents numerous challenges to the identification of biomarkers in large Epigenome-Wide Association Studies (EWAS). While a number of reference-based deconvolution algorithms have emerged, their potential remains underexplored and a comparative evaluation of these algorithms beyond tissues such as blood is still lacking.<h4>Results</h4>Here we present a novel framework for reference-based inference, which leverages cell-type specific DNAse  ...[more]

Similar Datasets

| S-EPMC5548182 | biostudies-literature
| S-EPMC4053810 | biostudies-literature
| S-EPMC8086103 | biostudies-literature
| S-EPMC5551540 | biostudies-literature
| S-EPMC6629651 | biostudies-literature
| S-EPMC8645110 | biostudies-literature
| S-EPMC8594691 | biostudies-literature
| S-EPMC4855979 | biostudies-literature
| S-EPMC5813244 | biostudies-literature
| S-EPMC6035894 | biostudies-literature