Unknown

Dataset Information

0

Identification of cell type-specific methylation signals in bulk whole genome bisulfite sequencing data.


ABSTRACT:

Background

The traditional approach to studying the epigenetic mechanism CpG methylation in tissue samples is to identify regions of concordant differential methylation spanning multiple CpG sites (differentially methylated regions). Variation limited to single or small numbers of CpGs has been assumed to reflect stochastic processes. To test this, we developed software, Cluster-Based analysis of CpG methylation (CluBCpG), and explored variation in read-level CpG methylation patterns in whole genome bisulfite sequencing data.

Results

Analysis of both human and mouse whole genome bisulfite sequencing datasets reveals read-level signatures associated with cell type and cell type-specific biological processes. These signatures, which are mostly orthogonal to classical differentially methylated regions, are enriched at cell type-specific enhancers and allow estimation of proportional cell composition in synthetic mixtures and improved prediction of gene expression. In tandem, we developed a machine learning algorithm, Precise Read-Level Imputation of Methylation (PReLIM), to increase coverage of existing whole genome bisulfite sequencing datasets by imputing CpG methylation states on individual sequencing reads. PReLIM both improves CluBCpG coverage and performance and enables identification of novel differentially methylated regions, which we independently validate.

Conclusions

Our data indicate that, rather than stochastic variation, read-level CpG methylation patterns in tissue whole genome bisulfite sequencing libraries reflect cell type. Accordingly, these new computational tools should lead to an improved understanding of epigenetic regulation by DNA methylation.

SUBMITTER: Scott CA 

PROVIDER: S-EPMC7329512 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identification of cell type-specific methylation signals in bulk whole genome bisulfite sequencing data.

Scott C Anthony CA   Duryea Jack D JD   MacKay Harry H   Baker Maria S MS   Laritsky Eleonora E   Gunasekara Chathura J CJ   Coarfa Cristian C   Waterland Robert A RA  

Genome biology 20200701 1


<h4>Background</h4>The traditional approach to studying the epigenetic mechanism CpG methylation in tissue samples is to identify regions of concordant differential methylation spanning multiple CpG sites (differentially methylated regions). Variation limited to single or small numbers of CpGs has been assumed to reflect stochastic processes. To test this, we developed software, Cluster-Based analysis of CpG methylation (CluBCpG), and explored variation in read-level CpG methylation patterns in  ...[more]

Similar Datasets

| S-BSST612 | biostudies-other
| S-BSST618 | biostudies-other
| S-EPMC3938178 | biostudies-literature
| S-EPMC5320668 | biostudies-literature
| S-EPMC4623491 | biostudies-literature
| S-EPMC10963232 | biostudies-literature
| S-EPMC7359584 | biostudies-literature
| S-EPMC4344394 | biostudies-literature
| S-EPMC3592415 | biostudies-literature
| S-EPMC5856372 | biostudies-literature