Unknown

Dataset Information

0

An integrative approach for efficient analysis of whole genome bisulfite sequencing data.


ABSTRACT:

Background

Whole genome bisulfite sequencing (WGBS) is a high-throughput technique for profiling genome-wide DNA methylation at single nucleotide resolution. However, the applications of WGBS are limited by low accuracy resulting from bisulfite-induced damage on DNA fragments. Although many computer programs have been developed for accurate detecting, most of the programs have barely succeeded in improving either quantity or quality of the methylation results. To improve both, we attempted to develop a novel integration of most widely used bisulfite-read mappers: Bismark, BSMAP, and BS-seeker2.

Results

A comprehensive analysis of the three mappers revealed that the mapping results of the mappers were mutually complementary under diverse read conditions. Therefore, we sought to integrate the characteristics of the mappers by scoring them to gain robustness against artifacts. As a result, the integration significantly increased detection accuracy compared with the individual mappers. In addition, the amount of detected cytosine was higher than that by Bismark. Furthermore, the integration successfully reduced the fluctuation of detection accuracy induced by read conditions. We applied the integration to real WGBS samples and succeeded in classifying the samples according to the originated tissues by both CpG and CpH methylation patterns.

Conclusions

In this study, we improved both quality and quantity of methylation results from WGBS data by integrating the mapping results of three bisulfite-read mappers. Also, we succeeded in combining and comparing WGBS samples by reducing the effects of read heterogeneity on methylation detection. This study contributes to DNA methylation researches by improving efficiency of methylation detection from WGBS data and facilitating the comprehensive analysis of public WGBS data.

SUBMITTER: Lee JH 

PROVIDER: S-EPMC4682396 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

An integrative approach for efficient analysis of whole genome bisulfite sequencing data.

Lee Jong-Hun JH   Park Sung-Joon SJ   Kenta Nakai N  

BMC genomics 20151209


<h4>Background</h4>Whole genome bisulfite sequencing (WGBS) is a high-throughput technique for profiling genome-wide DNA methylation at single nucleotide resolution. However, the applications of WGBS are limited by low accuracy resulting from bisulfite-induced damage on DNA fragments. Although many computer programs have been developed for accurate detecting, most of the programs have barely succeeded in improving either quantity or quality of the methylation results. To improve both, we attempt  ...[more]

Similar Datasets

| S-EPMC5842653 | biostudies-literature
| S-EPMC5378105 | biostudies-literature
| S-EPMC4063866 | biostudies-literature
| S-EPMC5905984 | biostudies-literature
| S-EPMC6821270 | biostudies-literature
| S-EPMC7195798 | biostudies-literature
| S-EPMC4204604 | biostudies-literature
| S-EPMC4682368 | biostudies-literature
| S-EPMC3575794 | biostudies-literature
| S-EPMC4344394 | biostudies-literature