Unknown

Dataset Information

0

A new parallel pipeline for DNA methylation analysis of long reads datasets.


ABSTRACT: DNA methylation is an important mechanism of epigenetic regulation in development and disease. New generation sequencers allow genome-wide measurements of the methylation status by reading short stretches of the DNA sequence (Methyl-seq). Several software tools for methylation analysis have been proposed over recent years. However, the current trend is that the new sequencers and the ones expected for an upcoming future yield sequences of increasing length, making these software tools inefficient and obsolete.In this paper, we propose a new software based on a strategy for methylation analysis of Methyl-seq sequencing data that requires much shorter execution times while yielding a better level of sensitivity, particularly for datasets composed of long reads. This strategy can be exported to other methylation, DNA and RNA analysis tools.The developed software tool achieves execution times one order of magnitude shorter than the existing tools, while yielding equal sensitivity for short reads and even better sensitivity for long reads.

SUBMITTER: Olanda R 

PROVIDER: S-EPMC5343294 | biostudies-literature | 2017 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

A new parallel pipeline for DNA methylation analysis of long reads datasets.

Olanda Ricardo R   Pérez Mariano M   Orduña Juan M JM   Tárraga Joaquín J   Dopazo Joaquín J  

BMC bioinformatics 20170309 1


<h4>Background</h4>DNA methylation is an important mechanism of epigenetic regulation in development and disease. New generation sequencers allow genome-wide measurements of the methylation status by reading short stretches of the DNA sequence (Methyl-seq). Several software tools for methylation analysis have been proposed over recent years. However, the current trend is that the new sequencers and the ones expected for an upcoming future yield sequences of increasing length, making these softwa  ...[more]

Similar Datasets

| S-EPMC4147891 | biostudies-other
| S-EPMC7393826 | biostudies-literature
| S-EPMC4758075 | biostudies-literature
| S-EPMC4351923 | biostudies-literature
| S-EPMC9277797 | biostudies-literature
| S-EPMC8352503 | biostudies-literature
| S-EPMC10045170 | biostudies-literature
| S-EPMC6247925 | biostudies-literature
| S-EPMC9235486 | biostudies-literature
2014-08-22 | E-GEOD-55569 | biostudies-arrayexpress