An approximate Bayesian approach for mapping paired-end DNA reads to a reference genome.
Ontology highlight
ABSTRACT: SUMMARY: Many high-throughput sequencing experiments produce paired DNA reads. Paired-end DNA reads provide extra positional information that is useful in reliable mapping of short reads to a reference genome, as well as in downstream analyses of structural variations. Given the importance of paired-end alignments, it is surprising that there have been no previous publications focusing on this topic. In this article, we present a new probabilistic framework to predict the alignment of paired-end reads to a reference genome. Using both simulated and real data, we compare the performance of our method with six other read-mapping tools that provide a paired-end option. We show that our method provides a good combination of accuracy, error rate and computation time, especially in more challenging and practical cases, such as when the reference genome is incomplete or unavailable for the sample, or when there are large variations between the reference genome and the source of the reads. An open-source implementation of our method is available as part of Last, a multi-purpose alignment program freely available at http://last.cbrc.jp. CONTACT: martin@cbrc.jp SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
SUBMITTER: Shrestha AM
PROVIDER: S-EPMC3624798 | biostudies-literature | 2013 Apr
REPOSITORIES: biostudies-literature
ACCESS DATA