Unknown

Dataset Information

0

MMR: a tool for read multi-mapper resolution.


ABSTRACT:

Motivation

Mapping high-throughput sequencing data to a reference genome is an essential step for most analysis pipelines aiming at the computational analysis of genome and transcriptome sequencing data. Breaking ties between equally well mapping locations poses a severe problem not only during the alignment phase but also has significant impact on the results of downstream analyses. We present the multi-mapper resolution (MMR) tool that infers optimal mapping locations from the coverage density of other mapped reads.

Results

Filtering alignments with MMR can significantly improve the performance of downstream analyses like transcript quantitation and differential testing. We illustrate that the accuracy (Spearman correlation) of transcript quantification increases by 15% when using reads of length 51. In addition, MMR decreases the alignment file sizes by more than 50%, and this leads to a reduced running time of the quantification tool. Our efficient implementation of the MMR algorithm is easily applicable as a post-processing step to existing alignment files in BAM format. Its complexity scales linearly with the number of alignments and requires no further inputs.

Availability and implementation

Open source code and documentation are available for download at http://github.com/ratschlab/mmr Comprehensive testing results and further information can be found at http://bioweb.me/mmr.

Contact

andre.kahles@ratschlab.org or gunnar.ratsch@ratschlab.org

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Kahles A 

PROVIDER: S-EPMC4795617 | biostudies-literature | 2016 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

MMR: a tool for read multi-mapper resolution.

Kahles André A   Behr Jonas J   Rätsch Gunnar G  

Bioinformatics (Oxford, England) 20151030 5


<h4>Motivation</h4>Mapping high-throughput sequencing data to a reference genome is an essential step for most analysis pipelines aiming at the computational analysis of genome and transcriptome sequencing data. Breaking ties between equally well mapping locations poses a severe problem not only during the alignment phase but also has significant impact on the results of downstream analyses. We present the multi-mapper resolution (MMR) tool that infers optimal mapping locations from the coverage  ...[more]

Similar Datasets

| S-EPMC7320720 | biostudies-literature
| S-EPMC7034980 | biostudies-literature
| S-EPMC5657049 | biostudies-literature
| S-EPMC1523221 | biostudies-literature
| S-EPMC6157080 | biostudies-other
| S-EPMC5411769 | biostudies-literature
2014-09-25 | E-GEOD-57862 | biostudies-arrayexpress
| S-EPMC5846869 | biostudies-other