Unknown

Dataset Information

0

URMAP, an ultra-fast read mapper.


ABSTRACT: Mapping of reads to reference sequences is an essential step in a wide range of biological studies. The large size of datasets generated with next-generation sequencing technologies motivates the development of fast mapping software. Here, I describe URMAP, a new read mapping algorithm. URMAP is an order of magnitude faster than BWA with comparable accuracy on several validation tests. On a Genome in a Bottle (GIAB) variant calling test with 30× coverage 2×150 reads, URMAP achieves high accuracy (precision 0.998, sensitivity 0.982 and F-measure 0.990) with the strelka2 caller. However, GIAB reference variants are shown to be biased against repetitive regions which are difficult to map and may therefore pose an unrealistically easy challenge to read mappers and variant callers.

SUBMITTER: Edgar R 

PROVIDER: S-EPMC7320720 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

URMAP, an ultra-fast read mapper.

Edgar Robert R  

PeerJ 20200624


Mapping of reads to reference sequences is an essential step in a wide range of biological studies. The large size of datasets generated with next-generation sequencing technologies motivates the development of fast mapping software. Here, I describe URMAP, a new read mapping algorithm. URMAP is an order of magnitude faster than BWA with comparable accuracy on several validation tests. On a Genome in a Bottle (GIAB) variant calling test with 30× coverage 2×150 reads, URMAP achieves high accuracy  ...[more]

Similar Datasets

| S-EPMC9753264 | biostudies-literature
| S-EPMC4795617 | biostudies-literature
| S-EPMC10883419 | biostudies-literature
| S-EPMC4866519 | biostudies-literature
2016-12-06 | GSE60865 | GEO
| S-EPMC3822393 | biostudies-literature
| S-EPMC5850834 | biostudies-literature
| S-EPMC5860201 | biostudies-literature
2023-08-08 | GSE237874 | GEO
| S-EPMC3322381 | biostudies-literature