Unknown

Dataset Information

0

Testing of Alignment Parameters for Ancient Samples: Evaluating and Optimizing Mapping Parameters for Ancient Samples Using the TAPAS Tool.


ABSTRACT: High-throughput sequence data retrieved from ancient or other degraded samples has led to unprecedented insights into the evolutionary history of many species, but the analysis of such sequences also poses specific computational challenges. The most commonly used approach involves mapping sequence reads to a reference genome. However, this process becomes increasingly challenging with an elevated genetic distance between target and reference or with the presence of contaminant sequences with high sequence similarity to the target species. The evaluation and testing of mapping efficiency and stringency are thus paramount for the reliable identification and analysis of ancient sequences. In this paper, we present 'TAPAS', (Testing of Alignment Parameters for Ancient Samples), a computational tool that enables the systematic testing of mapping tools for ancient data by simulating sequence data reflecting the properties of an ancient dataset and performing test runs using the mapping software and parameter settings of interest. We showcase TAPAS by using it to assess and improve mapping strategy for a degraded sample from a banded linsang (Prionodon linsang), for which no closely related reference is currently available. This enables a 1.8-fold increase of the number of mapped reads without sacrificing mapping specificity. The increase of mapped reads effectively reduces the need for additional sequencing, thus making more economical use of time, resources, and sample material.

SUBMITTER: Taron UH 

PROVIDER: S-EPMC5867878 | biostudies-literature | 2018 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Testing of Alignment Parameters for Ancient Samples: Evaluating and Optimizing Mapping Parameters for Ancient Samples Using the TAPAS Tool.

Taron Ulrike H UH   Lell Moritz M   Barlow Axel A   Paijmans Johanna L A JLA  

Genes 20180313 3


High-throughput sequence data retrieved from ancient or other degraded samples has led to unprecedented insights into the evolutionary history of many species, but the analysis of such sequences also poses specific computational challenges. The most commonly used approach involves mapping sequence reads to a reference genome. However, this process becomes increasingly challenging with an elevated genetic distance between target and reference or with the presence of contaminant sequences with hig  ...[more]

Similar Datasets

| S-EPMC10444664 | biostudies-literature
| S-EPMC6454472 | biostudies-literature
2024-10-10 | PXD050548 | Pride
| S-EPMC8712333 | biostudies-literature
| S-EPMC5409310 | biostudies-literature
| S-EPMC5026160 | biostudies-literature
| S-EPMC8796372 | biostudies-literature
| S-EPMC10513296 | biostudies-literature
| S-EPMC2829014 | biostudies-literature
| S-EPMC3799479 | biostudies-literature