Unknown

Dataset Information

0

ARIADNA: machine learning method for ancient DNA variant discovery.


ABSTRACT: Ancient DNA (aDNA) studies often rely on standard methods of mutation calling, optimized for high-quality contemporary DNA but not for excessive contamination, time- or environment-related damage of aDNA. In the absence of validated datasets and despite showing extreme sensitivity to aDNA quality, these methods have been used in many published studies, sometimes with additions of arbitrary filters or modifications, designed to overcome aDNA degradation and contamination problems. The general lack of best practices for aDNA mutation calling may lead to inaccurate results. To address these problems, we present ARIADNA (ARtificial Intelligence for Ancient DNA), a novel approach based on machine learning techniques, using specific aDNA characteristics as features to yield improved mutation calls. In our comparisons of variant callers across several ancient genomes, ARIADNA consistently detected higher-quality genome variants with fast runtimes, while reducing the false positive rate compared with other approaches.

SUBMITTER: Kawash JK 

PROVIDER: S-EPMC6289774 | biostudies-literature | 2018 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

ARIADNA: machine learning method for ancient DNA variant discovery.

Kawash Joseph K JK   Smith Sean D SD   Karaiskos Spyros S   Grigoriev Andrey A  

DNA research : an international journal for rapid publication of reports on genes and genomes 20181201 6


Ancient DNA (aDNA) studies often rely on standard methods of mutation calling, optimized for high-quality contemporary DNA but not for excessive contamination, time- or environment-related damage of aDNA. In the absence of validated datasets and despite showing extreme sensitivity to aDNA quality, these methods have been used in many published studies, sometimes with additions of arbitrary filters or modifications, designed to overcome aDNA degradation and contamination problems. The general lac  ...[more]

Similar Datasets

2022-10-01 | GSE200096 | GEO
2020-09-01 | E-MTAB-9501 | biostudies-arrayexpress
2021-07-09 | GSE163896 | GEO
2023-06-01 | GSE193400 | GEO
| S-EPMC7581251 | biostudies-literature
| S-EPMC1955739 | biostudies-literature
| S-EPMC8085602 | biostudies-literature
| S-EPMC8356896 | biostudies-literature
| S-EPMC8062585 | biostudies-literature
| S-EPMC6428806 | biostudies-literature