Unknown

Dataset Information

0

A curated dataset of modern and ancient high-coverage shotgun human genomes.


ABSTRACT: Over the last few years, genome-wide data for a large number of ancient human samples have been collected. Whilst datasets of captured SNPs have been collated, high coverage shotgun genomes (which are relatively few but allow certain types of analyses not possible with ascertained captured SNPs) have to be reprocessed by individual groups from raw reads. This task is computationally intensive. Here, we release a dataset including 35 whole-genome sequenced samples, previously published and distributed worldwide, together with the genetic pipeline used to process them. The dataset contains 72,041,355 sites called across 19 ancient and 16 modern individuals and includes sequence data from four previously published ancient samples which we sequenced to higher coverage (10-18x). Such a resource will allow researchers to analyse their new samples with the same genetic pipeline and directly compare them to the reference dataset without re-processing published samples. Moreover, this dataset can be easily expanded to increase the sample distribution both across time and space.

SUBMITTER: Maisano Delser P 

PROVIDER: S-EPMC8338957 | biostudies-literature | 2021 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

A curated dataset of modern and ancient high-coverage shotgun human genomes.

Maisano Delser Pierpaolo P   Jones Eppie R ER   Hovhannisyan Anahit A   Cassidy Lara L   Pinhasi Ron R   Manica Andrea A  

Scientific data 20210804 1


Over the last few years, genome-wide data for a large number of ancient human samples have been collected. Whilst datasets of captured SNPs have been collated, high coverage shotgun genomes (which are relatively few but allow certain types of analyses not possible with ascertained captured SNPs) have to be reprocessed by individual groups from raw reads. This task is computationally intensive. Here, we release a dataset including 35 whole-genome sequenced samples, previously published and distri  ...[more]

Similar Datasets

| S-EPMC10858950 | biostudies-literature
| S-EPMC10027547 | biostudies-literature
| S-EPMC3468387 | biostudies-literature
| S-EPMC9858685 | biostudies-literature
| S-EPMC7596702 | biostudies-literature
| S-EPMC8553948 | biostudies-literature
| S-EPMC10282092 | biostudies-literature
| S-EPMC7930196 | biostudies-literature
| S-EPMC4586848 | biostudies-literature
| S-EPMC9972692 | biostudies-literature