Unknown

Dataset Information

0

Reference genotype and exome data from an Australian Aboriginal population for health-based research.


ABSTRACT: Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ?80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians.

SUBMITTER: Tang D 

PROVIDER: S-EPMC4828942 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

altmetric image

Publications

Reference genotype and exome data from an Australian Aboriginal population for health-based research.

Tang Dave D   Anderson Denise D   Francis Richard W RW   Syn Genevieve G   Jamieson Sarra E SE   Lassmann Timo T   Blackwell Jenefer M JM  

Scientific data 20160412


Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucle  ...[more]

Similar Datasets

| S-EPMC7190730 | biostudies-literature
| EGAS00001001585 | EGA
| EGAS00001003745 | EGA
| S-EPMC7578642 | biostudies-literature
| S-EPMC9354439 | biostudies-literature
| S-EPMC3942517 | biostudies-literature
| S-EPMC7223353 | biostudies-literature
| S-EPMC6478284 | biostudies-literature
| S-EPMC5347126 | biostudies-literature
| S-EPMC6975592 | biostudies-literature