Genomics

Dataset Information

6

Genotype and exome data for an Australian Aboriginal population: a reference panel for health-based research


ABSTRACT: Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalog of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in 80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found 99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians.

PROVIDER: EGAS00001001585 | EGA |

REPOSITORIES: EGA

altmetric image

Publications

Reference genotype and exome data from an Australian Aboriginal population for health-based research.

Tang Dave D   Anderson Denise D   Francis Richard W RW   Syn Genevieve G   Jamieson Sarra E SE   Lassmann Timo T   Blackwell Jenefer M JM  

Scientific data 20160412


Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucle  ...[more]

Similar Datasets

2018-08-06 | GSE115017 | GEO
| EGAS00001001766 | EGA
2014-09-01 | E-GEOD-45204 | biostudies-arrayexpress
| EGAS00001003745 | EGA
2013-05-01 | E-GEOD-37870 | biostudies-arrayexpress
2014-09-01 | GSE45204 | GEO
| EGAS00001003359 | EGA
2010-01-24 | E-GEOD-19986 | biostudies-arrayexpress
2013-05-01 | GSE37870 | GEO
2013-04-02 | E-GEOD-33294 | biostudies-arrayexpress