
Dataset Information

Extraction and annotation of human mitochondrial genomes from 1000 Genomes Whole Exome Sequencing data.


ABSTRACT: BACKGROUND: Whole Exome Sequencing (WES) is one of the most widely used and cost-effective next-generation sequencing technologies, allowing all nuclear exons to be sequenced. Off-target regions may be captured if they show high sequence similarity with the baits. Bioinformatics tools have been optimized to retrieve a large amount of off-target mitochondrial DNA (mtDNA) from WES data by exploiting the non-specificity of probes that partially overlap Nuclear mitochondrial Sequences (NumtS). The 1000 Genomes project is one of the largest resources from which mtDNA sequences can be extracted from WES data, given the large effort the scientific community is making to reconstruct human population history using mtDNA as a marker, and the involvement of mtDNA in pathology. RESULTS: A previously published pipeline for assembling mitochondrial genomes from off-target WES reads, further improved to detect insertions and deletions (indels) and heteroplasmy, was applied to a dataset of 1242 samples from the 1000 Genomes project and yielded a nearly complete mitochondrial genome for 943 samples (76% of the analyzed exomes). The robustness of our computational strategy was highlighted by the reduction, due to NumtS filtering, in the number of reads recognized as mitochondrial compared with the original annotation produced by the Consortium. CONCLUSIONS: To the best of our knowledge, this is likely the most extensive population-scale mitochondrial genotyping in humans, enriched with the estimation of heteroplasmies.

SUBMITTER: Diroma MA 

PROVIDER: S-EPMC4083402 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature


Publications

Extraction and annotation of human mitochondrial genomes from 1000 Genomes Whole Exome Sequencing data.

Diroma MA, Calabrese C, Simone D, Santorsola M, Calabrese FM, Gasparre G, Attimonelli M

BMC Genomics, 2014-05-06



Similar Datasets

| S-EPMC3819389 | biostudies-literature
| S-EPMC5818140 | biostudies-literature
| S-EPMC8686061 | biostudies-literature
| S-EPMC4253833 | biostudies-other
| PRJNA59773 | ENA
| S-EPMC4929867 | biostudies-other
| S-EPMC4489280 | biostudies-literature
| S-EPMC5549930 | biostudies-other
| S-EPMC5699183 | biostudies-literature
| S-EPMC3563612 | biostudies-literature