Dataset Information

Revealing large metagenomic regions through long DNA fragment hybridization capture.


ABSTRACT: High-throughput DNA sequencing technologies have revolutionized genomic analysis, including the de novo assembly of whole genomes from single organisms or metagenomic samples. However, because short-read sequence data have limited capacity to assemble complex or low-coverage regions, genomes are typically fragmented, leading to draft genomes with numerous underexplored large genomic regions. Revealing these missing sequences is a major goal to resolve concerns in numerous biological studies. To overcome these limitations, we developed an innovative target enrichment method for the reconstruction of large unknown genomic regions. Based on a hybridization capture strategy, this approach enables the enrichment of large genomic regions, allowing the reconstruction of tens of kilobase pairs flanking a short, targeted DNA sequence. Applied to a metagenomic soil sample targeting the linA gene, the biomarker of hexachlorocyclohexane (HCH) degradation, our method permitted the enrichment of the gene and its flanking regions, leading to the reconstruction of several contigs and complete plasmids exceeding tens of kilobase pairs surrounding linA. Thus, through gene association and genome reconstruction, we identified microbial species involved in HCH degradation, which constitute targets to improve biostimulation treatments. This new hybridization capture strategy makes surveying and deconvoluting complex genomic regions possible through the enrichment of large genomic regions and allows the efficient exploration of metagenomic diversity. Indeed, this approach makes it possible to assign identity and function to microorganisms in natural environments, one of the ultimate goals of microbial ecology.

SUBMITTER: Gasc C 

PROVIDER: S-EPMC5351058 | biostudies-literature | 2017 Mar

REPOSITORIES: biostudies-literature

Publications

Revealing large metagenomic regions through long DNA fragment hybridization capture.

Gasc Cyrielle, Peyret Pierre

Microbiome 20170314 1


Similar Datasets

| S-EPMC8767324 | biostudies-literature
| S-EPMC6526642 | biostudies-literature
| S-ECPF-GEOD-21068 | biostudies-other
| S-EPMC7909704 | biostudies-literature
| S-EPMC2877570 | biostudies-literature
2010-04-29 | E-GEOD-21068 | biostudies-arrayexpress
| S-EPMC4938946 | biostudies-literature
| S-EPMC6459864 | biostudies-literature
2010-04-29 | GSE21068 | GEO
| S-EPMC4099319 | biostudies-literature