Unknown

Dataset Information

0

High-throughput genome scaffolding from in vivo DNA interaction frequency.


ABSTRACT: Despite advances in DNA sequencing technology, assembly of complex genomes remains a major challenge, particularly for genomes sequenced using short reads, which yield highly fragmented assemblies. Here we show that genome-wide in vivo chromatin interaction frequency data, which are measurable with chromosome conformation capture-based experiments, can be used as genomic distance proxies to accurately position individual contigs without requiring any sequence overlap. We also use these data to construct approximate genome scaffolds de novo. Applying our approach to incomplete regions of the human genome, we predict the positions of 65 previously unplaced contigs, in agreement with alternative methods in 26/31 cases attempted in common. Our approach can theoretically bridge any gap size and should be applicable to any species for which global chromatin interaction data can be generated.

SUBMITTER: Kaplan N 

PROVIDER: S-EPMC3880131 | biostudies-literature | 2013 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

High-throughput genome scaffolding from in vivo DNA interaction frequency.

Kaplan Noam N   Dekker Job J  

Nature biotechnology 20131124 12


Despite advances in DNA sequencing technology, assembly of complex genomes remains a major challenge, particularly for genomes sequenced using short reads, which yield highly fragmented assemblies. Here we show that genome-wide in vivo chromatin interaction frequency data, which are measurable with chromosome conformation capture-based experiments, can be used as genomic distance proxies to accurately position individual contigs without requiring any sequence overlap. We also use these data to c  ...[more]

Similar Datasets

| S-EPMC1766414 | biostudies-literature
| S-EPMC310828 | biostudies-literature
| S-EPMC5025741 | biostudies-literature
| S-EPMC4806852 | biostudies-literature
| S-EPMC2442187 | biostudies-literature
| S-EPMC4596663 | biostudies-literature
| S-EPMC2941625 | biostudies-literature
| S-EPMC7387798 | biostudies-literature