Dataset Information

Rapid genome mapping in nanochannel arrays for highly complete and accurate de novo sequence assembly of the complex Aegilops tauschii genome.


ABSTRACT: Next-generation sequencing (NGS) technologies have enabled high-throughput and low-cost generation of sequence data; however, de novo genome assembly remains a great challenge, particularly for large genomes. NGS short reads are often insufficient to create large contigs that span repeat sequences and to facilitate unambiguous assembly. Plant genomes are notorious for containing high quantities of repetitive elements, which, combined with huge genome sizes, have so far made accurate assembly of these large and complex genomes intractable. Using two-color genome mapping of tiling bacterial artificial chromosome (BAC) clones on nanochannel arrays, we completed high-confidence assembly of a 2.1-Mb, highly repetitive region in the large and complex genome of Aegilops tauschii, the D-genome donor of hexaploid wheat (Triticum aestivum). Genome mapping is based on direct visualization of sequence motifs on single DNA molecules hundreds of kilobases in length. With the genome map as a scaffold, we anchored unplaced sequence contigs, validated the initial draft assembly, and resolved instances of misassembly, some involving contigs <2 kb long, to dramatically improve the assembly from 75% to 95% complete.

SUBMITTER: Hastie AR 

PROVIDER: S-EPMC3566107 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

Rapid genome mapping in nanochannel arrays for highly complete and accurate de novo sequence assembly of the complex Aegilops tauschii genome.

Hastie AR, Dong L, Smith A, Finklestein J, Lam ET, Huo N, Cao H, Kwok PY, Deal KR, Dvorak J, Luo MC, Gu Y, Xiao M

PLoS One, 2013-02-06, issue 2


Similar Datasets

| S-EPMC4701098 | biostudies-literature
| S-EPMC6374599 | biostudies-literature
| S-EPMC9562832 | biostudies-literature
| S-EPMC4855405 | biostudies-literature
| S-EPMC5004832 | biostudies-other
| S-EPMC6114841 | biostudies-literature
| S-EPMC7416625 | biostudies-literature
| S-EPMC10210529 | biostudies-literature
| S-EPMC4622089 | biostudies-literature
2018-06-08 | GSE115454 | GEO