Unknown

Dataset Information

0

Paired-end sequencing of long-range DNA fragments for de novo assembly of large, complex Mammalian genomes by direct intra-molecule ligation.


ABSTRACT:

Background

The relatively short read lengths from next generation sequencing (NGS) technologies still pose a challenge for de novo assembly of complex mammal genomes. One important solution is to use paired-end (PE) sequence information experimentally obtained from long-range DNA fragments (>1 kb). Here, we characterize and extend a long-range PE library construction method based on direct intra-molecule ligation (or molecular linker-free circularization) for NGS.

Results

We found that the method performs stably for PE sequencing of 2- to 5- kb DNA fragments, and can be extended to 10-20 kb (and even in extremes, up to ?35 kb). We also characterized the impact of low quality input DNA on the method, and develop a whole-genome amplification (WGA) based protocol using limited input DNA (<1 µg). Using this PE dataset, we accurately assembled the YanHuang (YH) genome, the first sequenced Asian genome, into a scaffold N50 size of >2 Mb, which is over 100-times greater than the initial size produced with only small insert PE reads(17 kb). In addition, we mapped two 7- to 8- kb insertions in the YH genome using the larger insert sizes of the long-range PE data.

Conclusions

In conclusion, we demonstrate here the effectiveness of this long-range PE sequencing method and its use for the de novo assembly of a large, complex genome using NGS short reads.

SUBMITTER: Asan 

PROVIDER: S-EPMC3459883 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

Paired-end sequencing of long-range DNA fragments for de novo assembly of large, complex Mammalian genomes by direct intra-molecule ligation.

Asan   Geng Chunyu C   Chen Yan Y   Wu Kui K   Cai Qingle Q   Wang Yu Y   Lang Yongshan Y   Cao Hongzhi H   Yang Huangming H   Wang Jian J   Zhang Xiuqing X  

PloS one 20120927 9


<h4>Background</h4>The relatively short read lengths from next generation sequencing (NGS) technologies still pose a challenge for de novo assembly of complex mammal genomes. One important solution is to use paired-end (PE) sequence information experimentally obtained from long-range DNA fragments (>1 kb). Here, we characterize and extend a long-range PE library construction method based on direct intra-molecule ligation (or molecular linker-free circularization) for NGS.<h4>Results</h4>We found  ...[more]

Similar Datasets

| S-EPMC3158087 | biostudies-literature
| S-EPMC6280916 | biostudies-literature
| S-EPMC3635884 | biostudies-literature
| S-EPMC3076424 | biostudies-literature
| S-EPMC4552406 | biostudies-literature
| S-EPMC5841375 | biostudies-literature
| S-EPMC4583709 | biostudies-literature
| S-EPMC4545970 | biostudies-literature
2010-07-13 | E-GEOD-22765 | biostudies-arrayexpress
2010-07-13 | GSE22765 | GEO