Unknown

Dataset Information

0

A new strategy for better genome assembly from very short reads.


ABSTRACT: BACKGROUND: With the rapid development of the next generation sequencing (NGS) technology, large quantities of genome sequencing data have been generated. Because of repetitive regions of genomes and some other factors, assembly of very short reads is still a challenging issue. RESULTS: A novel strategy for improving genome assembly from very short reads is proposed. It can increase accuracies of assemblies by integrating de novo contigs, and produce comparative contigs by allowing multiple references without limiting to genomes of closely related strains. Comparative contigs are used to scaffold de novo contigs. Using simulated and real datasets, it is shown that our strategy can effectively improve qualities of assemblies of isolated microbial genomes and metagenomes. CONCLUSIONS: With more and more reference genomes available, our strategy will be useful to improve qualities of genome assemblies from very short reads. Some scripts are provided to make our strategy applicable at http://code.google.com/p/cd-hybrid/.

SUBMITTER: Ji Y 

PROVIDER: S-EPMC3268122 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

A new strategy for better genome assembly from very short reads.

Ji Yan Y   Shi Yixiang Y   Ding Guohui G   Li Yixue Y  

BMC bioinformatics 20111230


<h4>Background</h4>With the rapid development of the next generation sequencing (NGS) technology, large quantities of genome sequencing data have been generated. Because of repetitive regions of genomes and some other factors, assembly of very short reads is still a challenging issue.<h4>Results</h4>A novel strategy for improving genome assembly from very short reads is proposed. It can increase accuracies of assemblies by integrating de novo contigs, and produce comparative contigs by allowing  ...[more]

Similar Datasets

| S-EPMC2813480 | biostudies-literature
| S-EPMC2529408 | biostudies-literature
| S-EPMC3158087 | biostudies-literature
| S-EPMC5173252 | biostudies-literature
| S-EPMC3092772 | biostudies-literature
| S-EPMC4120091 | biostudies-literature
| S-EPMC9508831 | biostudies-literature
2023-10-14 | GSE215355 | GEO
2023-10-14 | GSE215357 | GEO
| S-EPMC4779561 | biostudies-literature