Unknown

Dataset Information

0

Improving draft genome contiguity with reference-derived in silico mate-pair libraries.


ABSTRACT: Background:Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available. Findings:In order to improve genome contiguity, we have developed Cross-Species Scaffolding-a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico. Conclusions:We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ?30x coverage of shotgun sequencing data.

SUBMITTER: Grau JH 

PROVIDER: S-EPMC5967465 | biostudies-literature | 2018 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improving draft genome contiguity with reference-derived in silico mate-pair libraries.

Grau José Horacio JH   Hackl Thomas T   Koepfli Klaus-Peter KP   Hofreiter Michael M  

GigaScience 20180501 5


<h4>Background</h4>Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available.<h4>Findings</h4>In order to improve genome contiguity, we have developed Cross-Species Scaffolding-a new pipeline that imports long-  ...[more]

Similar Datasets

| S-EPMC9833964 | biostudies-literature
| S-EPMC3648348 | biostudies-literature
| S-EPMC3928519 | biostudies-literature
| S-EPMC4035081 | biostudies-other
| PRJEB4453 | ENA
| S-EPMC6240028 | biostudies-literature
| S-EPMC6117203 | biostudies-literature
| S-EPMC4176323 | biostudies-literature
| PRJEB13570 | ENA
2012-05-15 | E-MTAB-1082 | biostudies-arrayexpress