Unknown

Dataset Information

0

Integrating Hi-C links with assembly graphs for chromosome-scale assembly.


ABSTRACT: Long-read sequencing and novel long-range assays have revolutionized de novo genome assembly by automating the reconstruction of reference-quality genomes. In particular, Hi-C sequencing is becoming an economical method for generating chromosome-scale scaffolds. Despite its increasing popularity, there are limited open-source tools available. Errors, particularly inversions and fusions across chromosomes, remain higher than alternate scaffolding technologies. We present a novel open-source Hi-C scaffolder that does not require an a priori estimate of chromosome number and minimizes errors by scaffolding with the assistance of an assembly graph. We demonstrate higher accuracy than the state-of-the-art methods across a variety of Hi-C library preparations and input assembly sizes. The Python and C++ code for our method is openly available at https://github.com/machinegun/SALSA.

SUBMITTER: Ghurye J 

PROVIDER: S-EPMC6719893 | biostudies-literature | 2019 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Integrating Hi-C links with assembly graphs for chromosome-scale assembly.

Ghurye Jay J   Rhie Arang A   Walenz Brian P BP   Schmitt Anthony A   Selvaraj Siddarth S   Pop Mihai M   Phillippy Adam M AM   Koren Sergey S  

PLoS computational biology 20190821 8


Long-read sequencing and novel long-range assays have revolutionized de novo genome assembly by automating the reconstruction of reference-quality genomes. In particular, Hi-C sequencing is becoming an economical method for generating chromosome-scale scaffolds. Despite its increasing popularity, there are limited open-source tools available. Errors, particularly inversions and fusions across chromosomes, remain higher than alternate scaffolding technologies. We present a novel open-source Hi-C  ...[more]

Similar Datasets

| S-EPMC8496284 | biostudies-literature
| S-EPMC7878743 | biostudies-literature
| S-EPMC6889754 | biostudies-literature
| S-EPMC7645714 | biostudies-literature
2021-02-12 | GSE152150 | GEO
| S-EPMC7395734 | biostudies-literature
| S-EPMC5946948 | biostudies-literature
| S-EPMC6952475 | biostudies-literature
| S-EPMC9292758 | biostudies-literature
| S-EPMC4774023 | biostudies-literature