Dataset Information

Puzzle Hi-C: An accurate scaffolding software.


ABSTRACT: High-quality, chromosome-scale genomes are essential for genomic analyses. Analyses including 3D genomics, epigenetics, and comparative genomics rely on a high-quality genome assembly, which is often accomplished with the assistance of Hi-C data. Curation of genomes reveals that current Hi-C-assisted scaffolding algorithms either generate ordering and orientation errors or fail to assemble high-quality chromosome-level scaffolds. Here, we offer the software Puzzle Hi-C, which uses Hi-C reads to accurately assign contigs or scaffolds to chromosomes. Puzzle Hi-C counts interactions in a triangular region of the Hi-C heatmap instead of a square region. This strategy dramatically diminishes scaffolding interference caused by long-range interactions. The software also introduces a dynamic triangle-window strategy during assembly: the window starts small and expands with interactions to produce more effective clustering. Puzzle Hi-C outperforms available scaffolding tools.
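
The difference between counting over a square region and a triangular region can be illustrated with a small sketch. This is not the Puzzle Hi-C implementation; it is one plausible reading of the abstract, under these assumptions: `block` is a hypothetical contact sub-matrix between two neighbouring contigs A and B, where `block[i][j]` holds the contacts between the i-th bin counted back from the end of A and the j-th bin counted from the start of B, and `w` is a hypothetical window size in bins. The function names are illustrative only.

```python
def square_count(block, w):
    """Sum every cell in the w x w corner of the block nearest the junction."""
    rows = min(w, len(block))
    return sum(
        block[i][j]
        for i in range(rows)
        for j in range(min(w, len(block[i])))
    )


def triangle_count(block, w):
    """Sum only cells whose combined bin distance from the junction is below w.

    Cells with i + j >= w lie far from the junction on at least one side,
    so they mostly reflect long-range contacts and are left out.
    """
    rows = min(w, len(block))
    return sum(
        block[i][j]
        for i in range(rows)
        for j in range(min(w - i, len(block[i])))
    )


# Toy data (made up for illustration): contacts decay with distance from
# the junction, except for one long-range signal in the far corner.
block = [
    [9, 5, 2],
    [5, 3, 1],
    [2, 1, 4],  # the 4 is a long-range contact far from the junction
]

print(square_count(block, 3))    # 32, includes the far-corner cells
print(triangle_count(block, 3))  # 26, excludes cells with i + j >= 3
```

In this toy case the square window picks up the far-corner cell, while the triangular window ignores it. That is the kind of long-range interference the abstract says the triangle strategy reduces. The dynamic window described in the abstract would correspond to starting with a small `w` and increasing it; the growth rule is not given here, so it is not sketched.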

SUBMITTER: Lin G 

PROVIDER: S-EPMC11249255 | biostudies-literature | 2024

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC9848053 | biostudies-literature
| S-EPMC5290626 | biostudies-literature
| S-EPMC6952475 | biostudies-literature
| S-EPMC9292758 | biostudies-literature
| S-EPMC6816165 | biostudies-literature
| S-EPMC1919397 | biostudies-literature
| S-EPMC4477656 | biostudies-literature
| S-EPMC7987978 | biostudies-literature
| S-EPMC8851831 | biostudies-literature
| S-EPMC5860379 | biostudies-literature