Unknown

Dataset Information

0

Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph.


ABSTRACT: Despite recent advances in the length and the accuracy of long-read data, building haplotype-resolved genome assemblies from telomere to telomere still requires considerable computational resources. In this study, we present an efficient de novo assembly algorithm that combines multiple sequencing technologies to scale up population-wide telomere-to-telomere assemblies. By utilizing twenty-two human and two plant genomes, we demonstrate that our algorithm is around an order of magnitude cheaper than existing methods, while producing better diploid and haploid assemblies. Notably, our algorithm is the only feasible solution to the haplotype-resolved assembly of polyploid genomes.

SUBMITTER: Cheng H 

PROVIDER: S-EPMC10274930 | biostudies-literature | 2023 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph.

Cheng Haoyu H   Asri Mobin M   Lucas Julian J   Koren Sergey S   Li Heng H  

ArXiv 20230606


Despite recent advances in the length and the accuracy of long-read data, building haplotype-resolved genome assemblies from telomere to telomere still requires considerable computational resources. In this study, we present an efficient <i>de novo</i> assembly algorithm that combines multiple sequencing technologies to scale up population-wide telomere-to-telomere assemblies. By utilizing twenty-two human and two plant genomes, we demonstrate that our algorithm is around an order of magnitude c  ...[more]

Similar Datasets

| S-EPMC11214949 | biostudies-literature
| S-EPMC6938933 | biostudies-literature
| S-EPMC10427740 | biostudies-literature
| S-EPMC7066127 | biostudies-literature
| S-EPMC6022571 | biostudies-literature
| S-EPMC9464699 | biostudies-literature
| S-EPMC9668749 | biostudies-literature
| S-EPMC3694639 | biostudies-literature
| S-EPMC11655631 | biostudies-literature
| S-EPMC6805285 | biostudies-literature