Dataset Information

HaploMerger: reconstructing allelic relationships for polymorphic diploid genome assemblies.


ABSTRACT: Whole-genome shotgun assembly has been a long-standing issue for highly polymorphic genomes, and the advent of next-generation sequencing technologies has made the issue more challenging than ever. Here we present an automated pipeline, HaploMerger, for reconstructing allelic relationships in a diploid assembly. HaploMerger combines a LASTZ-ChainNet alignment approach with a novel graph-based structure, which helps to untangle allelic relationships between two haplotypes and guides the subsequent creation of reference haploid assemblies. The pipeline provides flexible parameters and schemes to improve the contiguity, continuity, and completeness of the reference assemblies. We show that HaploMerger produces efficient and accurate results in simulations and has advantages over manual curation when applied to real polymorphic assemblies (e.g., 4%-5% heterozygosity). We also used HaploMerger to analyze the diploid assembly of a single Chinese amphioxus (Branchiostoma belcheri) and compared the resulting haploid assemblies with EST sequences, which revealed that the two haplotypes are not only divergent but also highly complementary to each other. Taken together, we have demonstrated that HaploMerger is an effective tool for analyzing and exploiting polymorphic genome assemblies.

SUBMITTER: Huang S 

PROVIDER: S-EPMC3409271 | biostudies-literature | 2012 Aug

REPOSITORIES: biostudies-literature

Publications

HaploMerger: reconstructing allelic relationships for polymorphic diploid genome assemblies.

Huang Shengfeng, Chen Zelin, Huang Guangrui, Yu Ting, Yang Ping, Li Jie, Fu Yonggui, Yuan Shaochun, Chen Shangwu, Xu Anlong

Genome Research, 2012-05-03, issue 8


Similar Datasets

| S-EPMC6267036 | biostudies-literature
| S-EPMC8016491 | biostudies-literature
| S-EPMC5870766 | biostudies-literature
| S-EPMC4449708 | biostudies-literature
| S-EPMC6829152 | biostudies-literature
| S-EPMC7728601 | biostudies-literature
| S-EPMC5025496 | biostudies-literature
| S-EPMC5941971 | biostudies-literature
| S-EPMC9890229 | biostudies-literature
| S-EPMC4422153 | biostudies-literature