
Dataset Information


halSynteny: a fast, easy-to-use conserved synteny block construction method for multiple whole-genome alignments.


ABSTRACT: BACKGROUND: Large-scale sequencing projects provide high-quality full-genome data that can be used to reconstruct the chromosomal exchanges and rearrangements that disrupt conserved syntenic blocks. The highest resolution of cross-species homology can be obtained from whole-genome, reference-free alignments. Very large multiple alignments of full-genome sequences stored in a binary format demand an accurate and efficient computational approach for synteny block production. FINDINGS: halSynteny efficiently processes pairwise alignment blocks for any pair of genomes in the alignment. The tool is part of the HAL comparative genomics suite and is designed to build synteny blocks for multi-hundred-way, reference-free vertebrate alignments built with the Cactus system. CONCLUSIONS: halSynteny enables accurate and rapid identification of synteny in multiple full-genome alignments. The method is implemented in C++11 as a component of the halTools software and released under the MIT license. The package is available at https://github.com/ComparativeGenomicsToolkit/hal/.
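
To make the idea of building synteny blocks from pairwise alignment blocks concrete, here is a minimal Python sketch. It is not halSynteny's algorithm or API: the `Block` record, the `merge_collinear` function, and the `max_gap` parameter are all hypothetical names chosen for illustration, and the sketch assumes forward-strand blocks only. It simply merges neighbouring alignment blocks greedily when they lie on the same chromosome pair and are separated by small gaps in both genomes.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Block:
    """One pairwise alignment block (hypothetical, simplified record)."""
    q_chrom: str   # chromosome in the query genome
    q_start: int
    q_end: int
    t_chrom: str   # chromosome in the target genome
    t_start: int
    t_end: int


def merge_collinear(blocks: List[Block], max_gap: int) -> List[Block]:
    """Greedily merge neighbouring forward-strand blocks into larger blocks.

    Two blocks are merged when they share the same chromosome pair and the
    gap between them is between 0 and max_gap in both genomes.
    """
    ordered = sorted(blocks, key=lambda b: (b.q_chrom, b.t_chrom, b.q_start))
    merged: List[Block] = []
    for b in ordered:
        last = merged[-1] if merged else None
        if (
            last is not None
            and last.q_chrom == b.q_chrom
            and last.t_chrom == b.t_chrom
            and 0 <= b.q_start - last.q_end <= max_gap
            and 0 <= b.t_start - last.t_end <= max_gap
        ):
            # Extend the current synteny block to cover this alignment block.
            last.q_end = b.q_end
            last.t_end = b.t_end
        else:
            # Start a new synteny block (copy so the input is not mutated).
            merged.append(
                Block(b.q_chrom, b.q_start, b.q_end, b.t_chrom, b.t_start, b.t_end)
            )
    return merged


if __name__ == "__main__":
    example = [
        Block("chr1", 0, 100, "chrA", 0, 100),
        Block("chr1", 150, 300, "chrA", 160, 310),
        Block("chr1", 5000, 5100, "chrA", 9000, 9100),
    ]
    for block in merge_collinear(example, max_gap=1000):
        print(block)
```

A real tool would also need to handle reverse-strand blocks, overlaps, and minimum block lengths; those are left out here to keep the sketch short.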

SUBMITTER: Krasheninnikova K 

PROVIDER: S-EPMC7254927 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature


Publications

halSynteny: a fast, easy-to-use conserved synteny block construction method for multiple whole-genome alignments.

Ksenia Krasheninnikova, Mark Diekhans, Joel Armstrong, Aleksei Dievskii, Benedict Paten, Stephen O'Brien

GigaScience, 2020 Jun 1, issue 6



Similar Datasets

| S-EPMC7728760 | biostudies-literature
| S-EPMC2720179 | biostudies-literature
| S-EPMC1463900 | biostudies-literature
| S-EPMC3157923 | biostudies-literature
| S-EPMC9243546 | biostudies-literature
| S-EPMC3226196 | biostudies-literature
| S-EPMC2808710 | biostudies-literature
| S-EPMC3416384 | biostudies-literature
| S-EPMC2206062 | biostudies-literature
| S-EPMC146152 | biostudies-other