Unknown

Dataset Information

0

GenNon-h: generating multiple sequence alignments on nonhomogeneous phylogenetic trees.


ABSTRACT:

Background

A number of software packages are available to generate DNA multiple sequence alignments (MSAs) evolved under continuous-time Markov processes on phylogenetic trees. On the other hand, methods of simulating the DNA MSA directly from the transition matrices do not exist. Moreover, existing software restricts to the time-reversible models and it is not optimized to generate nonhomogeneous data (i.e. placing distinct substitution rates at different lineages).

Results

We present the first package designed to generate MSAs evolving under discrete-time Markov processes on phylogenetic trees, directly from probability substitution matrices. Based on the input model and a phylogenetic tree in the Newick format (with branch lengths measured as the expected number of substitutions per site), the algorithm produces DNA alignments of desired length. GenNon-h is publicly available for download.

Conclusion

The software presented here is an efficient tool to generate DNA MSAs on a given phylogenetic tree. GenNon-h provides the user with the nonstationary or nonhomogeneous phylogenetic data that is well suited for testing complex biological hypotheses, exploring the limits of the reconstruction algorithms and their robustness to such models.

SUBMITTER: Kedzierska AM 

PROVIDER: S-EPMC3532078 | biostudies-literature | 2012 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

GenNon-h: generating multiple sequence alignments on nonhomogeneous phylogenetic trees.

Kedzierska Anna M AM   Casanellas Marta M  

BMC bioinformatics 20120828


<h4>Background</h4>A number of software packages are available to generate DNA multiple sequence alignments (MSAs) evolved under continuous-time Markov processes on phylogenetic trees. On the other hand, methods of simulating the DNA MSA directly from the transition matrices do not exist. Moreover, existing software restricts to the time-reversible models and it is not optimized to generate nonhomogeneous data (i.e. placing distinct substitution rates at different lineages).<h4>Results</h4>We pr  ...[more]

Similar Datasets

| S-EPMC3598851 | biostudies-literature
| S-EPMC4115562 | biostudies-literature
| S-EPMC2637874 | biostudies-literature
| S-EPMC9588007 | biostudies-literature
| S-EPMC4538881 | biostudies-literature
| S-EPMC1948021 | biostudies-literature
| S-EPMC7297217 | biostudies-literature
| S-EPMC1463900 | biostudies-literature
| S-EPMC7671350 | biostudies-literature