Unknown

Dataset Information

0

CURatio: Genome-wide phylogenomic analysis method using ratios of total branch lengths.


ABSTRACT: Evolutionary hypotheses provide important underpinnings of biological and medical sciences, and comprehensive, genome-wide understanding of evolutionary relationships among organisms are needed to test and refine such hypotheses. Theory and empirical evidence clearly indicate that phylogenies (trees) of different genes (loci) should not display precisely matching topologies. The main reason for such phylogenetic incongruence is reticulated evolutionary history of most species due to meiotic sexual recombination in eukaryotes, or horizontal transfers of genetic material in prokaryotes. Nevertheless, many genes should display topologically related phylogenies, and should group into one or more (for genetic hybrids) clusters in poly-dimensional "tree space". Unusual evolutionary histories or effects of selection may result in "outlier" genes with phylogenies that fall outside the main distribution(s) of trees in tree space. We present a new phylogenomic method, CURatio, which uses ratios of total branch lengths in gene trees to help identify phylogenetic outliers in a given set of ortholog groups from multiple genomes. An advantage of CURatio over other methods is that genes absent from and/or duplicated in some genomes can be included in the analysis. We conducted a simulation study under the coalescent model, and showed that, given sufficient species depth and topological difference, these ratios are significantly higher for the "outlier" gene phylogenies. Also, we applied CURatio to a set of annotated genomes of the fungal family, Clavicipitaceae, and identified alkaloid biosynthesis genes as outliers, probably due to a history of duplication and loss. The source code is available at https://github.com/QiwenKang/CURatio, and the empirical data set on Clavicipitaceae and simulated data set are available at Mendeley https://data.mendeley.com/datasets/mrxts7wjrr/1.

SUBMITTER: Kang Q 

PROVIDER: S-EPMC7372714 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

CURatio: Genome-wide phylogenomic analysis method using ratios of total branch lengths.

Kang Qiwen Q   Schardl Christopher L CL   Moore Neil N   Yoshida Ruriko R  

IEEE/ACM transactions on computational biology and bioinformatics 20181030


Evolutionary hypotheses provide important underpinnings of biological and medical sciences, and comprehensive, genome-wide understanding of evolutionary relationships among organisms are needed to test and refine such hypotheses. Theory and empirical evidence clearly indicate that phylogenies (trees) of different genes (loci) should not display precisely matching topologies. The main reason for such phylogenetic incongruence is reticulated evolutionary history of most species due to meiotic sexu  ...[more]

Similar Datasets

2020-06-04 | GSE84287 | GEO
| S-EPMC11326709 | biostudies-literature
2005-12-30 | GSE3932 | GEO
| S-EPMC4983288 | biostudies-literature
| S-EPMC8486251 | biostudies-literature
| S-EPMC4724952 | biostudies-literature
| S-EPMC8920512 | biostudies-literature
| S-EPMC2793527 | biostudies-literature
| S-EPMC8633083 | biostudies-literature
| S-EPMC2701336 | biostudies-literature