Unknown

Dataset Information

0

Phylogenetic Clustering of Genes Reveals Shared Evolutionary Trajectories and Putative Gene Functions.


ABSTRACT: Homologous genes in prokaryotes can be described using phylogenetic profiles which summarize their patterns of presence or absence across a set of genomes. Phylogenetic profiles have been used for nearly twenty years to cluster genes based on measures such as the Euclidean distance between profile vectors. However, most approaches do not take into account the phylogenetic relationships amongst the profiled genomes, and overrepresentation of certain taxonomic groups (i.e., pathogenic species with many sequenced representatives) can skew the interpretation of profiles. We propose a new approach that uses a coevolutionary method defined by Pagel to account for the phylogenetic relationships amongst target organisms, and a hierarchical-clustering approach to define sets of genes with common distributions across the organisms. The clusters we obtain using our method show greater evidence of phylogenetic and functional clustering than a recently published approach based on hidden Markov models. Our clustering method identifies sets of amino-acid biosynthesis genes that constitute cohesive pathways, and motility/chemotaxis genes with common histories of descent and lateral gene transfer.

SUBMITTER: Liu C 

PROVIDER: S-EPMC6130602 | biostudies-literature | 2018 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Phylogenetic Clustering of Genes Reveals Shared Evolutionary Trajectories and Putative Gene Functions.

Liu Chaoyue C   Wright Benjamin B   Allen-Vercoe Emma E   Gu Hong H   Beiko Robert R  

Genome biology and evolution 20180901 9


Homologous genes in prokaryotes can be described using phylogenetic profiles which summarize their patterns of presence or absence across a set of genomes. Phylogenetic profiles have been used for nearly twenty years to cluster genes based on measures such as the Euclidean distance between profile vectors. However, most approaches do not take into account the phylogenetic relationships amongst the profiled genomes, and overrepresentation of certain taxonomic groups (i.e., pathogenic species with  ...[more]

Similar Datasets

| S-EPMC10182736 | biostudies-literature
| S-EPMC10282123 | biostudies-literature
| S-EPMC4868114 | biostudies-literature
| S-EPMC8602892 | biostudies-literature
| S-EPMC8319724 | biostudies-literature
| S-EPMC8262867 | biostudies-literature
| S-EPMC8559708 | biostudies-literature
| S-EPMC3213679 | biostudies-literature
| S-EPMC3010121 | biostudies-literature
| S-EPMC127635 | biostudies-literature