Unknown

Dataset Information

0

Profile Comparer Extended: phylogeny of lytic polysaccharide monooxygenase families using profile hidden Markov model alignments.


ABSTRACT: Insight into the inter- and intra-family relationship of protein families is important, since it can aid understanding of substrate specificity evolution and assign putative functions to proteins with unknown function. To study both these inter- and intra-family relationships, the ability to build phylogenetic trees using the most sensitive sequence similarity search methods (e.g. profile hidden Markov model (pHMM)-pHMM alignments) is required. However, existing solutions require a very long calculation time to obtain the phylogenetic tree. Therefore, a faster protocol is required to make this approach efficient for research. To contribute to this goal, we extended the original Profile Comparer program (PRC) for the construction of large pHMM phylogenetic trees at speeds several orders of magnitude faster compared to pHMM-tree. As an example, PRC Extended (PRCx) was used to study the phylogeny of over 10,000 sequences of lytic polysaccharide monooxygenase (LPMO) from over seven families. Using the newly developed program we were able to reveal previously unknown homologs of LPMOs, namely the PFAM Egh16-like family. Moreover, we show that the substrate specificities have evolved independently several times within the LPMO superfamily. Furthermore, the LPMO phylogenetic tree, does not seem to follow taxonomy-based classification.

SUBMITTER: Voshol GP 

PROVIDER: S-EPMC6950343 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

Profile Comparer Extended: phylogeny of lytic polysaccharide monooxygenase families using profile hidden Markov model alignments.

Voshol Gerben P GP   Punt Peter J PJ   Vijgenboom Erik E  

F1000Research 20191031


Insight into the inter- and intra-family relationship of protein families is important, since it can aid understanding of substrate specificity evolution and assign putative functions to proteins with unknown function. To study both these inter- and intra-family relationships, the ability to build phylogenetic trees using the most sensitive sequence similarity search methods (e.g. profile hidden Markov model (pHMM)-pHMM alignments) is required. However, existing solutions require a very long cal  ...[more]

Similar Datasets

| S-EPMC5860389 | biostudies-literature
| S-EPMC6894463 | biostudies-literature
| S-EPMC3893531 | biostudies-literature
| S-EPMC6404106 | biostudies-literature
| S-EPMC7072406 | biostudies-literature
| S-EPMC5787815 | biostudies-literature
| S-EPMC7078924 | biostudies-literature
| S-EPMC5651836 | biostudies-literature
| S-EPMC6334667 | biostudies-literature
| S-EPMC5865291 | biostudies-literature