Unknown

Dataset Information

0

Functional Annotations of Paralogs: A Blessing and a Curse.


ABSTRACT: Gene duplication followed by mutation is a classic mechanism of neofunctionalization, producing gene families with functional diversity. In some cases, a single point mutation is sufficient to change the substrate specificity and/or the chemistry performed by an enzyme, making it difficult to accurately separate enzymes with identical functions from homologs with different functions. Because sequence similarity is often used as a basis for assigning functional annotations to genes, non-isofunctional gene families pose a great challenge for genome annotation pipelines. Here we describe how integrating evolutionary and functional information such as genome context, phylogeny, metabolic reconstruction and signature motifs may be required to correctly annotate multifunctional families. These integrative analyses can also lead to the discovery of novel gene functions, as hints from specific subgroups can guide the functional characterization of other members of the family. We demonstrate how careful manual curation processes using comparative genomics can disambiguate subgroups within large multifunctional families and discover their functions. We present the COG0720 protein family as a case study. We also discuss strategies to automate this process to improve the accuracy of genome functional annotation pipelines.

SUBMITTER: Zallot R 

PROVIDER: S-EPMC5041015 | biostudies-literature | 2016 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Functional Annotations of Paralogs: A Blessing and a Curse.

Zallot Rémi R   Harrison Katherine J KJ   Kolaczkowski Bryan B   de Crécy-Lagard Valérie V  

Life (Basel, Switzerland) 20160908 3


Gene duplication followed by mutation is a classic mechanism of neofunctionalization, producing gene families with functional diversity. In some cases, a single point mutation is sufficient to change the substrate specificity and/or the chemistry performed by an enzyme, making it difficult to accurately separate enzymes with identical functions from homologs with different functions. Because sequence similarity is often used as a basis for assigning functional annotations to genes, non-isofuncti  ...[more]

Similar Datasets

| S-EPMC8615642 | biostudies-literature
| S-EPMC6993797 | biostudies-literature
| S-EPMC8238368 | biostudies-literature
| S-EPMC9988505 | biostudies-literature
| S-EPMC3280971 | biostudies-literature
| S-EPMC10023830 | biostudies-literature
| S-EPMC3338835 | biostudies-literature
| S-EPMC10852238 | biostudies-literature
| S-EPMC5304448 | biostudies-literature