Dataset Information

GeneHunt for rapid domain-specific annotation of glycoside hydrolases.

ABSTRACT: The identification of glycoside hydrolases (GHs) for efficient polysaccharide deconstruction is essential for the development of biofuels. Here, we investigate the potential of sequential HMM-profile identification for the rapid and precise identification of the multi-domain architecture of GHs from various datasets. First, as a validation, we successfully reannotated >98% of the biochemically characterized enzymes listed on the CAZy database. Next, we analyzed the 43 million non-redundant sequences from the M5nr data and identified 322,068 unique GHs. Finally, we searched 129 assembled metagenomes retrieved from MG-RAST for environmental GHs and identified 160,790 additional enzymes. Although most identified sequences corresponded to single domain enzymes, many contained several domains, including known accessory domains and some domains never identified in association with GH. Several sequences displayed multiple catalytic domains and few of these potential multi-activity proteins combined potentially synergistic domains. Finally, we produced and confirmed the biochemical activities of a GH5-GH10 cellulase-xylanase and a GH11-CE4 xylanase-esterase. Globally, this "gene to enzyme pipeline" provides a rationale for mining large datasets in order to identify new catalysts combining unique properties for the efficient deconstruction of polysaccharides.

SUBMITTER: Nguyen SN

PROVIDER: S-EPMC6626019 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

GeneHunt for rapid domain-specific annotation of glycoside hydrolases.

Nguyen S N SN Flores A A Talamantes D D Dar F F Valdez A A Schwans J J Berlemont R R

Scientific reports 20190712 1

The identification of glycoside hydrolases (GHs) for efficient polysaccharide deconstruction is essential for the development of biofuels. Here, we investigate the potential of sequential HMM-profile identification for the rapid and precise identification of the multi-domain architecture of GHs from various datasets. First, as a validation, we successfully reannotated >98% of the biochemically characterized enzymes listed on the CAZy database. Next, we analyzed the 43 million non-redundant seque ...[more]

PMID: 31300677

Dataset Information

GeneHunt for rapid domain-specific annotation of glycoside hydrolases.

Publications

GeneHunt for rapid domain-specific annotation of glycoside hydrolases.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Analysis of Domain Architecture and Phylogenetics of Family 2 Glycoside Hydrolases (GH2).
| S-EPMC5145203 | biostudies-literature

Glycoside Hydrolases across Environmental Microbial Communities.
| S-EPMC5218504 | biostudies-literature

Hyperthermophilic Thermotoga species differ with respect to specific carbohydrate transporters and glycoside hydrolases.
| S-EPMC3298158 | biostudies-literature

Functional exploration of novel glycoside hydrolases in Fervidibacter sacchari PD1T
2023-12-31 | GSE249938 | GEO

Allylic Carbocyclic Inhibitors Covalently Bind Glycoside Hydrolases.
| S-EPMC10131216 | biostudies-literature

Biomass-degrading glycoside hydrolases of archaeal origin.
| S-EPMC7469102 | biostudies-literature

Gene-centric metagenomics of the fiber-adherent bovine rumen microbiome reveals forage specific glycoside hydrolases.
| S-EPMC2633212 | biostudies-literature

Glycoside Hydrolases Degrade Polymicrobial Bacterial Biofilms in Wounds.
| S-EPMC5278739 | biostudies-literature

Differential Efficacy of Glycoside Hydrolases to Disperse Biofilms.
| S-EPMC7393775 | biostudies-literature

Glycoside hydrolases in the biodegradation of lignocellulosic biomass.
| S-EPMC10654287 | biostudies-literature