Unknown

Dataset Information

0

A subfamily roadmap of the evolutionarily diverse glycoside hydrolase family 16 (GH16).


ABSTRACT: Glycoside hydrolase family (GH) 16 comprises a large and taxonomically diverse family of glycosidases and transglycosidases that adopt a common ?-jelly-roll fold and are active on a range of terrestrial and marine polysaccharides. Presently, broadly insightful sequence-function correlations in GH16 are hindered by a lack of a systematic subfamily structure. To fill this gap, we have used a highly scalable protein sequence similarity network analysis to delineate nearly 23,000 GH16 sequences into 23 robust subfamilies, which are strongly supported by hidden Markov model and maximum likelihood molecular phylogenetic analyses. Subsequent evaluation of over 40 experimental three-dimensional structures has highlighted key tertiary structural differences, predominantly manifested in active-site loops, that dictate substrate specificity across the GH16 evolutionary landscape. As for other large GH families (i.e. GH5, GH13, and GH43), this new subfamily classification provides a roadmap for functional glycogenomics that will guide future bioinformatics and experimental structure-function analyses. The GH16 subfamily classification is publicly available in the CAZy database. The sequence similarity network workflow used here, SSNpipe, is freely available from GitHub.

SUBMITTER: Viborg AH 

PROVIDER: S-EPMC6827312 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

A subfamily roadmap of the evolutionarily diverse glycoside hydrolase family 16 (GH16).

Viborg Alexander Holm AH   Terrapon Nicolas N   Lombard Vincent V   Michel Gurvan G   Czjzek Mirjam M   Henrissat Bernard B   Brumer Harry H  

The Journal of biological chemistry 20190909 44


Glycoside hydrolase family (GH) 16 comprises a large and taxonomically diverse family of glycosidases and transglycosidases that adopt a common β-jelly-roll fold and are active on a range of terrestrial and marine polysaccharides. Presently, broadly insightful sequence-function correlations in GH16 are hindered by a lack of a systematic subfamily structure. To fill this gap, we have used a highly scalable protein sequence similarity network analysis to delineate nearly 23,000 GH16 sequences into  ...[more]

Similar Datasets

| S-EPMC10074150 | biostudies-literature
| S-EPMC3526467 | biostudies-literature
| S-EPMC8453022 | biostudies-literature
| S-EPMC6715846 | biostudies-literature
| S-EPMC5290943 | biostudies-literature
| S-EPMC4722856 | biostudies-literature
| S-EPMC8952944 | biostudies-literature
| S-EPMC3928506 | biostudies-literature
| S-EPMC5934979 | biostudies-literature
| S-EPMC10223318 | biostudies-literature