Unknown

Dataset Information

0

Predicting Flavonoid UGT Regioselectivity.


ABSTRACT: MACHINE LEARNING WAS APPLIED TO A CHALLENGING AND BIOLOGICALLY SIGNIFICANT PROTEIN CLASSIFICATION PROBLEM: the prediction of avonoid UGT acceptor regioselectivity from primary sequence. Novel indices characterizing graphical models of residues were proposed and found to be widely distributed among existing amino acid indices and to cluster residues appropriately. UGT subsequences biochemically linked to regioselectivity were modeled as sets of index sequences. Several learning techniques incorporating these UGT models were compared with classifications based on standard sequence alignment scores. These techniques included an application of time series distance functions to protein classification. Time series distances defined on the index sequences were used in nearest neighbor and support vector machine classifiers. Additionally, Bayesian neural network classifiers were applied to the index sequences. The experiments identified improvements over the nearest neighbor and support vector machine classifications relying on standard alignment similarity scores, as well as strong correlations between specific subsequences and regioselectivities.

SUBMITTER: Jackson R 

PROVIDER: S-EPMC3130495 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting Flavonoid UGT Regioselectivity.

Jackson Rhydon R   Knisley Debra D   McIntosh Cecilia C   Pfeiffer Phillip P  

Advances in bioinformatics 20110630


MACHINE LEARNING WAS APPLIED TO A CHALLENGING AND BIOLOGICALLY SIGNIFICANT PROTEIN CLASSIFICATION PROBLEM: the prediction of avonoid UGT acceptor regioselectivity from primary sequence. Novel indices characterizing graphical models of residues were proposed and found to be widely distributed among existing amino acid indices and to cluster residues appropriately. UGT subsequences biochemically linked to regioselectivity were modeled as sets of index sequences. Several learning techniques incorpo  ...[more]

Similar Datasets

| S-EPMC3205324 | biostudies-literature
| S-EPMC4994406 | biostudies-literature
| S-EPMC4938162 | biostudies-literature
| S-EPMC2277483 | biostudies-literature
| S-EPMC10702079 | biostudies-literature
| S-EPMC4775324 | biostudies-literature
| S-EPMC7042106 | biostudies-literature
| S-EPMC5939911 | biostudies-literature
| S-EPMC3973126 | biostudies-literature
| S-EPMC3874916 | biostudies-literature