Unknown

Dataset Information

0

Supervised multivariate analysis of sequence groups to identify specificity determining residues.


ABSTRACT: BACKGROUND: Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able identify residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these residues from families of proteins with different substrate specifities using multiple sequence alignments. RESULTS: We demonstrate the usefulness of this method on three different test cases. Two of these test cases, the Lactate/Malate dehydrogenase family and Nucleotidyl Cyclases, consist of two functional groups. The other family, Serine Proteases consists of three groups. BGA was used to analyse and visualise these three families using two different encoding schemes for the amino acids. CONCLUSION: This overall combination of methods in this paper is powerful and flexible while being computationally very fast and simple. BGA is especially useful because it can be used to analyse any number of functional classes. In the examples we used in this paper, we have only used 2 or 3 classes for demonstration purposes but any number can be used and visualised.

SUBMITTER: Wallace IM 

PROVIDER: S-EPMC1878507 | biostudies-literature | 2007

REPOSITORIES: biostudies-literature

altmetric image

Publications

Supervised multivariate analysis of sequence groups to identify specificity determining residues.

Wallace Iain M IM   Higgins Desmond G DG  

BMC bioinformatics 20070423


<h4>Background</h4>Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able identify residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these residues from families of proteins with different substrate specifities using multiple sequence alignments.<h4>Results</h4>We demonstrate the usefulness of this method on three different test case  ...[more]

Similar Datasets

2023-12-05 | GSE233827 | GEO
| S-EPMC11006913 | biostudies-literature
| S-EPMC2718669 | biostudies-literature
| S-EPMC2709924 | biostudies-literature
| S-EPMC2686543 | biostudies-literature
| S-EPMC2655090 | biostudies-literature
| S-EPMC3358571 | biostudies-literature
| S-EPMC1183107 | biostudies-literature
| S-EPMC2286703 | biostudies-literature
| S-EPMC5122672 | biostudies-literature