Unknown

Dataset Information

0

Structural characterization of proteins using residue environments.


ABSTRACT: A primary challenge for structural genomics is the automated functional characterization of protein structures. We have developed a sequence-independent method called S-BLEST (Structure-Based Local Environment Search Tool) for the annotation of previously uncharacterized protein structures. S-BLEST encodes the local environment of an amino acid as a vector of structural property values. It has been applied to all amino acids in a nonredundant database of protein structures to generate a searchable structural resource. Given a query amino acid from an experimentally determined or modeled structure, S-BLEST quickly identifies similar amino acid environments using a K-nearest neighbor search. In addition, the method gives an estimation of the statistical significance of each result. We validated S-BLEST on X-ray crystal structures from the ASTRAL 40 nonredundant dataset. We then applied it to 86 crystallographically determined proteins in the protein data bank (PDB) with unknown function and with no significant sequence neighbors in the PDB. S-BLEST was able to associate 20 proteins with at least one local structural neighbor and identify the amino acid environments that are most similar between those neighbors.

SUBMITTER: Mooney SD 

PROVIDER: S-EPMC2483305 | biostudies-literature | 2005 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Structural characterization of proteins using residue environments.

Mooney Sean D SD   Liang Mike Hsin-Ping MH   DeConde Rob R   Altman Russ B RB  

Proteins 20051201 4


A primary challenge for structural genomics is the automated functional characterization of protein structures. We have developed a sequence-independent method called S-BLEST (Structure-Based Local Environment Search Tool) for the annotation of previously uncharacterized protein structures. S-BLEST encodes the local environment of an amino acid as a vector of structural property values. It has been applied to all amino acids in a nonredundant database of protein structures to generate a searchab  ...[more]

Similar Datasets

| S-EPMC3424558 | biostudies-literature
| S-EPMC4972195 | biostudies-literature
| S-EPMC3203928 | biostudies-literature
| S-EPMC6544178 | biostudies-literature
| S-EPMC6960613 | biostudies-literature
| S-EPMC2642882 | biostudies-literature
| S-EPMC3534501 | biostudies-literature
| S-EPMC4159253 | biostudies-literature
| S-EPMC4546970 | biostudies-literature
| S-EPMC8927995 | biostudies-literature