Dataset Information

Structural analysis based on state-space modeling.


ABSTRACT: A new method has been developed to compute the probability that each amino acid in a protein sequence is in a particular secondary structural element. Each of these probabilities is computed using the entire sequence and a set of predefined structural class models. This set of structural classes is patterned after Jane Richardson's taxonomy for the domains of globular proteins. For each structural class considered, a mathematical model is constructed to represent constraints on the pattern of secondary structural elements characteristic of that class. These are stochastic models having discrete state spaces (referred to as hidden Markov models by researchers in signal processing and automatic speech recognition). Each model is a mathematical generator of amino acid sequences; the sequence under consideration is modeled as having been generated by one model in the set of candidates. The probability that each model generated the given sequence is computed using a filtering algorithm. The protein is then classified as belonging to the structural class having the most probable model. The secondary structure of the sequence is then analyzed using a "smoothing" algorithm that is optimal for that structural class model. For each residue position in the sequence, the smoother computes the probability that the residue is contained within each of the defined secondary structural elements of the model. This method has two important advantages: (1) the probability of each residue being in each of the modeled secondary structural elements is computed using the totality of the amino acid sequence, and (2) these probabilities are consistent with prior knowledge of realizable domain folds as encoded in each model. As an example of the method's utility, we present its application to flavodoxin, a prototypical alpha/beta protein having a central beta-sheet, and to thioredoxin, which belongs to a similar structural class but shares no significant sequence similarity.
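The filtering, classification, and smoothing steps described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the two toy class models, their state names, the reduced two-letter alphabet (H for hydrophobic, P for polar), and all probability values. The sketch uses the standard scaled forward-backward recursions for a discrete hidden Markov model, which may differ in detail from the authors' algorithms.

```python
import math

# Hypothetical toy class models over a reduced alphabet ("H", "P").
# State names and every number below are illustrative assumptions.
MODELS = {
    "all_alpha": {
        "prior": 0.5,
        "states": ["helix", "loop"],
        "init": [0.5, 0.5],
        "trans": [[0.9, 0.1], [0.2, 0.8]],
        "emit": [{"H": 0.7, "P": 0.3}, {"H": 0.3, "P": 0.7}],
    },
    "alpha_beta": {
        "prior": 0.5,
        "states": ["helix", "strand", "loop"],
        "init": [0.3, 0.3, 0.4],
        "trans": [[0.85, 0.05, 0.10], [0.05, 0.80, 0.15], [0.25, 0.25, 0.50]],
        "emit": [{"H": 0.6, "P": 0.4}, {"H": 0.8, "P": 0.2}, {"H": 0.2, "P": 0.8}],
    },
}


def forward_backward(seq, init, trans, emit):
    """Return (log-likelihood of seq, per-position posterior state probabilities)."""
    n, k = len(seq), len(init)
    alpha, scales = [], []
    prev = None
    # Filtering pass: normalized forward probabilities and scaling factors.
    for t, sym in enumerate(seq):
        if t == 0:
            pred = list(init)
        else:
            pred = [sum(prev[i] * trans[i][j] for i in range(k)) for j in range(k)]
        unnorm = [pred[j] * emit[j][sym] for j in range(k)]
        c = sum(unnorm)
        prev = [u / c for u in unnorm]
        alpha.append(prev)
        scales.append(c)
    log_lik = sum(math.log(c) for c in scales)
    # Smoothing pass: backward recursion using the same scaling factors.
    beta = [[1.0] * k for _ in range(n)]
    for t in range(n - 2, -1, -1):
        sym = seq[t + 1]
        beta[t] = [
            sum(trans[i][j] * emit[j][sym] * beta[t + 1][j] for j in range(k)) / scales[t + 1]
            for i in range(k)
        ]
    post = [[alpha[t][i] * beta[t][i] for i in range(k)] for t in range(n)]
    return log_lik, post


def classify(seq, models):
    """Posterior probability that each class model generated seq."""
    scores = {
        name: math.log(m["prior"]) + forward_backward(seq, m["init"], m["trans"], m["emit"])[0]
        for name, m in models.items()
    }
    top = max(scores.values())  # subtract the maximum for numerical stability
    weights = {name: math.exp(s - top) for name, s in scores.items()}
    total = sum(weights.values())
    return {name: w / total for name, w in weights.items()}


if __name__ == "__main__":
    seq = "HHPHHPPPHHHHPP"
    model_probs = classify(seq, MODELS)
    best = max(model_probs, key=model_probs.get)
    m = MODELS[best]
    _, post = forward_backward(seq, m["init"], m["trans"], m["emit"])
    print("most probable class model:", best)
    for residue, row in zip(seq, post):
        print(residue, dict(zip(m["states"], (round(p, 3) for p in row))))
```

In this sketch the smoothed probability at each position depends on the whole sequence through the forward and backward passes, and the allowed transitions of the selected model constrain which patterns of secondary structural elements receive probability mass, which mirrors the two advantages listed in the abstract.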

SUBMITTER: Stultz CM 

PROVIDER: S-EPMC2142382 | biostudies-other | 1993 Mar

REPOSITORIES: biostudies-other

Publications

Structural analysis based on state-space modeling.

Stultz CM, White JV, Smith TF

Protein Science: a publication of the Protein Society, 1993 Mar 1, issue 3


Similar Datasets

| S-EPMC8145286 | biostudies-literature
| S-EPMC5575519 | biostudies-literature
| S-EPMC7271698 | biostudies-literature
| S-EPMC8536256 | biostudies-literature
| S-EPMC3049912 | biostudies-other
| S-EPMC5776784 | biostudies-literature
| S-EPMC6928115 | biostudies-literature
| S-EPMC6680062 | biostudies-literature
| S-EPMC4987932 | biostudies-literature
| S-EPMC6539555 | biostudies-literature