
Dataset Information


Analysis of an optimal hidden Markov model for secondary structure prediction.


ABSTRACT:

Background

Secondary structure prediction is a useful first step toward 3D structure prediction. A number of successful secondary structure prediction methods use neural networks, but unfortunately, neural networks are not intuitively interpretable. In contrast, hidden Markov models are graphical, interpretable models. Moreover, they have been used successfully in many bioinformatics applications. Because they offer a strong statistical foundation and allow model interpretation, we propose a method based on hidden Markov models.

Results

Our HMM is designed without prior knowledge. It is chosen from a collection of models of increasing size, using statistical and accuracy criteria. The resulting model has 36 hidden states: 15 that model alpha-helices, 12 that model coil and 9 that model beta-strands. Connections between hidden states and state emission probabilities reflect the organization of protein structures into secondary structure segments. We start by analyzing the model's features and show how it offers a new view of local structures. We then use it for secondary structure prediction. Our model appears to be very efficient on single sequences, with a Q3 score of 68.8%, more than one point above PSIPRED prediction on single sequences. A straightforward extension of the method allows the use of multiple sequence alignments, raising the Q3 score to 75.5%.
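
For readers unfamiliar with the Q3 score: it is conventionally the percentage of residues whose predicted three-state class (helix H, strand E, coil C) matches the observed class. The following is a minimal sketch under that assumption. The function name `q3_score` and the example strings are hypothetical and do not come from the paper.

```python
def q3_score(predicted: str, observed: str) -> float:
    """Return the percentage of positions where predicted == observed.

    Both arguments are assumed to be equal-length strings over the
    three-state alphabet H (helix), E (strand), C (coil).
    """
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("sequences must be non-empty")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * matches / len(observed)


# Hypothetical example strings, invented for illustration only.
observed = "CCHHHHHCCEEEECC"
predicted = "CCHHHHCCCEEECCC"
score = q3_score(predicted, observed)  # 13 of 15 residues agree
```

Under this definition, the 68.8% reported for single sequences would mean that roughly 69 of every 100 residues are assigned the correct class.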

Conclusion

The hidden Markov model presented here achieves valuable prediction results using only a limited number of parameters. It provides an interpretable framework for protein secondary structure architecture. Furthermore, it can be used as a tool for generating protein sequences with a given secondary structure content.
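
To illustrate how an HMM can generate sequences together with their secondary structure, here is a minimal sampling sketch. It uses a toy three-state model (one state per class) rather than the 36-state model described above. All transition and emission probabilities, the reduced residue alphabet, and the name `sample_hmm` are invented for illustration and are not the paper's parameters.

```python
import random

# Toy parameters, invented for illustration only (NOT from the paper).
# One hidden state per class: H (helix), E (strand), C (coil).
TRANSITIONS = {
    "H": {"H": 0.90, "E": 0.00, "C": 0.10},
    "E": {"H": 0.00, "E": 0.80, "C": 0.20},
    "C": {"H": 0.15, "E": 0.15, "C": 0.70},
}
# Emissions over a reduced, hypothetical residue alphabet.
EMISSIONS = {
    "H": {"A": 0.4, "L": 0.4, "G": 0.2},
    "E": {"V": 0.5, "I": 0.4, "G": 0.1},
    "C": {"G": 0.5, "P": 0.3, "A": 0.2},
}


def sample_hmm(length, start="C", seed=None):
    """Sample a residue sequence and its hidden-state path from the toy HMM."""
    rng = random.Random(seed)
    state = start
    residues, path = [], []
    for _ in range(length):
        path.append(state)
        # Emit one residue from the current state's emission distribution.
        emis = EMISSIONS[state]
        residues.append(rng.choices(list(emis), weights=list(emis.values()))[0])
        # Move to the next hidden state.
        trans = TRANSITIONS[state]
        state = rng.choices(list(trans), weights=list(trans.values()))[0]
    return "".join(residues), "".join(path)


sequence, structure = sample_hmm(50, seed=0)
```

In this sketch the secondary structure content of generated sequences is controlled by the transition probabilities: raising the self-transition probability of a state lengthens segments of that class.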

SUBMITTER: Martin J 

PROVIDER: S-EPMC1769381 | biostudies-literature | 2006 Dec

REPOSITORIES: biostudies-literature


Publications

Analysis of an optimal hidden Markov model for secondary structure prediction.

Juliette Martin, Jean-François Gibrat, François Rodolphe

BMC Structural Biology, 2006-12-13



Similar Datasets

| S-EPMC1479840 | biostudies-literature
| S-EPMC2840161 | biostudies-literature
| S-EPMC3009500 | biostudies-literature
| S-EPMC4277924 | biostudies-literature
| S-EPMC8204269 | biostudies-literature
2012-10-18 | GSE34490 | GEO
| S-EPMC6373422 | biostudies-literature
| S-EPMC2735038 | biostudies-literature
| S-EPMC3114652 | biostudies-literature