Unknown

Dataset Information

0

Prediction of heme binding residues from protein sequences with integrative sequence profiles.


ABSTRACT: BACKGROUND:The heme-protein interactions are essential for various biological processes such as electron transfer, catalysis, signal transduction and the control of gene expression. The knowledge of heme binding residues can provide crucial clues to understand these activities and aid in functional annotation, however, insufficient work has been done on the research of heme binding residues from protein sequence information. METHODS:We propose a sequence-based approach for accurate prediction of heme binding residues by a novel integrative sequence profile coupling position specific scoring matrices with heme specific physicochemical properties. In order to select the informative physicochemical properties, we design an intuitive feature selection scheme by combining a greedy strategy with correlation analysis. RESULTS:Our integrative sequence profile approach for prediction of heme binding residues outperforms the conventional methods using amino acid and evolutionary information on the 5-fold cross validation and the independent tests. CONCLUSIONS:The novel feature of an integrative sequence profile achieves good performance using a reduced set of feature vector elements.

SUBMITTER: Xiong Y 

PROVIDER: S-EPMC3380730 | biostudies-literature | 2012 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Prediction of heme binding residues from protein sequences with integrative sequence profiles.

Xiong Yi Y   Liu Juan J   Zhang Wen W   Zeng Tao T  

Proteome science 20120621


<h4>Background</h4>The heme-protein interactions are essential for various biological processes such as electron transfer, catalysis, signal transduction and the control of gene expression. The knowledge of heme binding residues can provide crucial clues to understand these activities and aid in functional annotation, however, insufficient work has been done on the research of heme binding residues from protein sequence information.<h4>Methods</h4>We propose a sequence-based approach for accurat  ...[more]

Similar Datasets

| S-EPMC2709252 | biostudies-literature
| S-EPMC3184988 | biostudies-literature
| S-EPMC2896099 | biostudies-literature
| S-EPMC6612887 | biostudies-literature
| S-EPMC3289083 | biostudies-literature
| S-EPMC5796408 | biostudies-literature
| S-EPMC5773889 | biostudies-literature
| S-EPMC3237601 | biostudies-literature