Unknown

Dataset Information

0

IDBPs: a web server for the identification of DNA binding proteins.


ABSTRACT: SUMMARY: The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. AVAILABILITY: http://idbps.tau.ac.il/

SUBMITTER: Nimrod G 

PROVIDER: S-EPMC2828122 | biostudies-literature | 2010 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

iDBPs: a web server for the identification of DNA binding proteins.

Nimrod Guy G   Schushan Maya M   Szilágyi András A   Leslie Christina C   Ben-Tal Nir N  

Bioinformatics (Oxford, England) 20100119 5


<h4>Summary</h4>The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potent  ...[more]

Similar Datasets

| S-EPMC4987955 | biostudies-literature
| S-EPMC4086114 | biostudies-literature
| S-EPMC4086085 | biostudies-literature
| S-EPMC3394329 | biostudies-literature
| S-EPMC2703923 | biostudies-literature
| S-EPMC1160188 | biostudies-literature
| S-EPMC3125782 | biostudies-literature
| S-EPMC4184157 | biostudies-literature
| S-EPMC4593602 | biostudies-literature
| S-EPMC3125764 | biostudies-literature