Dataset Information

A novel neural response algorithm for protein function prediction.


ABSTRACT: BACKGROUND: High-throughput genome sequencing methods generate large amounts of data, but experimental functional characterization lags far behind. To close the gap between the number of sequences and their annotations, fast and accurate automated annotation methods are required. Many methods, such as GOblet, GOFigure, and Gotcha, are based on BLAST searches. Unfortunately, their sequence coverage is low because they cannot detect remote homologues. This, together with a lack of annotation specificity, calls for improved automated protein function prediction. RESULTS: We designed a novel automated protein function assignment method based on the neural response algorithm, which simulates the neuronal behavior of the visual cortex in the human brain. We first predict the most similar target protein for a given query protein and then assign the target's GO term to the query sequence. When assessed on a test set, our method ranked the actual leaf GO term among the top 5 probable GO terms with an accuracy of 86.93%. CONCLUSIONS: The proposed algorithm is the first application of the neural response algorithm in the biological domain. Using HMM profiles together with secondary structure information to define the neural response gives our method an edge over other available methods in annotation accuracy. Results of 5-fold cross-validation and a comparison with the PFP and FFPred servers indicate strong performance by our method. The program, the dataset, and help files are available at http://www.jjwanglab.org/NRProF/.
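The annotation-transfer step and the top-5 evaluation described in the abstract can be illustrated with a minimal sketch. This is not the NRProF implementation: the similarity function below is a toy 3-mer overlap used as a stand-in for the neural response score (which the abstract says is built from HMM profiles and secondary structure), and all function names, sequences, and GO labels are hypothetical.

```python
from typing import Callable, Dict, List, Tuple


def shared_3mer_fraction(a: str, b: str) -> float:
    """Toy placeholder similarity (assumption): Jaccard index of 3-mers.

    The real method uses a neural response score, not this measure.
    """
    kmers_a = {a[i:i + 3] for i in range(len(a) - 2)}
    kmers_b = {b[i:i + 3] for i in range(len(b) - 2)}
    if not kmers_a or not kmers_b:
        return 0.0
    return len(kmers_a & kmers_b) / len(kmers_a | kmers_b)


def rank_go_terms(query: str,
                  annotated: Dict[str, str],
                  similarity: Callable[[str, str], float],
                  k: int = 5) -> List[str]:
    """Return up to k distinct GO terms, ordered by the best similarity
    between the query and any annotated protein carrying that term."""
    best: Dict[str, float] = {}
    for seq, go_term in annotated.items():
        score = similarity(query, seq)
        if go_term not in best or score > best[go_term]:
            best[go_term] = score
    return sorted(best, key=lambda term: best[term], reverse=True)[:k]


def top_k_accuracy(test_set: List[Tuple[str, str]],
                   annotated: Dict[str, str],
                   similarity: Callable[[str, str], float],
                   k: int = 5) -> float:
    """Fraction of test proteins whose true GO term is among the top k."""
    hits = sum(
        1 for seq, true_term in test_set
        if true_term in rank_go_terms(seq, annotated, similarity, k)
    )
    return hits / len(test_set)


# Hypothetical annotated proteins (sequence -> GO label); not real data.
annotated = {
    "MKVLAAGIV": "GO:term_A",
    "PPGQWERTY": "GO:term_B",
}
# Hypothetical held-out proteins with their true labels.
test_set = [("MKVLAAGLV", "GO:term_A"), ("PPGQWERTT", "GO:term_B")]

ranking = rank_go_terms("MKVLAAGLV", annotated, shared_3mer_fraction)
accuracy = top_k_accuracy(test_set, annotated, shared_3mer_fraction, k=1)
```

Swapping `shared_3mer_fraction` for a stronger similarity measure leaves the ranking and evaluation code unchanged, which is why the similarity is passed in as a parameter.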

SUBMITTER: Yalamanchili HK 

PROVIDER: S-EPMC3403322 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

Publications

A novel neural response algorithm for protein function prediction.

Yalamanchili HK, Xiao QW, Wang J

BMC Systems Biology, 2012-07-16


Similar Datasets

| S-EPMC3562060 | biostudies-literature
| S-EPMC6151571 | biostudies-literature
| S-EPMC4132322 | biostudies-other
| S-EPMC8294856 | biostudies-literature
| S-EPMC7242519 | biostudies-literature
2022-05-16 | GSE189510 | GEO
| S-EPMC3720179 | biostudies-literature
| S-EPMC7832895 | biostudies-literature
| S-EPMC5144062 | biostudies-literature
| S-EPMC4687927 | biostudies-literature