Unknown

Dataset Information

0

HHsvm: fast and accurate classification of profile-profile matches identified by HHsearch.


ABSTRACT:

Motivation

Recently developed profile-profile methods rival structural comparisons in their ability to detect homology between distantly related proteins. Despite this tremendous progress, many genuine relationships between protein families cannot be recognized as comparisons of their profiles result in scores that are statistically insignificant.

Results

Using known evolutionary relationships among protein superfamilies in SCOP database, support vector machines were trained on four sets of discriminatory features derived from the output of HHsearch. Upon validation, it was shown that the automatic classification of all profile-profile matches was superior to fixed threshold-based annotation in terms of sensitivity and specificity. The effectiveness of this approach was demonstrated by annotating several domains of unknown function from the Pfam database.

Availability

Programs and scripts implementing the methods described in this manuscript are freely available from http://hhsvm.dlakiclab.org/.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Dlakic M 

PROVIDER: S-EPMC2913662 | biostudies-literature | 2009 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

HHsvm: fast and accurate classification of profile-profile matches identified by HHsearch.

Dlakić Mensur M  

Bioinformatics (Oxford, England) 20090922 23


<h4>Motivation</h4>Recently developed profile-profile methods rival structural comparisons in their ability to detect homology between distantly related proteins. Despite this tremendous progress, many genuine relationships between protein families cannot be recognized as comparisons of their profiles result in scores that are statistically insignificant.<h4>Results</h4>Using known evolutionary relationships among protein superfamilies in SCOP database, support vector machines were trained on fo  ...[more]

Similar Datasets

| S-EPMC5409309 | biostudies-literature
| S-EPMC9853099 | biostudies-literature
| S-EPMC6395045 | biostudies-literature
| S-EPMC9624007 | biostudies-literature
| S-EPMC8127778 | biostudies-literature
| S-EPMC5034655 | biostudies-literature
| S-EPMC6905549 | biostudies-literature
| S-EPMC4428112 | biostudies-literature
| S-EPMC6104016 | biostudies-literature
| S-EPMC5389551 | biostudies-literature