Unknown

Dataset Information

0

SPOTONE: Hot Spots on Protein Complexes with Extremely Randomized Trees via Sequence-Only Features.


ABSTRACT: Protein Hot-Spots (HS) are experimentally determined amino acids, key to small ligand binding and tend to be structural landmarks on protein-protein interactions. As such, they were extensively approached by structure-based Machine Learning (ML) prediction methods. However, the availability of a much larger array of protein sequences in comparison to determined tree-dimensional structures indicates that a sequence-based HS predictor has the potential to be more useful for the scientific community. Herein, we present SPOTONE, a new ML predictor able to accurately classify protein HS via sequence-only features. This algorithm shows accuracy, AUROC, precision, recall and F1-score of 0.82, 0.83, 0.91, 0.82 and 0.85, respectively, on an independent testing set. The algorithm is deployed within a free-to-use webserver at http://moreiralab.com/resources/spotone, only requiring the user to submit a FASTA file with one or more protein sequences.

SUBMITTER: Preto AJ 

PROVIDER: S-EPMC7582262 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

SPOTONE: Hot Spots on Protein Complexes with Extremely Randomized Trees via Sequence-Only Features.

Preto A J AJ   Moreira Irina S IS  

International journal of molecular sciences 20201001 19


Protein Hot-Spots (HS) are experimentally determined amino acids, key to small ligand binding and tend to be structural landmarks on protein-protein interactions. As such, they were extensively approached by structure-based Machine Learning (ML) prediction methods. However, the availability of a much larger array of protein sequences in comparison to determined tree-dimensional structures indicates that a sequence-based HS predictor has the potential to be more useful for the scientific communit  ...[more]

Similar Datasets

| S-EPMC137846 | biostudies-literature
| S-EPMC3176265 | biostudies-literature
| S-EPMC6658830 | biostudies-literature
| S-EPMC3623692 | biostudies-literature
| S-EPMC3595357 | biostudies-literature
| S-EPMC1924567 | biostudies-literature
| S-EPMC3414429 | biostudies-literature
| S-EPMC4995058 | biostudies-literature
| S-EPMC3158149 | biostudies-literature
| S-EPMC5556074 | biostudies-literature