Unknown

Dataset Information

0

Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model


ABSTRACT: The prediction of protein function is a common topic in the field of bioinformatics. In recent years, advances in machine learning have inspired a growing number of algorithms for predicting protein function. A large number of parameters and fairly complex neural networks are often used to improve the prediction performance, an approach that is time-consuming and costly. In this study, we leveraged traditional features and machine learning classifiers to boost the performance of vesicle transport protein identification and make the prediction process faster. We adopt the pseudo position-specific scoring matrix (PsePSSM) feature and our proposed new classifier hypergraph regularized k-local hyperplane distance nearest neighbour (HG-HKNN) to classify vesicular transport proteins. We address dataset imbalances with random undersampling. The results show that our strategy has an area under the receiver operating characteristic curve (AUC) of 0.870 and a Matthews correlation coefficient (MCC) of 0.53 on the benchmark dataset, outperforming all state-of-the-art methods on the same dataset, and other metrics of our model are also comparable to existing methods.

SUBMITTER: Fan R 

PROVIDER: S-EPMC9326258 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC3703613 | biostudies-literature
| S-EPMC9738408 | biostudies-literature
| S-EPMC8096236 | biostudies-literature
| S-EPMC3931648 | biostudies-literature
| S-EPMC4106157 | biostudies-other
| S-EPMC5489678 | biostudies-literature
| S-EPMC4448799 | biostudies-literature
| S-EPMC7228543 | biostudies-literature
| S-EPMC7689739 | biostudies-literature
| S-EPMC6784847 | biostudies-literature