Dataset Information

Prediction of unconventional protein secretion by exosomes.


ABSTRACT:

Motivation

In eukaryotes, proteins targeted for secretion contain a signal peptide, which allows them to proceed through the conventional ER/Golgi-dependent pathway. However, a substantial number of proteins lacking a signal peptide can be secreted through unconventional routes, including the route mediated by exosomes. Currently, no method is available to predict protein secretion via exosomes.

Results

Here, we first assembled a dataset including the sequences of 2992 proteins secreted by exosomes and 2961 proteins that are not secreted by exosomes. Subsequently, we trained different random forest models on feature vectors derived from the sequences in this dataset. In tenfold cross-validation, the best model was trained on dipeptide composition, reaching an accuracy of 69.88% ± 2.08 and an area under the curve (AUC) of 0.76 ± 0.03. In an independent dataset, this model reached an accuracy of 75.73% and an AUC of 0.840. Based on these results, we developed ExoPred, a web-based tool that uses random forests to predict protein secretion by exosomes.
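
As an illustration of the dipeptide composition features mentioned above, the following is a minimal sketch and not the ExoPred implementation. It assumes the common definition of dipeptide composition: the relative frequency of each of the 400 ordered pairs of the 20 standard amino acids among the adjacent residue pairs of a sequence. The function name `dipeptide_composition`, the skipping of pairs containing non-standard residues, and the example sequence are all assumptions made for this sketch.

```python
from itertools import product

# The 20 standard amino acids (one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered dipeptides, in a fixed order so every vector is comparable.
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(sequence):
    """Return a 400-element list of dipeptide frequencies for one sequence.

    Assumption of this sketch: adjacent pairs containing a non-standard
    residue (for example X or B) are skipped and not counted in the total.
    """
    seq = sequence.upper()
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        # Sequence too short or without valid pairs: return an all-zero vector.
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


# Hypothetical example sequence, not taken from the ExoPred datasets.
vector = dipeptide_composition("MKTAYIAKQR")
print(len(vector))
```

Vectors of this kind, one per protein, could then be passed with exosome/non-exosome labels to a random forest classifier and evaluated by tenfold cross-validation, as the abstract describes.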

Conclusion

ExoPred is available for free public use at http://imath.med.ucm.es/exopred/ . Datasets are available at http://imath.med.ucm.es/exopred/datasets/ .

SUBMITTER: Ras-Carmona A 

PROVIDER: S-EPMC8210391 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

2020-09-13 | GSE157864 | GEO
| S-EPMC3680731 | biostudies-other
| S-EPMC6923414 | biostudies-literature
| S-EPMC6162777 | biostudies-literature
| S-EPMC7030921 | biostudies-literature
| S-EPMC4848490 | biostudies-literature
| S-EPMC9127312 | biostudies-literature
2023-03-11 | PXD031113 | Pride
| PRJNA663090 | ENA
| S-EPMC9284464 | biostudies-literature