Unknown

Dataset Information

0

ECMPride: prediction of human extracellular matrix proteins based on the ideal dataset using hybrid features with domain evidence.


ABSTRACT: Extracellular matrix (ECM) proteins play an essential role in various biological processes in multicellular organisms, and their abnormal regulation can lead to many diseases. For large-scale ECM protein identification, especially through proteomic-based techniques, a theoretical reference database of ECM proteins is required. In this study, based on the experimentally verified ECM datasets and by the integration of protein domain features and a machine learning model, we developed ECMPride, a flexible and scalable tool for predicting ECM proteins. ECMPride achieved excellent performance in predicting ECM proteins, with appropriate balanced accuracy and sensitivity, and the performance of ECMPride was shown to be superior to the previously developed tool. A new theoretical dataset of human ECM components was also established by applying ECMPride to all human entries in the SwissProt database, containing a significant number of putative ECM proteins as well as the abundant biological annotations. This dataset might serve as a valuable reference resource for ECM protein identification.

SUBMITTER: Liu B 

PROVIDER: S-EPMC7195829 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

ECMPride: prediction of human extracellular matrix proteins based on the ideal dataset using hybrid features with domain evidence.

Liu Binghui B   Leng Ling L   Sun Xuer X   Wang Yunfang Y   Ma Jie J   Zhu Yunping Y  

PeerJ 20200429


Extracellular matrix (ECM) proteins play an essential role in various biological processes in multicellular organisms, and their abnormal regulation can lead to many diseases. For large-scale ECM protein identification, especially through proteomic-based techniques, a theoretical reference database of ECM proteins is required. In this study, based on the experimentally verified ECM datasets and by the integration of protein domain features and a machine learning model, we developed ECMPride, a f  ...[more]

Similar Datasets

| S-EPMC4334504 | biostudies-literature
2014-01-18 | E-MEXP-3924 | biostudies-arrayexpress
| S-EPMC5563743 | biostudies-literature
| S-EPMC3346385 | biostudies-literature
| S-EPMC7696558 | biostudies-literature
| S-EPMC10301747 | biostudies-literature
| S-EPMC2529034 | biostudies-literature
| S-EPMC2168726 | biostudies-literature
| S-EPMC8552119 | biostudies-literature
| S-EPMC7940610 | biostudies-literature