Unknown

Dataset Information

0

Prediction and characterization of human ageing-related proteins by using machine learning.


ABSTRACT: Ageing has a huge impact on human health and economy, but its molecular basis - regulation and mechanism - is still poorly understood. By today, more than three hundred genes (almost all of them function as protein-coding genes) have been related to human ageing. Although individual ageing-related genes or some small subsets of these genes have been intensively studied, their analysis as a whole has been highly limited. To fill this gap, for each human protein we extracted 21000 protein features from various databases, and using these data as an input to state-of-the-art machine learning methods, we classified human proteins as ageing-related or non-ageing-related. We found a simple classification model based on only 36 protein features, such as the "number of ageing-related interaction partners", "response to oxidative stress", "damaged DNA binding", "rhythmic process" and "extracellular region". Predicted values of the model quantify the relevance of a given protein in the regulation or mechanisms of the human ageing process. Furthermore, we identified new candidate proteins having strong computational evidence of their important role in ageing. Some of them, like Cytochrome b-245 light chain (CY24A) and Endoribonuclease ZC3H12A (ZC12A) have no previous ageing-associated annotations.

SUBMITTER: Kerepesi C 

PROVIDER: S-EPMC5840292 | biostudies-literature | 2018 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Prediction and characterization of human ageing-related proteins by using machine learning.

Kerepesi Csaba C   Daróczy Bálint B   Sturm Ádám Á   Vellai Tibor T   Benczúr András A  

Scientific reports 20180306 1


Ageing has a huge impact on human health and economy, but its molecular basis - regulation and mechanism - is still poorly understood. By today, more than three hundred genes (almost all of them function as protein-coding genes) have been related to human ageing. Although individual ageing-related genes or some small subsets of these genes have been intensively studied, their analysis as a whole has been highly limited. To fill this gap, for each human protein we extracted 21000 protein features  ...[more]

Similar Datasets

2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
2013-01-01 | GSE29210 | GEO
| S-EPMC10004995 | biostudies-literature
| S-EPMC8804200 | biostudies-literature
| S-EPMC8845408 | biostudies-literature
| S-EPMC10683244 | biostudies-literature
| S-EPMC10057777 | biostudies-literature
| S-EPMC7463567 | biostudies-literature
| S-EPMC6241126 | biostudies-other
| S-ECPF-GEOD-29210 | biostudies-other