Unknown

Dataset Information

0

Genome-scale prediction of moonlighting proteins using diverse protein association information.


ABSTRACT: Moonlighting proteins (MPs) show multiple cellular functions within a single polypeptide chain. To understand the overall landscape of their functional diversity, it is important to establish a computational method that can identify MPs on a genome scale. Previously, we have systematically characterized MPs using functional and omics-scale information. In this work, we develop a computational prediction model for automatic identification of MPs using a diverse range of protein association information.We incorporated a diverse range of protein association information to extract characteristic features of MPs, which range from gene ontology (GO), protein-protein interactions, gene expression, phylogenetic profiles, genetic interactions and network-based graph properties to protein structural properties, i.e. intrinsically disordered regions in the protein chain. Then, we used machine learning classifiers using the broad feature space for predicting MPs. Because many known MPs lack some proteomic features, we developed an imputation technique to fill such missing features. Results on the control dataset show that MPs can be predicted with over 98% accuracy when GO terms are available. Furthermore, using only the omics-based features the method can still identify MPs with over 75% accuracy. Last, we applied the method on three genomes: Saccharomyces cerevisiae, Caenorhabditis elegans and Homo sapiens, and found that about 2-10% of proteins in the genomes are potential MPs.Code available at http://kiharalab.org/MPpredictiondkihara@purdue.eduSupplementary data are available at Bioinformatics online.

SUBMITTER: Khan IK 

PROVIDER: S-EPMC4965633 | biostudies-literature | 2016 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genome-scale prediction of moonlighting proteins using diverse protein association information.

Khan Ishita K IK   Kihara Daisuke D  

Bioinformatics (Oxford, England) 20160326 15


<h4>Motivation</h4>Moonlighting proteins (MPs) show multiple cellular functions within a single polypeptide chain. To understand the overall landscape of their functional diversity, it is important to establish a computational method that can identify MPs on a genome scale. Previously, we have systematically characterized MPs using functional and omics-scale information. In this work, we develop a computational prediction model for automatic identification of MPs using a diverse range of protein  ...[more]

Similar Datasets

| S-EPMC4307903 | biostudies-literature
| S-EPMC8019903 | biostudies-literature
| S-EPMC8142502 | biostudies-literature
| PRJEB16758 | ENA
| S-EPMC6602452 | biostudies-literature
| S-EPMC9302406 | biostudies-literature
| S-EPMC6404977 | biostudies-literature
| S-EPMC3664249 | biostudies-literature
| S-EPMC4478894 | biostudies-literature
| S-EPMC6221071 | biostudies-literature