Unknown

Dataset Information

0

De-novo protein function prediction using DNA binding and RNA binding proteins as a test case.


ABSTRACT: Of the currently identified protein sequences, 99.6% have never been observed in the laboratory as proteins and their molecular function has not been established experimentally. Predicting the function of such proteins relies mostly on annotated homologs. However, this has resulted in some erroneous annotations, and many proteins have no annotated homologs. Here we propose a de-novo function prediction approach based on identifying biophysical features that underlie function. Using our approach, we discover DNA and RNA binding proteins that cannot be identified based on homology and validate these predictions experimentally. For example, FGF14, which belongs to a family of secreted growth factors was predicted to bind DNA. We verify this experimentally and also show that FGF14 is localized to the nucleus. Mutating the predicted binding site on FGF14 abrogated DNA binding. These results demonstrate the feasibility of automated de-novo function prediction based on identifying function-related biophysical features.

SUBMITTER: Peled S 

PROVIDER: S-EPMC5121330 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

De-novo protein function prediction using DNA binding and RNA binding proteins as a test case.

Peled Sapir S   Leiderman Olga O   Charar Rotem R   Efroni Gilat G   Shav-Tal Yaron Y   Ofran Yanay Y  

Nature communications 20161121


Of the currently identified protein sequences, 99.6% have never been observed in the laboratory as proteins and their molecular function has not been established experimentally. Predicting the function of such proteins relies mostly on annotated homologs. However, this has resulted in some erroneous annotations, and many proteins have no annotated homologs. Here we propose a de-novo function prediction approach based on identifying biophysical features that underlie function. Using our approach,  ...[more]

Similar Datasets

| S-EPMC3874201 | biostudies-literature
| S-EPMC6511874 | biostudies-literature
| S-EPMC3907290 | biostudies-literature
| S-EPMC3072456 | biostudies-literature
| S-EPMC1955458 | biostudies-literature
| S-EPMC6293442 | biostudies-literature
| S-EPMC2691844 | biostudies-other
| S-EPMC3360076 | biostudies-literature
| S-EPMC5715010 | biostudies-literature
| S-EPMC10868834 | biostudies-literature