Unknown

Dataset Information

0

TriPepSVM: de novo prediction of RNA-binding proteins based on short amino acid motifs.


ABSTRACT: In recent years, hundreds of novel RNA-binding proteins (RBPs) have been identified, leading to the discovery of novel RNA-binding domains. Furthermore, unstructured or disordered low-complexity regions of RBPs have been identified to play an important role in interactions with nucleic acids. However, these advances in understanding RBPs are limited mainly to eukaryotic species and we only have limited tools to faithfully predict RNA-binders in bacteria. Here, we describe a support vector machine-based method, called TriPepSVM, for the prediction of RNA-binding proteins. TriPepSVM applies string kernels to directly handle protein sequences using tri-peptide frequencies. Testing the method in human and bacteria, we find that several RBP-enriched tri-peptides occur more often in structurally disordered regions of RBPs. TriPepSVM outperforms existing applications, which consider classical structural features of RNA-binding or homology, in the task of RBP prediction in both human and bacteria. Finally, we predict 66 novel RBPs in Salmonella Typhimurium and validate the bacterial proteins ClpX, DnaJ and UbiG to associate with RNA in vivo.

SUBMITTER: Bressin A 

PROVIDER: S-EPMC6511874 | biostudies-literature | 2019 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

TriPepSVM: de novo prediction of RNA-binding proteins based on short amino acid motifs.

Bressin Annkatrin A   Schulte-Sasse Roman R   Figini Davide D   Urdaneta Erika C EC   Beckmann Benedikt M BM   Marsico Annalisa A  

Nucleic acids research 20190501 9


In recent years, hundreds of novel RNA-binding proteins (RBPs) have been identified, leading to the discovery of novel RNA-binding domains. Furthermore, unstructured or disordered low-complexity regions of RBPs have been identified to play an important role in interactions with nucleic acids. However, these advances in understanding RBPs are limited mainly to eukaryotic species and we only have limited tools to faithfully predict RNA-binders in bacteria. Here, we describe a support vector machin  ...[more]

Similar Datasets

| S-EPMC1524891 | biostudies-literature
| S-EPMC5121330 | biostudies-literature
| S-EPMC5570144 | biostudies-other
| S-EPMC4792456 | biostudies-literature
| S-EPMC3874201 | biostudies-literature
| S-EPMC3072456 | biostudies-literature
| S-EPMC5961109 | biostudies-literature
| S-EPMC6748772 | biostudies-literature
| S-EPMC1955458 | biostudies-literature
| S-EPMC5353594 | biostudies-literature