Dataset Information

Prediction of clustered RNA-binding protein motif sites in the mammalian genome.


ABSTRACT: Sequence-specific interactions of RNA-binding proteins (RBPs) with their target transcripts are essential for post-transcriptional gene expression regulation in mammals. However, accurate prediction of RBP motif sites has been difficult because many RBPs recognize short and degenerate sequences. Here we describe a hidden Markov model (HMM)-based algorithm mCarts to predict clustered functional RBP-binding sites by effectively integrating the number and spacing of individual motif sites, their accessibility in local RNA secondary structures and cross-species conservation. This algorithm learns and quantifies rules of these features, taking advantage of a large number of in vivo RBP-binding sites obtained from cross-linking and immunoprecipitation data. We applied this algorithm to study two representative RBP families, Nova and Mbnl, which regulate tissue-specific alternative splicing through interacting with clustered YCAY and YGCY elements, respectively, and predicted their binding sites in the mouse transcriptome. Despite the low information content in individual motif elements, our algorithm made specific predictions for successful experimental validation. Analysis of predicted sites also revealed cases of extensive and distal RBP-binding sites important for splicing regulation. This algorithm can be readily applied to other RBPs to infer their RNA-regulatory networks. The software is freely available at http://zhanglab.c2b2.columbia.edu/index.php/MCarts.
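The abstract's starting point, counting individual motif sites and looking at their spacing, can be illustrated with a minimal sketch. This is not mCarts and not an HMM: it only scans a sequence for the degenerate YCAY or YGCY elements named above and groups nearby hits with a fixed distance cutoff. The function names, the `max_gap` and `min_sites` parameters, and their default values are assumptions made for illustration; accessibility and conservation are not modeled.

```python
import re

# IUPAC codes needed for the YCAY and YGCY motifs (Y = pyrimidine: C or U/T).
# Both U and T are accepted so RNA and DNA spellings work.
IUPAC = {"A": "A", "C": "C", "G": "G", "U": "[UT]", "T": "[UT]", "Y": "[CUT]"}


def find_motif_sites(seq, motif):
    """Return 0-based start positions of all motif matches, overlaps included."""
    pattern = "".join(IUPAC[base] for base in motif.upper())
    # A lookahead reports overlapping sites such as UCAUCAU -> positions 0 and 3.
    return [m.start() for m in re.finditer(f"(?=({pattern}))", seq.upper())]


def cluster_sites(positions, max_gap=30, min_sites=3):
    """Group sorted positions into clusters whose neighbours are <= max_gap apart.

    max_gap and min_sites are illustrative thresholds (assumptions), not values
    taken from the paper, which learns spacing rules from CLIP data instead.
    """
    clusters, current = [], []
    for pos in positions:
        if current and pos - current[-1] > max_gap:
            if len(current) >= min_sites:
                clusters.append(current)
            current = []
        current.append(pos)
    if len(current) >= min_sites:
        clusters.append(current)
    return clusters


if __name__ == "__main__":
    # Toy sequence: three closely spaced YCAY sites, then one isolated site.
    seq = "UCAUAAUCAUAACCACGG" + "G" * 50 + "UCAU"
    sites = find_motif_sites(seq, "YCAY")
    print(sites)
    print(cluster_sites(sites))
```

A hard distance cutoff like this treats every site equally, which is exactly the limitation the abstract describes for short, degenerate motifs; the published method instead weighs spacing, accessibility and conservation probabilistically.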

SUBMITTER: Zhang C 

PROVIDER: S-EPMC3737533 | biostudies-literature | 2013 Aug

REPOSITORIES: biostudies-literature

Publications

Prediction of clustered RNA-binding protein motif sites in the mammalian genome.

Zhang C, Lee KY, Swanson MS, Darnell RB

Nucleic Acids Research, 2013-05-18, issue 14


Similar Datasets

| S-EPMC2665242 | biostudies-literature
| S-EPMC2922897 | biostudies-literature
| S-EPMC3772059 | biostudies-literature
| S-EPMC3737542 | biostudies-literature
| S-EPMC5127382 | biostudies-literature
| S-EPMC2774325 | biostudies-literature
2018-07-10 | GSE116770 | GEO
| S-EPMC5219607 | biostudies-literature
| S-EPMC6531468 | biostudies-literature
| S-EPMC3278845 | biostudies-other