Project description:The ability to design a protein to bind specifically to a target RNA enables numerous applications, with the modular architecture of the PUF domain lending itself to new RNA-binding specificities. For each repeat of the Pumilio-1 PUF domain, we generate a library that contains the 8,000 possible combinations of amino acid substitutions at residues critical for RNA contact. We carry out yeast three-hybrid selections with each library against the RNA recognition sequence for Pumilio-1, with any possible base present at the position recognized by the randomized repeat. We use sequencing to score the binding of each variant, identifying many variants with highly repeat-specific interactions. From these data, we generate an RNA binding code specific to each repeat and base. We use this code to design PUF domains against 16 RNAs, and find that some of these domains recognize RNAs with two, three or four changes from the wild type sequence.
Project description:Specific manipulation of RNA is necessary for the research in biotechnology and medicine. The RNA-binding domains of Pumilio/fem-3 mRNA binding factors (PUF domains) are programmable RNA binding scaffolds used to engineer artificial proteins that specifically modulate RNAs. However, the native PUF domains generally recognize 8-nt RNAs, limiting their applications. Here, we modify the PUF domain of human Pumilio1 to engineer PUFs that recognize RNA targets of different length. The engineered PUFs bind to their RNA targets specifically and PUFs with more repeats have higher binding affinity than the canonical eight-repeat domains; however, the binding affinity reaches the peak at those with 9 and 10 repeats. Structural analysis on PUF with nine repeats reveals a higher degree of curvature, and the RNA binding unexpectedly and dramatically opens the curved structure. Investigation of the residues positioned in between two RNA bases demonstrates that tyrosine and arginine have favored stacking interactions. Further tests on the availability of the engineered PUFs in vitro and in splicing function assays indicate that our engineered PUFs bind RNA targets with high affinity in a programmable way.
Project description:PUF proteins are a conserved group of sequence specific RNA-binding proteins that bind to RNA in a modular fashion. The RNA-binding domain of PUF proteins typically consists of eight clustered Puf repeats. Plant genomes code for large families of PUF proteins that show significant variability in their predicted Puf repeat number, organization, and amino acid sequence. Here we sought to determine whether the observed variability in the RNA-binding domains of four plant PUFs results in a preference for nonclassical PUF RNA target sequences. We report the identification of a novel RNA binding sequence for a nucleolar Arabidopsis PUF protein that contains an atypical RNA-binding domain. The Arabidopsis PUM23 (APUM23) binding sequence was 10 nucleotides in length, contained a centrally located UUGA core element, and had a preferred cytosine at nucleotide position 8. These RNA sequence characteristics differ from those of other PUF proteins, because all natural PUFs studied to date bind to RNAs that contain a conserved UGU sequence at their 5' end and lack specificity for cytosine. Gel mobility shift assays validated the identity of the APUM23 binding sequence and supported the location of 3 of the 10 predicted Puf repeats in APUM23, including the cytosine-binding repeat. The preferred 10-nucleotide sequence bound by APUM23 is present within the 18S rRNA sequence, supporting the known role of APUM23 in 18S rRNA maturation. This work also reveals that APUM23, an ortholog of yeast Nop9, could provide an advanced structural backbone for Puf repeat engineering and target-specific regulation of cellular RNAs.
Project description:PUF proteins, named for Drosophila Pumilio (PUM) and Caenorhabditis elegans fem-3-binding factor (FBF), recognize specific sequences in the mRNAs they bind and control. RNA binding by classical PUF proteins is mediated by a characteristic PUM homology domain (PUM-HD). The Puf1 and Puf2 proteins possess a distinct architecture and comprise a highly conserved subfamily among fungal species. Puf1/Puf2 proteins contain two types of RNA-binding domain: a divergent PUM-HD and an RNA recognition motif (RRM). They recognize RNAs containing UAAU motifs, often in clusters. Here, we report a crystal structure of the PUM-HD of a fungal Puf1 in complex with a dual UAAU motif RNA. Each of the two UAAU tetranucleotides are bound by a Puf1 PUM-HD forming a 2:1 protein-to-RNA complex. We also determined crystal structures of the Puf1 RRM domain that identified a dimerization interface. The PUM-HD and RRM domains act in concert to determine RNA-binding specificity: the PUM-HD dictates binding to UAAU, and dimerization of the RRM domain favors binding to dual UAAU motifs rather than a single UAAU. Cooperative action of the RRM and PUM-HD identifies a new mechanism by which multiple RNA-binding modules in a single protein collaborate to create a unique RNA-binding specificity.
Project description:Pumilio/fem-3 mRNA-binding factor (PUF) proteins possess a recognition code for bases A, U, and G, allowing designed RNA sequence specificity of their modular Pumilio (PUM) repeats. However, recognition side chains in a PUM repeat for cytosine are unknown. Here we report identification of a cytosine-recognition code by screening random amino acid combinations at conserved RNA recognition positions using a yeast three-hybrid system. This C-recognition code is specific and modular as specificity can be transferred to different positions in the RNA recognition sequence. A crystal structure of a modified PUF domain reveals specific contacts between an arginine side chain and the cytosine base. We applied the C-recognition code to design PUF domains that recognize targets with multiple cytosines and to generate engineered splicing factors that modulate alternative splicing. Finally, we identified a divergent yeast PUF protein, Nop9p, that may recognize natural target RNAs with cytosine. This work deepens our understanding of natural PUF protein target recognition and expands the ability to engineer PUF domains to recognize any RNA sequence.
Project description:mRNA control networks depend on recognition of specific RNA sequences. Pumilio-fem-3 mRNA binding factor (PUF) RNA-binding proteins achieve that specificity through variations on a conserved scaffold. Saccharomyces cerevisiae Puf3p achieves specificity through an additional binding pocket for a cytosine base upstream of the core RNA recognition site. Here we demonstrate that this chemically simple adaptation is prevalent and contributes to the diversity of RNA specificities among PUF proteins. Bioinformatics analysis shows that mRNAs associated with Caenorhabditis elegans fem-3 mRNA binding factor (FBF)-2 in vivo contain an upstream cytosine required for biological regulation. Crystal structures of FBF-2 and C. elegans PUF-6 reveal binding pockets structurally similar to that of Puf3p, whereas sequence alignments predict a pocket in PUF-11. For Puf3p, FBF-2, PUF-6, and PUF-11, the upstream pockets and a cytosine are required for maximal binding to RNA, but the quantitative impact on binding affinity varies. Furthermore, the position of the upstream cytosine relative to the core PUF recognition site can differ, which in the case of FBF-2 originally masked the identification of this consensus sequence feature. Importantly, other PUF proteins lack the pocket and so do not discriminate upstream bases. A structure-based alignment reveals that these proteins lack key residues that would contact the cytosine, and in some instances, they also present amino acid side chains that interfere with binding. Loss of the pocket requires only substitution of one serine, as appears to have occurred during the evolution of certain fungal species.
Project description:Pumilio/FBF (PUF) family proteins are found in eukaryotic organisms and regulate gene expression post-transcriptionally by binding to sequences in the 3' untranslated region of target transcripts. PUF proteins contain an RNA binding domain that typically comprises eight alpha-helical repeats, each of which recognizes one RNA base. Some PUF proteins, including yeast Puf4p, have altered RNA binding specificity and use their eight repeats to bind to RNA sequences with nine or ten bases. Here we report the crystal structures of Puf4p alone and in complex with a 9-nucleotide (nt) target RNA sequence, revealing that Puf4p accommodates an 'extra' nucleotide by modest adaptations allowing one base to be turned away from the RNA binding surface. Using structural information and sequence comparisons, we created a mutant Puf4p protein that preferentially binds to an 8-nt target RNA sequence over a 9-nt sequence and restores binding of each protein repeat to one RNA base.
Project description:Pumilio and FBF homology (PUF) proteins represent highly promising candidates for engineering sequence-specific RNA recognition, but were only known to recognize G, A, and U, significantly limiting applications. Two groups (Filipovska et al., 2011; Dong et al., 2011) have now reported the discovery of the cytosine-recognition code for PUF proteins.
Project description:The double-stranded RNA-binding domain (dsRBD) is a common RNA-binding motif found in many proteins involved in RNA maturation and localization. To determine how this domain recognizes RNA, we have studied the third dsRBD from Drosophila Staufen. The domain binds optimally to RNA stem-loops containing 12 uninterrupted base pairs, and we have identified the amino acids required for this interaction. By mutating these residues in a staufen transgene, we show that the RNA-binding activity of dsRBD3 is required in vivo for Staufen-dependent localization of bicoid and oskar mRNAs. Using high-resolution NMR, we have determined the structure of the complex between dsRBD3 and an RNA stem-loop. The dsRBD recognizes the shape of A-form dsRNA through interactions between conserved residues within loop 2 and the minor groove, and between loop 4 and the phosphodiester backbone across the adjacent major groove. In addition, helix alpha1 interacts with the single-stranded loop that caps the RNA helix. Interactions between helix alpha1 and single-stranded RNA may be important determinants of the specificity of dsRBD proteins.