Project description:Intrinsically disordered proteins (IDPs) attract the attention of many researchers engaged in protein structure analysis. The main criteria used to identify them are a lack of secondary structure and significant structural variability. This variability takes forms that cannot be resolved by X-ray crystallography. In the present study, different criteria were used to assess the status of IDPs and of their fragments recognized as intrinsically disordered regions (IDRs). The status of the hydrophobic core in proteins identified as IDPs and in their complexes was assessed. The status of IDRs as components of the ordering structure resulting from the construction of the hydrophobic core was also assessed. The hydrophobic core is understood as a structure encompassing the entire molecule: a centrally located high concentration of hydrophobicity surrounded by a shell in which hydrophobicity gradually decreases until it reaches a level close to zero on the protein surface. The model assumes that protein folding follows a micellization pattern that exposes polar residues on the surface while isolating hydrophobic amino acids from the polar aqueous environment. Modeling the hydrophobicity distribution in a protein as a 3D Gaussian distribution spanning the protein molecule makes it possible to assess the degree of similarity to the assumed micelle-like distribution and to identify deviations and mismatches between the actual and the idealized distribution. The FOD (fuzzy oil drop) model and its modified FOD-M version allow for the quantitative assessment of these differences and of the relationship of these areas to protein function. In the present work, the IDR sections in protein complexes classified as IDPs are analyzed.
The classification "disordered" in the structural sense (lack of secondary structure or high flexibility) does not always entail a mismatch with the structure of the hydrophobic core. In particular, the interface area, which often consists of IDRs, shows in many of the analyzed complexes a hydrophobicity distribution that agrees with the idealized distribution. This shows that matching the structure of the hydrophobic core does not require secondary structure ordering.
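The comparison between an observed and an idealized hydrophobicity distribution can be illustrated with a minimal sketch. This is not the authors' FOD or FOD-M implementation: the function names, the choice of sigma (one third of the largest extent along each axis), the assumption that the coordinates are already aligned with the molecule's principal axes, and the use of Kullback-Leibler divergence as the mismatch measure are all assumptions made for illustration.

```python
import numpy as np

def idealized_hydrophobicity(coords, sigma=None):
    """Normalized 3D Gaussian evaluated at residue positions (idealized profile)."""
    coords = np.asarray(coords, dtype=float)
    centered = coords - coords.mean(axis=0)   # place the Gaussian at the geometric centre
    if sigma is None:
        # Assumption: one sigma per axis, a third of the largest extent along that axis.
        sigma = np.abs(centered).max(axis=0) / 3.0
    sigma = np.maximum(np.asarray(sigma, dtype=float), 1e-9)  # avoid division by zero
    t = np.exp(-0.5 * ((centered / sigma) ** 2).sum(axis=1))
    return t / t.sum()

def kl_divergence(observed, idealized, eps=1e-12):
    """Kullback-Leibler divergence (in bits) of the observed from the idealized profile."""
    p = np.asarray(observed, dtype=float) + eps
    q = np.asarray(idealized, dtype=float) + eps
    p = p / p.sum()
    q = q / q.sum()
    return float(np.sum(p * np.log2(p / q)))

# Hypothetical residue positions: one central residue and six on the surface.
coords = [[0, 0, 0], [3, 0, 0], [-3, 0, 0], [0, 3, 0], [0, -3, 0], [0, 0, 3], [0, 0, -3]]
ideal = idealized_hydrophobicity(coords)
flat = np.full(len(coords), 1.0 / len(coords))  # an observed profile with no core at all
mismatch = kl_divergence(flat, ideal)
```

A larger `mismatch` means the observed profile departs further from the micelle-like one; applying the same comparison to a subset of residues (for example an IDR or an interface) would require renormalizing both profiles over that subset.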
Project description:Structural disorder in proteins arises from a complex interplay between weak hydrophobicity and unfavorable electrostatic interactions. The extent to which the hydrophobic effect contributes to the unique and compact native state of proteins is, however, confounded by large compensation between multiple entropic and energetic terms. Here we show that protein structural order and cooperativity arise as emergent properties upon hydrophobic substitutions in a disordered system with non-intuitive effects on folding and function. Aided by sequence-structure analysis, equilibrium, and kinetic spectroscopic studies, we engineer two hydrophobic mutations in the disordered DNA-binding domain of CytR that act synergistically, but not in isolation, to promote structure, compactness, and stability. The double mutant, with properties of a fully ordered domain, exhibits weak cooperativity with a complex and rugged conformational landscape. The mutant, however, binds cognate DNA with an affinity only marginally higher than that of the wild type, though nontrivial differences are observed in the binding to noncognate DNA. Our work provides direct experimental evidence of the dominant role of non-additive hydrophobic effects in shaping the molecular evolution of order in disordered proteins and vice versa, which could be generalized to even folded proteins with implications for protein design and functional manipulation.
Project description:The conformational flexibility of natural proteins makes the relationship between structure and function both complex and difficult to understand. Prediction of intrinsically disordered proteins focuses primarily on revealing regions of structural flexibility that are involved in relevant biological functions and various diseases. The order of amino acids in a protein sequence determines its possible conformations, folding flexibility and biological function. Although many methods provide information about intrinsically disordered proteins (IDPs), their results are mainly limited to locating the disordered regions, without knowledge of the possible folding conformations. Here, the developed protein folding fingerprint uses the protein folding variation matrix (PFVM) to reveal all possible folding patterns along the sequence of an intrinsically disordered protein. The PFVM presents, in an integrated way, the disordered regions, the degree of disorder and the folding patterns of an intrinsically disordered protein. The PFVM not only provides rich information about IDPs but may also promote the study of the protein folding problem.
Project description:Reversible protein phosphorylation provides a major regulatory mechanism in eukaryotic cells. Due to the high variability of amino acid residues flanking a relatively limited number of experimentally identified phosphorylation sites, reliable prediction of such sites still remains an important issue. Here we report the development of a new web-based tool for the prediction of protein phosphorylation sites, DISPHOS (DISorder-enhanced PHOSphorylation predictor, http://www.ist.temple.edu/DISPHOS). We observed that amino acid compositions, sequence complexity, hydrophobicity, charge and other sequence attributes of regions adjacent to phosphorylation sites are very similar to those of intrinsically disordered protein regions. Thus, DISPHOS uses position-specific amino acid frequencies and disorder information to improve the discrimination between phosphorylation and non-phosphorylation sites. Based on the estimates of phosphorylation rates in various protein categories, the outputs of DISPHOS are adjusted in order to reduce the total number of misclassified residues. When tested on an equal number of phosphorylated and non-phosphorylated residues, the accuracy of DISPHOS reaches 76% for serine, 81% for threonine and 83% for tyrosine. The significant enrichment in disorder-promoting residues surrounding phosphorylation sites, together with the results obtained by applying DISPHOS to various protein functional classes and proteomes, provides strong support for the hypothesis that protein phosphorylation predominantly occurs within intrinsically disordered protein regions.
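To make the idea of position-specific amino acid frequencies concrete, the following sketch counts residues at each offset around a set of candidate sites. It is a toy illustration, not the DISPHOS feature pipeline, which the abstract does not describe; the function name, the window size `flank` and the example sequences are hypothetical.

```python
from collections import Counter

def position_specific_frequencies(sequences, sites, flank=3):
    """Per-offset residue frequencies in windows centred on the given sites.

    sequences: list of protein sequences (strings)
    sites:     list of lists of 0-based site indices, one list per sequence
    """
    counts = {offset: Counter() for offset in range(-flank, flank + 1)}
    for seq, positions in zip(sequences, sites):
        for pos in positions:
            for offset in counts:
                i = pos + offset
                if 0 <= i < len(seq):      # skip offsets that fall off the sequence ends
                    counts[offset][seq[i]] += 1
    freqs = {}
    for offset, counter in counts.items():
        total = sum(counter.values())
        freqs[offset] = {aa: n / total for aa, n in counter.items()} if total else {}
    return freqs

# Hypothetical sequences, each with a serine at index 2.
example = position_specific_frequencies(["AASPK", "GGSPR"], [[2], [2]], flank=1)
```

Computing the same table for phosphorylated and non-phosphorylated sites and comparing the two is one simple way to see which flanking positions carry signal.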
Project description:Intrinsically disordered proteins often form dynamic complexes with their ligands. Yet, the speed and amplitude of these motions are hidden in classical binding kinetics. Here, we directly measure the dynamics in an exceptionally mobile, high-affinity complex. We show that the disordered tail of the cell adhesion protein E-cadherin dynamically samples a large surface area of the protooncogene β-catenin. Single-molecule experiments and molecular simulations resolve these motions with high resolution in space and time. Contacts break and form within hundreds of microseconds without a dissociation of the complex. The energy landscape of this complex is rugged with many small barriers (3 to 4 kBT) and reconciles specificity, high affinity, and extreme disorder. A few persistent contacts provide specificity, whereas unspecific interactions boost affinity.
Project description:We describe an approach for the development of fluorescent sensors of metabolite binding in which a genetically encoded fluorescent non-canonical amino acid (fNCAA) containing a 7-hydroxycoumarin moiety (7-HCAA) forms a FRET pair with native tryptophan residues. Although previous studies demonstrated the potential for using 7-HCAA as an acceptor for tryptophan, this approach has not yet been explored within a single protein containing multiple tryptophan residues. A structure-based analysis of a hexokinase enzyme with multiple native tryptophan residues identified glutamate 50 as a potential site of 7-HCAA incorporation; Glu50 moves closer to the native tryptophans upon substrate binding. Substitution of 7-HCAA at residue 50 led to an increase in FRET efficiency in the presence of the substrate; this effect was not observed in a control protein where no change in distance between 7-HCAA and the native tryptophans occurs on substrate binding. This system was then used to directly observe differences in binding affinity of the hexokinase that occur at a number of pH values. Our approach builds on previous research in that it eliminates the need for the incorporation of multiple fNCAAs or fluorescent labels within a target protein and can be used to study substrate binding with native ligands. As such, it serves to expand the versatility of FRET-based techniques.
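The link between donor-acceptor distance and FRET efficiency that underlies this design can be sketched with the standard Förster relation, E = 1 / (1 + (r/R0)^6). The distances and the Förster radius used below are hypothetical placeholders, not values reported for the tryptophan/7-HCAA pair or for the hexokinase studied here.

```python
def fret_efficiency(distance, forster_radius):
    """FRET efficiency from the Förster relation E = 1 / (1 + (r / R0)**6).

    distance and forster_radius must be in the same length unit.
    """
    if distance < 0 or forster_radius <= 0:
        raise ValueError("distance must be >= 0 and forster_radius must be > 0")
    return 1.0 / (1.0 + (distance / forster_radius) ** 6)

# Hypothetical numbers: the acceptor moves closer to the donor on substrate binding.
R0 = 25.0                                 # assumed Förster radius
e_unbound = fret_efficiency(30.0, R0)     # larger separation, lower efficiency
e_bound = fret_efficiency(22.0, R0)       # smaller separation, higher efficiency
```

Because efficiency depends on the sixth power of the distance ratio, even a modest decrease in separation near R0 produces a clear increase in signal, which is consistent with the increase in FRET efficiency described on substrate binding.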
Project description:Accurate prediction of the binding affinities of small-molecule ligands to their biological targets is fundamental for structure-based drug design but remains a very challenging task. In this paper, we have performed computational studies to predict the binding models of 31 small-molecule Smac (the second mitochondria-derived activator of caspase) mimetics to their target, the XIAP (X-linked inhibitor of apoptosis) protein, and their binding affinities. Our results showed that computational docking was able to reliably predict the binding models, as confirmed by experimentally determined crystal structures of some Smac mimetics complexed with XIAP. However, all the computational methods we have tested, including an empirical scoring function, two knowledge-based scoring functions, and MM-GBSA (molecular mechanics and generalized Born surface area), yield poor to modest prediction for binding affinities. The linear correlation coefficient (r²) value between the predicted affinities and the experimentally determined affinities was found to be between 0.21 and 0.36. Inclusion of ensemble protein-ligand conformations obtained from molecular dynamics simulations did not significantly improve the prediction. However, major improvement was achieved when the free-energy change for ligands between their free and bound states, or "ligand-reorganization free energy", was included in the MM-GBSA calculation, and the r² value increased from 0.36 to 0.66. The prediction was validated using 10 additional Smac mimetics designed and evaluated by an independent group. This study demonstrates that ligand reorganization free energy plays an important role in the overall binding free energy between Smac mimetics and XIAP. This term should be evaluated for other ligand-protein systems and included in the development of new scoring functions.
To the best of our knowledge, this is the first computational study to demonstrate the importance of ligand reorganization free energy for the prediction of protein-ligand binding free energy.
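The way such a correlation is evaluated, and how an extra energy term can change it, can be sketched as follows. The squared Pearson correlation is used for r², and the ligand reorganization term is simply added to the score; both choices, the function names, and all numbers are assumptions for illustration and are not taken from the study.

```python
import numpy as np

def r_squared(predicted, experimental):
    """Squared Pearson correlation between predicted and experimental values."""
    r = np.corrcoef(predicted, experimental)[0, 1]
    return float(r ** 2)

def add_ligand_reorganization(scores, reorganization_energies):
    """Assumed additive correction: score plus the ligand's free-to-bound penalty."""
    return np.asarray(scores, dtype=float) + np.asarray(reorganization_energies, dtype=float)

# Synthetic data (kcal/mol), constructed so that the correction matters.
scores = np.array([-10.0, -9.0, -8.0, -7.0])      # hypothetical uncorrected scores
reorganization = np.array([4.0, 1.0, 3.0, 0.0])   # hypothetical reorganization penalties
experimental = np.array([-6.0, -8.0, -5.0, -7.0]) # hypothetical measured affinities

before = r_squared(scores, experimental)
after = r_squared(add_ligand_reorganization(scores, reorganization), experimental)
```

In this constructed example `after` is larger than `before` by design; with real data the size of the improvement depends on how much the reorganization penalty varies across the ligand series.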
Project description:Intrinsically disordered proteins (IDPs) were found to be widely associated with human diseases and may serve as potential drug design targets. However, drug design targeting IDPs is still in the very early stages. Progress in drug design is usually achieved using experimental screening; however, the structural disorder of IDPs makes it difficult to characterize their interaction with ligands using experiments alone. To better understand the structure of IDPs and their interactions with small molecule ligands, we performed extensive simulations on the c-Myc peptide and its binding to a reported small molecule inhibitor, ligand 10074-A4. We found that the conformational space of the apo c-Myc peptide was rather dispersed and that the conformations of the peptide were stabilized mainly by charge interactions and hydrogen bonds. Under the binding of the ligand, c-Myc remained disordered. The ligand was found to bind to c-Myc at different sites along the chain and behaved like a 'ligand cloud'. In contrast to ligand binding to more rigid target proteins that usually results in a dominant bound structure, ligand binding to IDPs may better be described as ligand clouds around protein clouds. Nevertheless, the binding of the ligand and a non-ligand to the c-Myc target could be clearly distinguished. The present study provides insights that will help improve rational drug design that targets IDPs.
Project description:Epistasis complicates our understanding of protein sequence-function relationships and impedes our ability to build accurate predictive models for novel genotypes. Although pairwise epistasis has been extensively studied in proteins, the significance of higher-order epistasis for protein sequence-function relationships remains contentious, largely due to challenges in fitting higher-order epistatic interactions for full-length proteins. Here, we introduce a novel transformer-based approach. The key feature of our method is that we can adjust the order of interactions fit by the model by changing the number of attention layers while also accounting for any global nonlinearity induced by the experimental conditions. This allows us to test whether inclusion of higher-order interactions leads to enhanced model performance. Applying our method to 10 large protein sequence-function datasets, we found that the importance of higher-order epistasis differs substantially between proteins, accounting for up to 60% of the total variance attributed to epistasis. We also found that including higher-order epistasis is particularly important for generalizing locally sampled fitness data to distant regions of sequence space and for modeling an additional multi-peak fitness landscape derived from combining mutagenesis data from 4 orthologous green fluorescent proteins. Our findings suggest that higher-order epistasis often does play an important role in protein sequence-function relationships, and thus should be properly incorporated during protein engineering and evolutionary data analysis.
Project description:Intrinsically disordered proteins (IDPs) carry out important biological functions and offer an instructive model system for folding and binding studies. However, their structural characterization in the absence of interactors is hindered by their highly dynamic conformation. The cyclin-dependent-kinase inhibitor (Cki) Sic1 from Saccharomyces cerevisiae is a key regulator of the yeast cell cycle, which controls entrance into S phase and coordination between cell growth and proliferation. Its last 70 out of 284 residues display functional and structural homology to the inhibitory domain of mammalian p21 and p27. Sic1 has escaped systematic structural characterization until now. Here, complementary biophysical methods are applied to the study of the conformational properties of pure Sic1 in solution. Based on sequence analysis, gel filtration, circular dichroism (CD), electrospray-ionization mass spectrometry (ESI-MS), and limited proteolysis, it can be concluded that the whole molecule exists in a highly disordered state and can, therefore, be classified as an IDP. At the same time, however, the results of these experiments indicate that the protein retains some secondary and tertiary structure, with properties similar to those of molten globules or premolten globules. Proteolysis-hypersensitive sites cluster at the N-terminus and in the middle of the molecule, whereas the most structured region resides at the C-terminus, including part of the inhibitory domain and the casein-kinase-2 (CK2) phosphorylation target S201. The mutations S201A and S201E, which are known to affect Sic1 function, do not have significant effects on the conformational properties of the pure protein.