Project description:Many cellular functions involve protein complexes that are formed by multiple interacting proteins. Tandem Affinity Purification (TAP) is a popular experimental method for detecting such multi-protein interactions. However, current computational methods that predict protein complexes from TAP data require converting the co-complex relationships in TAP data into binary interactions. The resulting pairwise protein-protein interaction (PPI) network is then mined for densely connected regions that are identified as putative protein complexes. Converting the TAP data into PPI data not only introduces errors but also loses useful information about the underlying multi-protein relationships that can be exploited to detect the internal organization (i.e., core-attachment structures) of protein complexes. In this article, we propose a method called CACHET that detects protein complexes with Core-AttaCHment structures directly from bipartitETAP data. CACHET models the TAP data as a bipartite graph in which the two vertex sets are the baits and the preys, respectively. The edges between the two vertex sets represent bait-prey relationships. CACHET first focuses on detecting high-quality protein-complex cores from the bipartite graph. To minimize the effects of false positive interactions, the bait-prey relationships are indexed with reliability scores. Only non-redundant, reliable bicliques computed from the TAP bipartite graph are regarded as protein-complex cores. CACHET constructs protein complexes by including attachment proteins into the cores. We applied CACHET on large-scale TAP datasets and found that CACHET outperformed existing methods in terms of prediction accuracy (i.e., F-measure and functional homogeneity of predicted complexes). In addition, the protein complexes predicted by CACHET are equipped with core-attachment structures that provide useful biological insights into the inherent functional organization of protein complexes. Our supplementary material can be found at http://www1.i2r.a-star.edu.sg/~xlli/CACHET/CACHET.htm ; binary executables can also be found there. Supplementary Material is also available at www.liebertonline.com/cmb.
Project description:In eukaryotic cells, mitochondria host ancient essential bioenergetic and biosynthetic pathways. LYR (leucine/tyrosine/arginine) motif proteins (LYRMs) of the Complex1_LYR-like superfamily interact with protein complexes of bacterial origin. Many LYR proteins function as extra subunits (LYRM3 and LYRM6) or novel assembly factors (LYRM7, LYRM8, ACN9 and FMC1) of the oxidative phosphorylation (OXPHOS) core complexes. Structural insights into complex I accessory subunits LYRM6 and LYRM3 have been provided by analyses of EM and X-ray structures of complex I from bovine and the yeast Yarrowia lipolytica, respectively. Combined structural and biochemical studies revealed that LYRM6 resides at the matrix arm close to the ubiquinone reduction site. For LYRM3, a position at the distal proton-pumping membrane arm facing the matrix space is suggested. Both LYRMs are supposed to anchor an acyl-carrier protein (ACPM) independently to complex I. The function of this duplicated protein interaction of ACPM with respiratory complex I is still unknown. Analysis of protein-protein interaction screens, genetic analyses and predicted multi-domain LYRMs offer further clues on an interaction network and adaptor-like function of LYR proteins in mitochondria.
Project description:Termination and ribosome recycling are essential processes in translation. In eukaryotes, a stop codon in the ribosomal A site is decoded by a ternary complex consisting of release factors eRF1 and guanosine triphosphate (GTP)-bound eRF3. After GTP hydrolysis, eRF3 dissociates, and ABCE1 can bind to eRF1-loaded ribosomes to stimulate peptide release and ribosomal subunit dissociation. Here, we present cryoelectron microscopic (cryo-EM) structures of a pretermination complex containing eRF1-eRF3 and a termination/prerecycling complex containing eRF1-ABCE1. eRF1 undergoes drastic conformational changes: its central domain harboring the catalytically important GGQ loop is either packed against eRF3 or swung toward the peptidyl transferase center when bound to ABCE1. Additionally, in complex with eRF3, the N-terminal domain of eRF1 positions the conserved NIKS motif proximal to the stop codon, supporting its suggested role in decoding, yet it appears to be delocalized in the presence of ABCE1. These results suggest that stop codon decoding and peptide release can be uncoupled during termination.
Project description:The operon coding for a respiratory quinol oxidase was cloned from thermoacidophilic archaebacterium Sulfolobus acidocaldarius. It contains three genes, soxA, soxB and soxC. The first two genes code for proteins related to the cytochrome c oxidase subunits II and I, respectively. soxC encodes a protein homologous to cytochrome b, which is a subunit of the mitochondrial and bacterial cytochrome c reductases and the chloroplast cytochrome b6f complex. soxA is preceded by a promoter and the genes are cotranscribed into a 4 kb mRNA. Their protein products form a complex which has been partially purified and has quinol oxidase activity. The reduced minus oxidized absorption spectrum of the complex has two maxima at 586 and 606 nm. The latter is typical of cytochrome c oxidase. The complex contains four haems A. Two haems belong to the 'cytochrome oxidase' part of the complex and two are probably bound to be apocytochrome b (SoxC) and responsible for the 586 nm absorption peak. The homology between the sox gene products and their mitochondrial counterparts suggests that energy conservation coupled to the quinol oxidation catalysed either by the Sulfolobus oxidase or two mitochondrial respiratory enzymes may have a similar mechanism.
Project description:Electron transfer reactions are essential for life because they underpin oxidative phosphorylation and photosynthesis, processes leading to the generation of ATP, and are involved in many reactions of intermediary metabolism. Key to these roles is the formation of transient inter-protein electron transfer complexes. The structural basis for the control of specificity between partner proteins is lacking because these weak transient complexes have remained largely intractable for crystallographic studies. Inter-protein electron transfer processes are central to all of the key steps of denitrification, an alternative form of respiration in which bacteria reduce nitrate or nitrite to N2 through the gaseous intermediates nitric oxide (NO) and nitrous oxide (N2O) when oxygen concentrations are limiting. The one-electron reduction of nitrite to NO, a precursor to N2O, is performed by either a haem- or copper-containing nitrite reductase (CuNiR) where they receive an electron from redox partner proteins a cupredoxin or a c-type cytochrome. Here we report the structures of the newly characterized three-domain haem-c-Cu nitrite reductase from Ralstonia pickettii (RpNiR) at 1.01?Å resolution and its M92A and P93A mutants. Very high resolution provides the first view of the atomic detail of the interface between the core trimeric cupredoxin structure of CuNiR and the tethered cytochrome c domain that allows the enzyme to function as an effective self-electron transfer system where the donor and acceptor proteins are fused together by genomic acquisition for functional advantage. Comparison of RpNiR with the binary complex of a CuNiR with a donor protein, AxNiR-cytc551 (ref. 6), and mutagenesis studies provide direct evidence for the importance of a hydrogen-bonded water at the interface in electron transfer. The structure also provides an explanation for the preferential binding of nitrite to the reduced copper ion at the active site in RpNiR, in contrast to other CuNiRs where reductive inactivation occurs, preventing substrate binding.
Project description:DNA replication in Eukaryotes is a highly dynamic process that involves several dozens of proteins. Some of these proteins form stable complexes that are amenable to high-resolution structure determination by cryo-EM, thanks to the recent advent of the direct electron detector and powerful image analysis algorithm. But many of these proteins associate only transiently and flexibly, precluding traditional biochemical purification. We found that direct mixing of the component proteins followed by 2D and 3D image sorting can capture some very weakly interacting complexes. Even at 2D average level and at low resolution, EM images of these flexible complexes can provide important biological insights. It is often necessary to positively identify the feature-of-interest in a low resolution EM structure. We found that systematically fusing or inserting maltose binding protein (MBP) to selected proteins is highly effective in these situations. In this chapter, we describe the EM studies of several protein complexes involved in the eukaryotic DNA replication over the past decade or so. We suggest that some of the approaches used in these studies may be applicable to structural analysis of other biological systems.
Project description:BackgroundCharacterizing the structural properties of protein interaction networks will help illuminate the organizational and functional relationships among elements in biological systems.ResultsIn this paper, we present a systematic exploration of the core/periphery structures in protein interaction networks (PINs). First, the concepts of cores and peripheries in PINs are defined. Then, computational methods are proposed to identify two types of cores, k-plex cores and star cores, from PINs. Application of these methods to a yeast protein interaction network has identified 110 k-plex cores and 109 star cores. We find that the k-plex cores consist of either "party" proteins, "date" proteins, or both. We also reveal that there are two classes of 1-peripheral proteins: "party" peripheries, which are more likely to be part of protein complex, and "connector" peripheries, which are more likely connected to different proteins or protein complexes. Our results also show that, besides connectivity, other variations in structural properties are related to the variation in biological properties. Furthermore, the negative correlation between evolutionary rate and connectivity are shown toysis. Moreover, the core/periphery structures help to reveal the existence of multiple levels of protein expression dynamics.ConclusionOur results show that both the structure and connectivity can be used to characterize topological properties in protein interaction networks, illuminating the functional organization of cellular systems.
Project description:BACKGROUND: All sequenced eukaryotic genomes have been shown to possess at least a few introns. This includes those unicellular organisms, which were previously suspected to be intron-less. Therefore, gene splicing must have been present at least in the last common ancestor of the eukaryotes. To explain the evolution of introns, basically two mutually exclusive concepts have been developed. The introns-early hypothesis says that already the very first protein-coding genes contained introns while the introns-late concept asserts that eukaryotic genes gained introns only after the emergence of the eukaryotic lineage. A very important aspect in this respect is the conservation of intron positions within homologous genes of different taxa. RESULTS: GenePainter is a standalone application for mapping gene structure information onto protein multiple sequence alignments. Based on the multiple sequence alignments the gene structures are aligned down to single nucleotides. GenePainter accounts for variable lengths in exons and introns, respects split codons at intron junctions and is able to handle sequencing and assembly errors, which are possible reasons for frame-shifts in exons and gaps in genome assemblies. Thus, even gene structures of considerably divergent proteins can properly be compared, as it is needed in phylogenetic analyses. Conserved intron positions can also be mapped to user-provided protein structures. For their visualization GenePainter provides scripts for the molecular graphics system PyMol. CONCLUSIONS: GenePainter is a tool to analyse gene structure conservation providing various visualization options. A stable version of GenePainter for all operating systems as well as documentation and example data are available at http://www.motorprotein.de/genepainter.html.
Project description:On the basis of a structural analysis of 240 protein-DNA complexes contained in the Protein Data Bank (PDB), we have classified the DNA-binding proteins involved into eight different structural/functional groups, which are further classified into 54 structural families. Here we present this classification and review the functions, structures and binding interactions of these protein-DNA complexes.
Project description:Homology modelling has matured into an important technique in structural biology, significantly contributing to narrowing the gap between known protein sequences and experimentally determined structures. Fully automated workflows and servers simplify and streamline the homology modelling process, also allowing users without a specific computational expertise to generate reliable protein models and have easy access to modelling results, their visualization and interpretation. Here, we present an update to the SWISS-MODEL server, which pioneered the field of automated modelling 25 years ago and been continuously further developed. Recently, its functionality has been extended to the modelling of homo- and heteromeric complexes. Starting from the amino acid sequences of the interacting proteins, both the stoichiometry and the overall structure of the complex are inferred by homology modelling. Other major improvements include the implementation of a new modelling engine, ProMod3 and the introduction a new local model quality estimation method, QMEANDisCo. SWISS-MODEL is freely available at https://swissmodel.expasy.org.