Project description:BackgroundIdentification and selection of protein particles in cryo-electron micrographs is an important step in single particle analysis. In this study, we developed a deep learning-based particle picking network to automatically detect particle centers from cryoEM micrographs. This is a challenging task due to the nature of cryoEM data, having low signal-to-noise ratios with variable particle sizes, shapes, distributions, grayscale variations as well as other undesirable artifacts.ResultsWe propose a double convolutional neural network (CNN) cascade for automated detection of particles in cryo-electron micrographs. This approach, entitled Deep Regression Picker Network or "DRPnet", is simple but very effective in recognizing different particle sizes, shapes, distributions and grayscale patterns corresponding to 2D views of 3D particles. Particles are detected by the first network, a fully convolutional regression network (FCRN), which maps the particle image to a continuous distance map that acts like a probability density function of particle centers. Particles identified by FCRN are further refined to reduce false particle detections by the second classification CNN. DRPnet's first CNN pretrained with only a single cryoEM dataset can be used to detect particles from different datasets without retraining. Compared to RELION template-based autopicking, DRPnet results in better particle picking performance with drastically reduced user interactions and processing time. DRPnet also outperforms the state-of-the-art particle picking networks in terms of the supervised detection evaluation metrics recall, precision, and F-measure. To further highlight quality of the picked particle sets, we compute and present additional performance metrics assessing the resulting 3D reconstructions such as number of 2D class averages, efficiency/angular coverage, Rosenthal-Henderson plots and local/global 3D reconstruction resolution.ConclusionDRPnet shows greatly improved time-savings to generate an initial particle dataset compared to manual picking, followed by template-based autopicking. Compared to other networks, DRPnet has equivalent or better performance. DRPnet excels on cryoEM datasets that have low contrast or clumped particles. Evaluating other performance metrics, DRPnet is useful for higher resolution 3D reconstructions with decreased particle numbers or unknown symmetry, detecting particles with better angular orientation coverage.
Project description:Cryo-electron microscopy is a popular method for the determination of protein structures; however, identifying a sufficient number of particles for analysis can take months of manual effort. Current computational approaches find many false positives and require ad hoc postprocessing, especially for unusually shaped particles. To address these shortcomings, we develop Topaz, an efficient and accurate particle-picking pipeline using neural networks trained with a general-purpose positive-unlabeled learning method. This framework enables particle detection models to be trained with few sparsely labeled particles and no labeled negatives. Topaz retrieves many more real particles than conventional picking methods while maintaining low false-positive rates, is capable of picking challenging unusually shaped proteins (for example, small, non-globular and asymmetric particles), produces more representative particle sets and does not require post hoc curation. We demonstrate the performance of Topaz on two difficult datasets and three conventional datasets. Topaz is modular, standalone, free and open source ( http://topaz.csail.mit.edu ).
Project description:Foamy viruses (FV) belong to the genus Spumavirus, which forms a distinct lineage in the Retroviridae family. Although the infection in natural hosts and zoonotic transmission to humans is asymptomatic, FVs can replicate well in human cells making it an attractive gene therapy vector candidate. Here we present cryo-electron microscopy and (cryo-)electron tomography ultrastructural data on purified prototype FV (PFV) and PFV infected cells. Mature PFV particles have a distinct morphology with a capsid of constant dimension as well as a less ordered shell of density between the capsid and the membrane likely formed by the Gag N-terminal domain and the cytoplasmic part of the Env leader peptide gp18LP. The viral membrane contains trimeric Env glycoproteins partly arranged in interlocked hexagonal assemblies. In situ 3D reconstruction by subtomogram averaging of wild type Env and of a Env gp48TM- gp80SU cleavage site mutant showed a similar spike architecture as well as stabilization of the hexagonal lattice by clear connections between lower densities of neighboring trimers. Cryo-EM was employed to obtain a 9 Å resolution map of the glycoprotein in its pre-fusion state, which revealed extensive trimer interactions by the receptor binding subunit gp80SU at the top of the spike and three central helices derived from the fusion protein subunit gp48TM. The lower part of Env, presumably composed of interlaced parts of gp48TM, gp80SU and gp18LP anchors the spike at the membrane. We propose that the gp48TM density continues into three central transmembrane helices, which interact with three outer transmembrane helices derived from gp18LP. Our ultrastructural data and 9 Å resolution glycoprotein structure provide important new insights into the molecular architecture of PFV and its distinct evolutionary relationship with other members of the Retroviridae.
Project description:The increasing power and popularity of cryo-electron microscopy (cryo-EM) in structural biology brought about the development of so-called hybrid methods, which permit the interpretation of cryo-EM density maps beyond their nominal resolution in terms of atomic models. The Cryo-EM Modeling Challenge 2010 is the first community effort to bring together developers of hybrid methods as well as cryo-EM experimentalists. Participating in the challenge, the molecular dynamics flexible fitting (MDFF) method was applied to a number of cryo-EM density maps. The results are described here with special emphasis on the use of symmetry-based restraints to improve the quality of atomic models derived from density maps of symmetric complexes; on a comparison of the stereochemical quality of atomic models resulting from different hybrid methods; and on application of MDFF to electron crystallography data.
Project description:The basic unit of chromatin, the nucleosome core particle (NCP), controls how DNA in eukaryotic cells is compacted, replicated and read. Since its discovery, biochemists have sought to understand how this protein-DNA complex can help to control so many diverse tasks. Recent electron-microscopy (EM) studies on NCP-containing assemblies have helped to describe important chromatin transactions at a molecular level. With the implementation of recent technical advances in single-particle EM, our understanding of how nucleosomes are recognized and read looks to take a leap forward. In this review, the authors highlight recent advances in the architectural understanding of chromatin biology elucidated by EM.
Project description:Image restoration techniques are used to obtain, given experimental measurements, the best possible approximation of the original object within the limits imposed by instrumental conditions and noise level in the data. In molecular electron microscopy (EM), we are mainly interested in linear methods that preserve the respective relationships between mass densities within the restored map. Here, we describe the methodology of image restoration in structural EM, and more specifically, we will focus on the problem of the optimum recovery of Fourier amplitudes given electron microscope data collected under various defocus settings. We discuss in detail two classes of commonly used linear methods, the first of which consists of methods based on pseudoinverse restoration, and which is further subdivided into mean-square error, chi-square error, and constrained based restorations, where the methods in the latter two subclasses explicitly incorporates non-white distribution of noise in the data. The second class of methods is based on the Wiener filtration approach. We show that the Wiener filter-based methodology can be used to obtain a solution to the problem of amplitude correction (or "sharpening") of the EM map that makes it visually comparable to maps determined by X-ray crystallography, and thus amenable to comparative interpretation. Finally, we present a semiheuristic Wiener filter-based solution to the problem of image restoration given sets of heterogeneous solutions. We conclude the chapter with a discussion of image restoration protocols implemented in commonly used single particle software packages.
Project description:Mitochondria are dynamic organelles that continually adapt their morphology by fusion and fission events. An imbalance between fusion and fission has been linked to major neurodegenerative diseases, including Huntington's, Alzheimer's, and Parkinson's diseases. A member of the Dynamin superfamily, dynamin-related protein 1 (DRP1), a dynamin-related GTPase, is required for mitochondrial membrane fission. Self-assembly of DRP1 into oligomers in a GTP-dependent manner likely drives the division process. We show here that DRP1 self-assembles in two ways: i) in the presence of the non-hydrolysable GTP analog GMP-PNP into spiral-like structures of ~36 nm diameter; and ii) in the presence of GTP into rings composed of 13-18 monomers. The most abundant rings were composed of 16 monomers and had an outer and inner ring diameter of ~30 nm and ~20 nm, respectively. Three-dimensional analysis was performed with rings containing 16 monomers. The single-particle cryo-electron microscopy map of the 16 monomer DRP1 rings suggests a side-by-side assembly of the monomer with the membrane in a parallel fashion. The inner ring diameter of 20 nm is insufficient to allow four membranes to exist as separate entities. Furthermore, we observed that mitochondria were tubulated upon incubation with DRP1 protein in vitro. The tubes had a diameter of ~ 30nm and were decorated with protein densities. These findings suggest DRP1 tubulates mitochondria, and that additional steps may be required for final mitochondrial fission.
Project description:Correct reconstruction of macromolecular structure by cryo-electron microscopy (cryo-EM) relies on accurate determination of the orientation of single-particle images. For small (<100 kDa) DNA-binding proteins, obtaining particle images with sufficiently asymmetric features to correctly guide alignment is challenging. We apply DNA origami to construct molecular goniometers-instruments that precisely orient objects-and use them to dock a DNA-binding protein on a double-helix stage that has user-programmable tilt and rotation angles. We construct goniometers with 14 different stage configurations to orient and visualize the protein just above the cryo-EM grid surface. Each goniometer has a distinct barcode pattern that we use during particle classification to assign angle priors to the bound protein. We use goniometers to obtain a 6.5-Å structure of BurrH, an 82-kDa DNA-binding protein whose helical pseudosymmetry prevents accurate image orientation using traditional cryo-EM. Our approach should be adaptable to other DNA-binding proteins as well as small proteins fused to DNA-binding domains.
Project description:Huntingtin (HTT) is a large (348?kDa) protein that is essential for embryonic development and is involved in diverse cellular activities such as vesicular transport, endocytosis, autophagy and the regulation of transcription. Although an integrative understanding of the biological functions of HTT is lacking, the large number of identified HTT interactors suggests that it serves as a protein-protein interaction hub. Furthermore, Huntington's disease is caused by a mutation in the HTT gene, resulting in a pathogenic expansion of a polyglutamine repeat at the amino terminus of HTT. However, only limited structural information regarding HTT is currently available. Here we use cryo-electron microscopy to determine the structure of full-length human HTT in a complex with HTT-associated protein 40 (HAP40; encoded by three F8A genes in humans) to an overall resolution of 4 Å. HTT is largely ?-helical and consists of three major domains. The amino- and carboxy-terminal domains contain multiple HEAT (huntingtin, elongation factor 3, protein phosphatase 2A and lipid kinase TOR) repeats arranged in a solenoid fashion. These domains are connected by a smaller bridge domain containing different types of tandem repeats. HAP40 is also largely ?-helical and has a tetratricopeptide repeat-like organization. HAP40 binds in a cleft and contacts the three HTT domains by hydrophobic and electrostatic interactions, thereby stabilizing the conformation of HTT. These data rationalize previous biochemical results and pave the way for improved understanding of the diverse cellular functions of HTT.