Dataset Information

Assembling the Community-Scale Discoverable Human Proteome.

ABSTRACT: The increasing throughput and sharing of proteomics mass spectrometry data have now yielded over one-third of a million public mass spectrometry runs. However, these discoveries are not continuously aggregated in an open and error-controlled manner, which limits their utility. To facilitate the reusability of these data, we built the MassIVE Knowledge Base (MassIVE-KB), a community-wide, continuously updating knowledge base that aggregates proteomics mass spectrometry discoveries into an open reusable format with full provenance information for community scrutiny. Reusing >31 TB of public human data stored in a mass spectrometry interactive virtual environment (MassIVE), the MassIVE-KB contains >2.1 million precursors from 19,610 proteins (48% larger than before; 97% of the total) and doubles proteome coverage to 6 million amino acids (54% of the proteome) with strict library-scale false discovery controls, thereby providing evidence for 430 proteins for which sufficient protein-level evidence was previously missing. Furthermore, MassIVE-KB can inform experimental design, helps identify and quantify new data, and provides tools for community construction of specialized spectral libraries.

SUBMITTER: Wang M

PROVIDER: S-EPMC6279426 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Assembling the Community-Scale Discoverable Human Proteome.

Wang Mingxun M Wang Jian J Carver Jeremy J Pullman Benjamin S BS Cha Seong Won SW Bandeira Nuno N

Cell systems 20180829 4

The increasing throughput and sharing of proteomics mass spectrometry data have now yielded over one-third of a million public mass spectrometry runs. However, these discoveries are not continuously aggregated in an open and error-controlled manner, which limits their utility. To facilitate the reusability of these data, we built the MassIVE Knowledge Base (MassIVE-KB), a community-wide, continuously updating knowledge base that aggregates proteomics mass spectrometry discoveries into an open re ...[more]

PMID: 30172843

Similar Datasets

Project description:Because proteins are the main mediators of most cellular processes they are also prime therapeutic targets. Identifying physical links among proteins and between drugs and their protein targets is essential in order to understand the mechanisms through which both proteins themselves and the molecules they are targeted with act. Thus, there is a strong need for sensitive methods that enable mapping out these biomolecular interactions. Here we present a robust and sensitive approach to screen proteome-scale collections of proteins for binding to proteins or small molecules using the well validated MAPPIT (Mammalian Protein-Protein Interaction Trap) and MASPIT (Mammalian Small Molecule-Protein Interaction Trap) assays. Using high-density reverse transfected cell microarrays, a close to proteome-wide collection of human ORF clones can be screened for interactors at high throughput. The versatility of the platform is demonstrated through several examples. With MAPPIT, we screened a 15k ORF library for binding partners of RNF41, an E3 ubiquitin protein ligase implicated in receptor sorting, identifying known and novel interacting proteins. The potential related to the fact that MAPPIT operates in living human cells is illustrated in a screen where the protein collection is scanned for interactions with the glucocorticoid receptor (GR) in its unliganded versus dexamethasone-induced activated state. Several proteins were identified the interaction of which is modulated upon ligand binding to the GR, including a number of previously reported GR interactors. Finally, the screening technology also enables detecting small molecule target proteins, which in many drug discovery programs represents an important hurdle. We show the efficiency of MASPIT-based target profiling through screening with tamoxifen, a first-line breast cancer drug, and reversine, an investigational drug with interesting dedifferentiation and antitumor activity. In both cases, cell microarray screens yielded known and new potential drug targets highlighting the utility of the technology beyond fundamental biology.

Dataset Information

Assembling the Community-Scale Discoverable Human Proteome.

Publications

Assembling the Community-Scale Discoverable Human Proteome.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets