Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

Deep Human Proteome Sequencing Enables Global Detection of Mutations and Alternative Splicing

ABSTRACT: Mass spectrometry-based proteomics now routinely enables identification of over 10,000 human proteins from a single sample. However, proteins are typically identified by peptide sequences representing about 20% of all proteinogenic amino acids encoded in the transcriptome. Deeper protein sequencing - detection of all amino acids - is imperative for proteoform discovery and quantitative comparison. Here, we utilized six ENCODE cell lines, six proteases, and three tandem mass spectrometry (MS/MS) fragmentation methods to collect 2,491 raw MS data files. From these data we identified 17,717 protein groups with a median sequence coverage of 79.2%, confirming over eight million unique human amino acid residues. We compare our proteomics data with transcriptomics data and demonstrate how such deep proteome coverage can enable detection of over 7,000 proteoforms including 70.9 to 90.6% of all non-synonymous mutations and over 5,000 alternative splicing event junctions. Our dataset represents a valuable resource as the largest human proteome with the highest sequence coverage ever reported.

INSTRUMENT(S): Orbitrap Fusion Lumos, Orbitrap Fusion

ORGANISM(S): Homo Sapiens (ncbitaxon:9606)

SUBMITTER: Joshua Coon Juergen Cox

PROVIDER: MSV000086944 | MassIVE | Wed Feb 24 09:31:00 GMT 2021

SECONDARY ACCESSION(S): PXD024364

REPOSITORIES: MassIVE

ACCESS DATA

Json Xml

Dataset's files

Source:

			Action	DRS
		Other

Items per page:

1 - 1 of 1

Similar Datasets

Translational control through differential ribosome pausing during amino acid limitation in mammalian cells

Project description:Limitation for amino acids is thought to regulate translation in mammalian cells primarily by signaling through the kinases mTORC1 and GCN2. We find that limitation for the amino acid arginine causes a selective loss of tRNA charging, which regulates translation through ribosome pausing at two of six arginine codons. Interestingly, limitation for leucine, an essential and abundant amino acid in protein, results in little or no ribosome pausing. Chemical and genetic perturbation of mTORC1 and GCN2 signaling revealed that their robust response to leucine limitation prevents ribosome pausing, while an insufficient response to arginine limitation led to loss of arginine tRNA charging and ribosome pausing. Codon-specific ribosome pausing decreased protein production and triggered premature ribosome termination without significantly reducing mRNA levels. Together, our results suggest that amino acids which are not optimally sensed by the mTORC1 and GCN2 pathways still regulate translation through an evolutionarily conserved mechanism based on synonymous codon usage.

2018-07-19 | GSE113751 | GEO

Amino acid misincorporation promotes mRNA instability

Project description:Messenger RNA (mRNA) translation can lead to higher rates of mRNA decay, suggesting a role for the ribosome in mRNA destruction. Furthermore, features of an mRNA, such as codon identities, that are directly probed by the ribosome also correlate with mRNA decay rates. Specifically, many amino acids are encoded by synonymous codons, and some synonymous codons are decoded by more abundant tRNAs leading to more optimal translation and increased mRNA stability. In addition to different translation rates, the presence of individual codons can lead to higher or lower rates of amino acid misincorporation which could potentially lead to protein misfolding if an individual amino acid makes many critical contacts in a structure. Here, we directly test whether amino acid misincorporation affects mRNA stability, taking advantage of an aminoglycoside antibiotic (G418) which promotes higher error rates in the ribosome. We observe that G418 decreases firefly luciferase mRNA stability in an in vitro system, and we similarly observe that G418 reduces mRNA stability in mouse embryonic stem cells (mESCs). G418-sensitive mRNAs are enriched for suboptimal hydrophobic amino acid codons as well as other codons that are known to result in higher rates of amino acid misincorporation. Since protein folding is highly sensitive to the identity of hydrophobic amino acids, these results strongly suggest that defects in protein folding are linked to mRNA decay.

2022-03-01 | GSE184874 | GEO

UNC Hepatocellular Carcinoma Study by Exome Sequencing (HCCSES)

Project description:<p>Genetic alterations in specific driver genes lead to disruption of cellular pathways and are critical events in the instigation and progression of hepatocellular carcinoma. As a prerequisite for individualized cancer treatment, we sought to characterize the landscape of recurrent somatic mutations in hepatocellular carcinoma. We performed whole exome sequencing on 87 hepatocellular carcinomas and matched normal adjacent tissues to an average coverage of 59x. The overall mutation rate was roughly 2 mutations per Mb, with a median of 45 non-synonymous mutations that altered the amino acid sequence (range 2 to 381). We found recurrent mutations in several genes with high transcript levels: TP53 (18%), CTNNB1 (10%), KEAP1 (8%), C16orf62 (8%), MLL4 (7%) and RAC2 (5%). Significantly affected gene families include the nucleotide-binding domain and leucine rich repeat containing family, calcium channel subunits, and histone methyltransferases. In particular, the MLL family methyltransferases for histone H3 lysine 4 were mutated in 20% of tumors. Conclusion: The NFE2L2-KEAP1 and MLL pathways are recurrently mutated in multiple cohorts of hepatocellular carcinoma.</p>

| phs000627 | dbGaP

An internal nutrient sensor detects specific dietary amino acids and promotes food consumption in Drosophila

Project description:Adequate protein intake is crucial for animals. Despite the recent progress in understanding protein hunger and satiety in the fruit fly Drosophila melanogaster, how fruit flies assess prospective dietary protein sources and ensure protein consumption remains elusive. We show here that three specific amino acids, L-glutamate (L-Glu), L-alanine (L-Ala), and L-aspartate (L-Asp), but not the D-enantiomers, rapidly promote food consumption in fruit flies when present in food. The effect of dietary amino acids to promote food consumption is independent of mating experience and internal nutritional status. Calcium imaging experiments show that six brain neurons expressing diuretic hormone 44 (DH44) can be rapidly and directly activated by these three amino acids during feeding. Genetic analysis shows that DH44+ neurons are both necessary and sufficient for dietary amino acids to promote food consumption. By conducting single cell RNAseq analysis, we also identify a amino acid transporter, CG13248, which is highly expressed in DH44+ neurons and is required for dietary amino acids to promote food consumption. Therefore, these data suggest that dietary amino acids may enter DH44+ neurons via CG13248 and modulate their activity and hence food consumption. Taken together, these data identify an internal amino acid sensor in the fly brain that evaluate food sources post-ingestively and facilitates adequate protein intake. These results shed critical light on the regulation of protein homeostasis at organismal levels by the nervous system.

2018-05-04 | GSE113990 | GEO

Alanine scan of 11 amino acid long novel linear epitope from C.jejuni cj0669 protein [2]

Project description:A short sequence of 11 amino acids belonging to the cj0669 protein from Campylobacter jejuni NCTC 11168 which was previously identified as potentially immunogenig was analyzed via alanine scanning to narrow down the significant amino acid residues within the sequence.

2013-05-06 | GSE46651 | GEO

Alanine scan of 11 amino acid long novel linear epitope from C. jejuni cj0669 protein

Project description:A short sequence of 11 amino acids belonging to the cj0669 protein from Campylobacter jejuni NCTC 11168, which was previously identified as potentially immunogenic, was analyzed via alanine scanning to narrow down the significant amino acid residues within the sequence.

2013-04-06 | GSE45556 | GEO

Multi-protease approach for the improved identification and molecular characterization of small proteins and short open reading frame-encoded peptides

Project description:The identification of proteins below 70 amino acids in bottom-up proteomics is still a challenging task due to the limited number of peptides generated by proteolytic digestion. This includes the short open reading frame-encoded peptides (SEP), which are a subset of the small proteins that were not previously annotated or that are alternatively encoded. Here, we systematically investigated the use of multiple proteases (trypsin, chymotrypsin, LysC, LysArgiNase and GluC) in GeLC-MS/MS analysis to improve the sequence coverage and the number of identified peptides for small proteins (<70 amino acids), with a focus on SEP, in the archaeon Methanosarcina mazei. Combining the data of all proteases, we identified 63 small proteins and additional 28 SEP with at least two unique peptides, while only 55 small proteins and 22 SEP could be identified using trypsin only. For 27 small proteins and 12 SEP, a 100 % sequence coverage could be achieved. Moreover, for five SEP, incorrectly predicted translation start points were identified, confirming the data of a previous top-down proteomics study of this organism. The results show clearly that a multi-protease approach can improve the identification and molecular characterization of small proteins and SEP.

2021-04-01 | PXD023921 | Pride

D-amino acids enrichment

Project description:D-amino acids enrichment Raw sequence reads

| PRJNA745484 | ENA

Codon usage bias is correlated with gene expression levels in the fission yeast Schizosaccharomyces pombe.

Project description:Usage of synonymous codons represents a characteristic pattern of preference in each organism. It has been inferred that such bias of codon usage has evolved as a result of adaptation for efficient synthesis of proteins. Here we examined synonymous codon usage in genes of the fission yeast Schizosaccharomyces pombe, and compared codon usage bias with expression levels of the gene. In this organism, synonymous codon usage bias was correlated with expression levels of the gene; the bias was most obvious in two-codon amino acids. A similar pattern of the codon usage bias was also observed in Saccharomyces cerevisiae, Arabidopsis thaliana, and Caenorhabditis elegans, but was not obvious in Oryza sativa, Drosophila melanogaster, Takifugu rubripes and Homo sapiens. As codons of the highly expressed genes have greater influence on translational efficiency than codons of genes expressed at lower levels, it is likely that codon usage in the S. pombe genome has been optimized by translational selection through evolution. Relative amounts of mRNA for each ORF were measured by DNA microarray using genomic DNA as a reference, and the copy number of mRNA was calculated using an estimate of the total mRNA number in the cell as 100,000 copies.

2010-06-20 | E-GEOD-13554 | biostudies-arrayexpress

SLC25A45 is required for mitochondrial uptake of methylated basic amino acids and de novo carnitine biosynthesis

Project description:Methylated amino acids accumulate upon the degradation of methylated proteins and are implicated in diverse metabolic and signalling pathways. Consequently, disturbed methylated amino acid homeostasis is associated with cardiovascular disease and renal failure. Mitochondria are core processing hubs in conventional amino acid metabolism but how they interact with methylated amino acids is unclear. Here, we reveal that the orphan mitochondrial solute carrier SLC25A45 is required for the mitochondrial uptake of methylated amino acids. SLC25A45 binds with dimethylarginine and trimethyllysine but has no affinity for unmethylated arginine and lysine. A non-synonymous mutation of human SLC25A45 (R285C) stabilises the carrier by limiting its proteolytic degradation by the m-AAA protease and associates with altered methylated amino acids in human plasma. Metabolic tracing of trimethyllysine in cancer cells demonstrates that SLC25A45 drives the biosynthesis of the key metabolite carnitine. Furthermore, depletion of SLC25A45 limits the proliferation and survival of ovarian cancer cells upon glucose deprivation. SLC25A45 is therefore an essential mediator of compartmentalised methylated amino acid metabolism with diverse cellular roles.

2025-06-30 | MSV000098382 | MassIVE

OmicsDI is part of the ELIXIR infrastructure

OmicsDI is an Elixir interoperability service. Learn more ›

Tweets

OmicsDI Databases

PRIDE
PeptideAtlas
MassIVE
JPOST Repository
Physiome Model Repository

EGA
EVA
ENA
LINCS
PAXDB
Cell Collective

MetaboLights
Metabolomics Workbench
MetabolomeExpress
GNPS
BioModels
FAIRDOMHub

ArrayExpress
dbGaP
ExpressionAtlas
GEO
NODE

Information

Databases
Help
API
Contact us
Code on GitHub
Terms of use
Submit Data