Unknown

Dataset Information

0

Proteome-wide copy-number estimation from transcriptomics


ABSTRACT: Protein copy numbers constrain systems-level properties of regulatory networks, but proportional proteomic data remain scarce compared to RNA-seq. We related mRNA to protein statistically using best-available data from quantitative proteomics-transcriptomics for 4366 genes in 369 cell lines. The approach starts with a protein's median copy number and hierarchically appends mRNA-protein and mRNA-mRNA dependencies to define an optimal gene-specific model linking mRNAs to protein. For dozens of cell lines and primary samples, these protein inferences from mRNA outmatch stringent null models, a count-based protein-abundance repository, empirical protein-to-mRNA ratios, and a proteogenomic DREAM challenge winner. The optimal mRNA-to-protein relationships capture biological processes along with hundreds of known protein-protein complexes, suggesting mechanistic relationships. We use the method to identify a viral-receptor abundance threshold for coxsackievirus B3 susceptibility from 1489 systems-biology infection models parameterized by protein inference. When applied to 796 RNA-seq profiles of breast cancer, inferred copy-number estimates collectively reclassify 26-29% of luminal tumors. By adopting a gene-centered perspective of mRNA-protein covariation across different biological contexts, we achieve accuracies comparable to the technical reproducibility of contemporary proteomics.

SUBMITTER: Dr Andrew, J. Sweatt 

PROVIDER: S-SCDT-10_1038-S44320-024-00064-3 | biostudies-other |

REPOSITORIES: biostudies-other

Similar Datasets

| S-EPMC10369941 | biostudies-literature
| S-EPMC9431437 | biostudies-literature
| S-EPMC6994099 | biostudies-literature
| S-EPMC5112954 | biostudies-literature
| S-EPMC3526607 | biostudies-literature
| S-EPMC2852203 | biostudies-literature
2016-02-01 | GSE71470 | GEO
| S-EPMC2885374 | biostudies-literature
| S-EPMC10697815 | biostudies-literature
2022-10-13 | PXD019959 | Pride