Proteomics

Dataset Information

A multiscale functional map of somatic mutations in cancer integrating protein structure and network topology


ABSTRACT: A major goal of cancer biology is to understand the mechanisms underlying tumorigenesis driven by somatically acquired mutations. Two distinct types of computational methodologies have emerged: one analyzes the clustering of mutations within protein sequences and 3D structures, while the other characterizes mutations by leveraging the topology of the protein-protein interaction network. The two approaches yield largely non-overlapping insights and offer complementary strengths. Here, we established NetFlow3D, a unified, end-to-end, 3D structurally-informed protein interaction network propagation framework that systematically maps the multiscale mechanistic effects of somatic mutations in cancer. NetFlow3D hinges upon the Human Protein Structurome, a comprehensive repository we compiled that incorporates the 3D structures of every human protein as well as the binding interfaces of all known human protein interactions. NetFlow3D leverages the Structurome to integrate information across the atomic, residue, protein and network levels. It first conducts 3D clustering of mutations across atomic and residue levels on protein structures to identify potential driver mutations. It then anisotropically propagates their impacts across the protein interaction network, with propagation guided by the specific 3D structural interfaces involved, to identify significantly interconnected network "modules", thereby uncovering key biological processes underlying disease etiology. Applied to 1,038,899 somatic protein-altering mutations in 9,946 TCGA tumors across 33 cancer types, NetFlow3D identified 12,378 significant 3D clusters throughout the Human Protein Structurome, of which ~54% would not have been found using experimentally determined structures alone. It then identified 28 significantly interconnected modules that encompass ~8-fold more proteins than those identified by standard network analyses.
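
The two stages described in the abstract (3D clustering of mutated residues, then direction-dependent propagation over the interaction network) can be illustrated with a toy sketch. This is not the NetFlow3D implementation: the function names `cluster_mutated_residues` and `propagate`, the 6 Å distance cutoff, the single-linkage grouping, the random-walk-with-restart update, the restart probability, and the edge weights are all assumptions chosen only to make the idea concrete, and no significance testing is attempted.

```python
import math
from collections import defaultdict


def cluster_mutated_residues(coords, cutoff=6.0):
    """Group mutated residues lying within `cutoff` angstroms of each other.

    Hypothetical stand-in for 3D clustering: single-linkage grouping via
    union-find. `coords` maps a residue id to an (x, y, z) tuple.
    """
    ids = list(coords)
    parent = {r: r for r in ids}

    def find(r):
        # Follow parent links to the root, compressing the path as we go.
        while parent[r] != r:
            parent[r] = parent[parent[r]]
            r = parent[r]
        return r

    for i, a in enumerate(ids):
        for b in ids[i + 1:]:
            if math.dist(coords[a], coords[b]) <= cutoff:
                parent[find(a)] = find(b)

    groups = defaultdict(set)
    for r in ids:
        groups[find(r)].add(r)
    # Keep only groups with at least two residues.
    return [g for g in groups.values() if len(g) > 1]


def propagate(heat, edges, restart=0.5, steps=50):
    """Random walk with restart over directed, weighted edges.

    `edges` maps (source, target) to a weight. Giving the two directions of
    an interaction different weights makes the spread anisotropic. The
    weights here are assumed, e.g. reflecting how strongly a mutation
    cluster overlaps the interface used for that interaction.
    """
    nodes = set(heat)
    for src, dst in edges:
        nodes.update((src, dst))

    out_total = defaultdict(float)
    for (src, _), w in edges.items():
        out_total[src] += w

    start = {n: heat.get(n, 0.0) for n in nodes}
    current = dict(start)
    for _ in range(steps):
        nxt = {n: restart * start[n] for n in nodes}
        for (src, dst), w in edges.items():
            # Each node passes on its heat in proportion to edge weight.
            nxt[dst] += (1 - restart) * current[src] * w / out_total[src]
        current = nxt
    return current


# Toy inputs (made up for illustration only).
coords = {"A1": (0.0, 0.0, 0.0), "A2": (3.0, 0.0, 0.0), "A3": (20.0, 0.0, 0.0)}
clusters = cluster_mutated_residues(coords)

edges = {
    ("P1", "P2"): 0.9,  # cluster sits on the P1-P2 interface
    ("P1", "P3"): 0.1,  # cluster is far from the P1-P3 interface
    ("P2", "P1"): 1.0,
    ("P3", "P1"): 1.0,
}
scores = propagate({"P1": 1.0}, edges)
```

In this toy network the mutated protein `P1` passes more of its signal to `P2` than to `P3` because the assumed interface weights differ, which is the sense in which propagation is direction- and interface-dependent rather than uniform over all neighbors.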

INSTRUMENT(S): Orbitrap Fusion Lumos

ORGANISM(S): Human 293T Cells

SUBMITTER: Haiyuan Yu  

PROVIDER: MSV000094298 | MassIVE | Tue Mar 12 15:26:00 GMT 2024

SECONDARY ACCESSION(S): PXD050561

REPOSITORIES: MassIVE

Similar Datasets

| PRJNA172039 | ENA
| PRJNA172040 | ENA
2023-01-25 | MSV000091153 | MassIVE
2015-03-18 | GSE64168 | GEO
2015-03-18 | GSE60034 | GEO
2024-06-18 | PXD053208 | PRIDE
2022-12-27 | GSE202022 | GEO
2022-12-27 | GSE202021 | GEO
2022-12-27 | GSE202020 | GEO
| 2749335 | ecrin-mdr-crc