Dataset Information

Concise Polygenic Models for Cancer-Specific Identification of Drug-Sensitive Tumors from Their Multi-Omics Profiles.

ABSTRACT: In silico models to predict which tumors will respond to a given drug are necessary for Precision Oncology. However, predictive models are only available for a handful of cases (each case being a given drug acting on tumors of a specific cancer type). A way to generate predictive models for the remaining cases is with suitable machine learning algorithms that are yet to be applied to existing in vitro pharmacogenomics datasets. Here, we apply XGBoost integrated with a stringent feature selection approach, which is an algorithm that is advantageous for these high-dimensional problems. Thus, we identified and validated 118 predictive models for 62 drugs across five cancer types by exploiting four molecular profiles (sequence mutations, copy-number alterations, gene expression, and DNA methylation). Predictive models were found in each cancer type and with every molecular profile. On average, no omics profile or cancer type obtained models with higher predictive accuracy than the rest. However, within a given cancer type, some molecular profiles were overrepresented among predictive models. For instance, CNA profiles were predictive in breast invasive carcinoma (BRCA) cell lines, but not in small cell lung cancer (SCLC) cell lines where gene expression (GEX) and DNA methylation profiles were the most predictive. Lastly, we identified the best XGBoost model per cancer type and analyzed their selected features. For each model, some of the genes in the selected list had already been found to be individually linked to the response to that drug, providing additional evidence of the usefulness of these models and the merits of the feature selection scheme.

SUBMITTER: Naulaerts S

PROVIDER: S-EPMC7356608 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Concise Polygenic Models for Cancer-Specific Identification of Drug-Sensitive Tumors from Their Multi-Omics Profiles.

Naulaerts Stefan S Menden Michael P MP Ballester Pedro J PJ

Biomolecules 20200626 6

In silico models to predict which tumors will respond to a given drug are necessary for Precision Oncology. However, predictive models are only available for a handful of cases (each case being a given drug acting on tumors of a specific cancer type). A way to generate predictive models for the remaining cases is with suitable machine learning algorithms that are yet to be applied to existing in vitro pharmacogenomics datasets. Here, we apply XGBoost integrated with a stringent feature selection ...[more]

PMID: 32604779

Similar Datasets

Project description:Comprehensive studies on cancer patients with different smoking histories, including non-smokers, former smokers, and current smokers, remain elusive. Therefore, we conducted a multi-omics analysis to explore the effect of smoking history on cancer patients. Patients with smoking history were screened from The Cancer Genome Atlas database, and their multi-omics data and clinical information were downloaded. A total of 2,317 patients were included in this study, whereby current smokers presented the worst prognosis, followed by former smokers, while non-smokers showed the best prognosis. More importantly, smoking history was an independent prognosis factor. Patients with different smoking histories exhibited different immune content, and former smokers had the highest immune cells and tumor immune microenvironment. Smokers are under a higher incidence of genomic instability that can be reversed following smoking cessation in some changes. We also noted that smoking reduced the sensitivity of patients to chemotherapeutic drugs, whereas smoking cessation can reverse the situation. Competing endogenous RNA network revealed that mir-193b-3p, mir-301b, mir-205-5p, mir-132-3p, mir-212-3p, mir-1271-5p, and mir-137 may contribute significantly in tobacco-mediated tumor formation. We identified 11 methylation driver genes (including EIF5A2, GBP6, HGD, HS6ST1, ITGA5, NR2F2, PLS1, PPP1R18, PTHLH, SLC6A15, and YEATS2), and methylation modifications of some of these genes have not been reported to be associated with tumors. We constructed a 46-gene model that predicted overall survival with good predictive power. We next drew nomograms of each cancer type. Interestingly, calibration diagrams and concordance indexes are verified that the nomograms were highly accurate for the prognosis of patients. Meanwhile, we found that the 46-gene model has good applicability to the overall survival as well as to disease-specific survival and progression-free intervals. The results of this research provide new and valuable insights for the diagnosis, treatment, and follow-up of cancer patients with different smoking histories.

Dataset Information

Concise Polygenic Models for Cancer-Specific Identification of Drug-Sensitive Tumors from Their Multi-Omics Profiles.

Publications

Concise Polygenic Models for Cancer-Specific Identification of Drug-Sensitive Tumors from Their Multi-Omics Profiles.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets