Dataset Information

A computational method for the identification of candidate drugs for non-small cell lung cancer.

ABSTRACT: Lung cancer causes a large number of deaths per year. Until now, a cure for this disease has not been found or developed. Finding an effective drug through traditional experimental methods invariably costs millions of dollars and takes several years. It is imperative that computational methods be developed to integrate several types of existing information to identify candidate drugs for further study, which could reduce the cost and time of development. In this study, we tried to advance this effort by proposing a computational method to identify candidate drugs for non-small cell lung cancer (NSCLC), a major type of lung cancer. The method used three steps: (1) preliminary screening, (2) screening compounds by an association test and a permutation test, (3) screening compounds using an EM clustering algorithm. In the first step, based on the chemical-chemical interaction information reported in STITCH, a well-known database that reports interactions between chemicals and proteins, and approved NSCLC drugs, compounds that can interact with at least one approved NSCLC drug were picked. In the second step, the association test selected compounds that can interact with at least one NSCLC-related chemical and at least one NSCLC-related gene, and subsequently, the permutation test was used to discard nonspecific compounds from the remaining compounds. In the final step, core compounds were selected using a powerful clustering algorithm, the EM algorithm. Six putative compounds, protoporphyrin IX, hematoporphyrin, canertinib, lapatinib, pelitinib, and dacomitinib, were identified by this method. Previously published data show that all of the selected compounds have been reported to possess anti-NSCLC activity, indicating high probabilities of these compounds being novel candidate drugs for NSCLC.

SUBMITTER: Chen L

PROVIDER: S-EPMC5562320 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A computational method for the identification of candidate drugs for non-small cell lung cancer.

Chen Lei L Lu Jing J Huang Tao T Cai Yu-Dong YD

PloS one 20170818 8

Lung cancer causes a large number of deaths per year. Until now, a cure for this disease has not been found or developed. Finding an effective drug through traditional experimental methods invariably costs millions of dollars and takes several years. It is imperative that computational methods be developed to integrate several types of existing information to identify candidate drugs for further study, which could reduce the cost and time of development. In this study, we tried to advance this e ...[more]

PMID: 28820893

Similar Datasets

Project description:Background: Non-small-cell lung cancer (NSCLC) remains the leading cause of cancer morbidity and mortality worldwide. In the present study, we identified novel biomarkers associated with the pathogenesis of NSCLC aiming to provide new diagnostic and therapeutic approaches for NSCLC. Methods: The microarray datasets of GSE18842, GSE30219, GSE31210, GSE32863 and GSE40791 from Gene Expression Omnibus database were downloaded. The differential expressed genes (DEGs) between NSCLC and normal samples were identified by limma package. The construction of protein-protein interaction (PPI) network, module analysis and enrichment analysis were performed using bioinformatics tools. The expression and prognostic values of hub genes were validated by GEPIA database and real-time quantitative PCR. Based on these DEGs, the candidate small molecules for NSCLC were identified by the CMap database. Results: A total of 408 overlapping DEGs including 109 up-regulated and 296 down-regulated genes were identified; 300 nodes and 1283 interactions were obtained from the PPI network. The most significant biological process and pathway enrichment of DEGs were response to wounding and cell adhesion molecules, respectively. Six DEGs (PTTG1, TYMS, ECT2, COL1A1, SPP1 and CDCA5) which significantly up-regulated in NSCLC tissues, were selected as hub genes according to the results of module analysis. The GEPIA database further confirmed that patients with higher expression levels of these hub genes experienced a shorter overall survival. Additionally, CMap predicted the 20 most significant small molecules as potential therapeutic drugs for NSCLC. DL-thiorphan was the most promising small molecule to reverse the NSCLC gene expression. Conclusions: Based on the gene expression profiles of 696 NSCLC samples and 237 normal samples, we first revealed that PTTG1, TYMS, ECT2, COL1A1, SPP1 and CDCA5 could act as the promising novel diagnostic and therapeutic targets for NSCLC. Our work will contribute to clarifying the molecular mechanisms of NSCLC initiation and progression.

Project description:BackgroundNon-small cell lung cancer (NSCLC) is the most prevalent malignant tumor of the lung cancer, for which the molecular mechanisms remain unknown. In this study, we identified novel biomarkers associated with the pathogenesis of NSCLC aiming to provide new diagnostic and therapeutic approaches for NSCLC by bioinformatics analysis.MethodsFrom the Gene Expression Omnibus database, GSE118370 and GSE10072 microarray datasets were obtained. Identifying the differentially expressed genes (DEGs) between lung adenocarcinoma and normal samples was done. By using bioinformatics tools, a protein-protein interaction (PPI) network was constructed, modules were analyzed, and enrichment analyses were performed. The expression and prognostic values of 14 hub genes were validated by the GEPIA database, and the correlation between hub genes and survival in lung adenocarcinoma was assessed by UALCAN, cBioPortal, String and Cytoscape, and Timer tools.ResultsWe found three genes (PIK3R1, SPP1, and PECAM1) that have a clear correlation with OS in the lung adenocarcinoma patient. It has been found that lung adenocarcinoma exhibits high expression of SPP1 and that this has been associated with poor prognosis, while low expression of PECAM1 and PIK3R1 is associated with poor prognosis (P < 0.05). We also found that the expression of SPP1 was associated with miR-146a-5p, while the high expression of miR-146a-5p was related to good prognosis (P < 0.05). On the contrary, the lower miR-21-5p on upstream of PIK3R1 is associated with a higher surviving rate in cancer patients (P < 0.05). Finally, we found that the immune checkpoint genes CD274(PD-L1) and PDCD1LG2(PD-1) were also related to SPP1 in lung adenocarcinoma.ConclusionsThe results indicated that SPP1 is a cancer promoter (oncogene), while PECAM1 and PIK3R1 are cancer suppressor genes. These genes take part in the regulation of biological activities in lung adenocarcinoma, which provides a basis for improving detection and immunotherapeutic targets for lung adenocarcinoma.

Dataset Information

A computational method for the identification of candidate drugs for non-small cell lung cancer.

Publications

A computational method for the identification of candidate drugs for non-small cell lung cancer.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets