Dataset Information

Machine Learning Model Analysis and Data Visualization with Small Molecules Tested in a Mouse Model of Mycobacterium tuberculosis Infection (2014-2015).

ABSTRACT: The renewed urgency to develop new treatments for Mycobacterium tuberculosis (Mtb) infection has resulted in large-scale phenotypic screening and thousands of new active compounds in vitro. The next challenge is to identify candidates to pursue in a mouse in vivo efficacy model as a step to predicting clinical efficacy. We previously analyzed over 70 years of this mouse in vivo efficacy data, which we used to generate and validate machine learning models. Curation of 60 additional small molecules with in vivo data published in 2014 and 2015 was undertaken to further test these models. This represents a much larger test set than for the previous models. Several computational approaches have now been applied to analyze these molecules and compare their molecular properties beyond those attempted previously. Our previous machine learning models have been updated, and a novel aspect has been added in the form of mouse liver microsomal half-life (MLM t1/2) and in vitro-based Mtb models incorporating cytotoxicity data that were used to predict in vivo activity for comparison. Our best Mtb in vivo models possess fivefold ROC values > 0.7, sensitivity > 80%, and concordance > 60%, while the best specificity value is >40%. Use of an MLM t1/2 Bayesian model affords comparable results for scoring the 60 compounds tested. Combining MLM stability and in vitro Mtb models in a novel consensus workflow in the best cases has a positive predicted value (hit rate) > 77%. Our results indicate that Bayesian models constructed with literature in vivo Mtb data generated by different laboratories in various mouse models can have predictive value and may be used alongside MLM t1/2 and in vitro-based Mtb models to assist in selecting antitubercular compounds with desirable in vivo efficacy. We demonstrate for the first time that consensus models of any kind can be used to predict in vivo activity for Mtb. In addition, we describe a new clustering method for data visualization and apply this to the in vivo training and test data, ultimately making the method accessible in a mobile app.

SUBMITTER: Ekins S

PROVIDER: S-EPMC4962118 | biostudies-literature | 2016 Jul

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Machine Learning Model Analysis and Data Visualization with Small Molecules Tested in a Mouse Model of Mycobacterium tuberculosis Infection (2014-2015).

Ekins Sean S Perryman Alexander L AL Clark Alex M AM Reynolds Robert C RC Freundlich Joel S JS

Journal of chemical information and modeling 20160701 7

The renewed urgency to develop new treatments for Mycobacterium tuberculosis (Mtb) infection has resulted in large-scale phenotypic screening and thousands of new active compounds in vitro. The next challenge is to identify candidates to pursue in a mouse in vivo efficacy model as a step to predicting clinical efficacy. We previously analyzed over 70 years of this mouse in vivo efficacy data, which we used to generate and validate machine learning models. Curation of 60 additional small molecule ...[more]

PMID: 27335215

Similar Datasets

Project description:Mycobacterium tuberculosis remains a significant threat to global health. Macrophages are the host cell for M. tuberculosis infection, and although bacteria are able to replicate intracellularly under certain conditions, it is also clear that macrophages are capable of killing M. tuberculosis if appropriately activated. The outcome of infection is determined at least in part by the host-pathogen interaction within the macrophage; however, we lack a complete understanding of which host pathways are critical for bacterial survival and replication. To add to our understanding of the molecular processes involved in intracellular infection, we performed a chemical screen using a high-content microscopic assay to identify small molecules that restrict mycobacterial growth in macrophages by targeting host functions and pathways. The identified host-targeted inhibitors restrict bacterial growth exclusively in the context of macrophage infection and predominantly fall into five categories: G-protein coupled receptor modulators, ion channel inhibitors, membrane transport proteins, anti-inflammatories, and kinase modulators. We found that fluoxetine, a selective serotonin reuptake inhibitor, enhances secretion of pro-inflammatory cytokine TNF-α and induces autophagy in infected macrophages, and gefitinib, an inhibitor of the Epidermal Growth Factor Receptor (EGFR), also activates autophagy and restricts growth. We demonstrate that during infection signaling through EGFR activates a p38 MAPK signaling pathway that prevents macrophages from effectively responding to infection. Inhibition of this pathway using gefitinib during in vivo infection reduces growth of M. tuberculosis in the lungs of infected mice. Our results support the concept that screening for inhibitors using intracellular models results in the identification of tool compounds for probing pathways during in vivo infection and may also result in the identification of new anti-tuberculosis agents that work by modulating host pathways. Given the existing experience with some of our identified compounds for other therapeutic indications, further clinically-directed study of these compounds is merited.

Project description:Interrupting transmission is an attractive anti-tuberculosis (TB) strategy but it remains underexplored owing to our poor understanding of the events surrounding transfer of Mycobacterium tuberculosis (Mtb) between hosts. Determining when live, infectious Mtb bacilli are released and by whom has proven especially challenging. Consequently, transmission chains are inferred only retrospectively, when new cases are diagnosed. This process, which relies on molecular analyses of Mtb isolates for epidemiological fingerprinting, is confounded by the prolonged infectious period of TB and the potential for transmission from transient exposures. We developed a Respiratory Aerosol Sampling Chamber (RASC) equipped with high-efficiency filtration and sampling technologies for liquid-capture of all particulate matter (including Mtb) released during respiration and non-induced cough. Combining the mycobacterial cell wall probe, DMN-trehalose, with fluorescence microscopy of RASC-captured bioaerosols, we detected and quantified putative live Mtb bacilli in bioaerosol samples arrayed in nanowell devices. The RASC enabled non-invasive capture and isolation of viable Mtb from bioaerosol within 24 hours of collection. A median 14 live Mtb bacilli (range 0-36) were isolated in single-cell format from 90% of confirmed TB patients following 60 minutes bioaerosol sampling. This represented a significant increase over previous estimates of transmission potential, implying that many more organisms might be released daily than commonly assumed. Moreover, variations in DMN-trehalose incorporation profiles suggested metabolic heterogeneity in aerosolized Mtb. Finally, preliminary analyses indicated the capacity for serial image capture and analysis of nanowell-arrayed bacilli for periods extending into weeks. These observations support the application of this technology to longstanding questions in TB transmission including the propensity for asymptomatic transmission, the impact of TB treatment on Mtb bioaerosol release, and the physiological state of aerosolized bacilli.

Dataset Information

Machine Learning Model Analysis and Data Visualization with Small Molecules Tested in a Mouse Model of Mycobacterium tuberculosis Infection (2014-2015).

Publications

Machine Learning Model Analysis and Data Visualization with Small Molecules Tested in a Mouse Model of Mycobacterium tuberculosis Infection (2014-2015).

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets