Using a gene expression biomarker to identify DNA damage-inducing agents in microarray profiles.
Ontology highlight
ABSTRACT: High-throughput transcriptomic technologies are increasingly being used to screen environmental chemicals in vitro to provide mechanistic context for regulatory testing. The TGx-DDI biomarker is a 64-gene expression profile generated from testing 28 model chemicals or treatments (13 that cause DNA damage and 15 that do not) in human TK6 cells. While the biomarker is very accurate at predicting DNA damage inducing (DDI) potential using the nearest shrunken centroid method, the broad utility of the biomarker using other computational methods is not fully known. Here, we determined the accuracy of the biomarker used with the Running Fisher test, a nonparametric correlation test. In TK6 cells, the methods could readily differentiate DDI and non-DDI compounds with balanced accuracies of 87-97%, depending on the threshold for determining DDI positives. The methods identified DDI agents in the metabolically competent hepatocyte cell line HepaRG (accuracy = 90%) but not in HepG2 cells or hepatocytes derived from embryonic stem cells (60 and 80%, respectively). DDI was also accurately classified when the gene expression changes were derived using the nCounter technology (accuracy = 89%). In addition, we found: (1) not all genes contributed equally to the correlations; (2) the minimal overlap in genes between the biomarker and the individual comparisons required for significant positive correlation was 10 genes, but usually was much higher; and (3) different sets of genes in the biomarker can by themselves contribute to the significant correlations. Overall, these results demonstrate the utility of the biomarker to accurately classify DDI agents. Environ. Mol. Mutagen. 59:772-784, 2018. Published 2018. This article is a U.S. Government work and is in the public domain in the USA.
SUBMITTER: Corton JC
PROVIDER: S-EPMC7875442 | biostudies-literature | 2018 Dec
REPOSITORIES: biostudies-literature
ACCESS DATA