Dataset Information

Benchmarking Sepsis Gene Expression Diagnostics Using Public Data.

ABSTRACT: OBJECTIVE:In response to a need for better sepsis diagnostics, several new gene expression classifiers have been recently published, including the 11-gene "Sepsis MetaScore," the "FAIM3-to-PLAC8" ratio, and the Septicyte Lab. We performed a systematic search for publicly available gene expression data in sepsis and tested each gene expression classifier in all included datasets. We also created a public repository of sepsis gene expression data to encourage their future reuse. DATA SOURCES:We searched National Institutes of Health Gene Expression Omnibus and EBI ArrayExpress for human gene expression microarray datasets. We also included the Glue Grant trauma gene expression cohorts. STUDY SELECTION:We selected clinical, time-matched, whole blood studies of sepsis and acute infections as compared to healthy and/or noninfectious inflammation patients. We identified 39 datasets composed of 3,241 samples from 2,604 patients. DATA EXTRACTION:All data were renormalized from raw data, when available, using consistent methods. DATA SYNTHESIS:Mean validation areas under the receiver operating characteristic curve for discriminating septic patients from patients with noninfectious inflammation for the Sepsis MetaScore, the FAIM3-to-PLAC8 ratio, and the Septicyte Lab were 0.82 (range, 0.73-0.89), 0.78 (range, 0.49-0.96), and 0.73 (range, 0.44-0.90), respectively. Paired-sample t tests of validation datasets showed no significant differences in area under the receiver operating characteristic curves. Mean validation area under the receiver operating characteristic curves for discriminating infected patients from healthy controls for the Sepsis MetaScore, FAIM3-to-PLAC8 ratio, and Septicyte Lab were 0.97 (range, 0.85-1.0), 0.94 (range, 0.65-1.0), and 0.71 (range, 0.24-1.0), respectively. There were few significant differences in any diagnostics due to pathogen type. CONCLUSIONS:The three diagnostics do not show significant differences in overall ability to distinguish noninfectious systemic inflammatory response syndrome from sepsis, though the performance in some datasets was low (area under the receiver operating characteristic curve, < 0.7) for the FAIM3-to-PLAC8 ratio and Septicyte Lab. The Septicyte Lab also demonstrated significantly worse performance in discriminating infections as compared to healthy controls. Overall, public gene expression data are a useful tool for benchmarking gene expression diagnostics.

SUBMITTER: Sweeney TE

PROVIDER: S-EPMC5518756 | biostudies-literature | 2017 Jan

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Benchmarking Sepsis Gene Expression Diagnostics Using Public Data.

Sweeney Timothy E TE Khatri Purvesh P

Critical care medicine 20170101 1

<h4>Objective</h4>In response to a need for better sepsis diagnostics, several new gene expression classifiers have been recently published, including the 11-gene "Sepsis MetaScore," the "FAIM3-to-PLAC8" ratio, and the Septicyte Lab. We performed a systematic search for publicly available gene expression data in sepsis and tested each gene expression classifier in all included datasets. We also created a public repository of sepsis gene expression data to encourage their future reuse.<h4>Data so ...[more]

PMID: 27681387

Dataset Information

Benchmarking Sepsis Gene Expression Diagnostics Using Public Data.

Publications

Benchmarking Sepsis Gene Expression Diagnostics Using Public Data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Sepsis diagnostics using MinION
| PRJEB28340 | ENA

Integrating biological knowledge and gene expression data using pathway-guided random forests: a benchmarking study.
| S-EPMC7520048 | biostudies-literature

Discovery and preclinical validation of drug indications using compendia of public gene expression data.
| S-EPMC3502016 | biostudies-literature

Comparison of machine-learning methodologies for accurate diagnosis of sepsis using microarray gene expression data.
| S-EPMC8128240 | biostudies-literature

Benchmarking RNA-seq differential expression analysis methods using spike-in and simulation data.
| S-EPMC7192453 | biostudies-literature

Benchmarking of RNA-sequencing analysis workflows using whole-transcriptome RT-qPCR expression data.
| S-EPMC5431503 | biostudies-literature

Optimization of the medical diagnostics and therapy managing in colorectal tumors using F-18-Deoxyglucose (FDG) positron emission tomography (PET) and gene expression data (GenPET Colon-2008).
| 2428821 | ecrin-mdr-crc

Analytical Strategy to Prioritize Alzheimer's Disease Candidate Genes in Gene Regulatory Networks Using Public Expression Data.
| S-EPMC5611835 | biostudies-literature

Benchmarking ortholog identification methods using functional genomics data.
| S-EPMC1557999 | biostudies-literature

Replication Study: Discovery and preclinical validation of drug indications using compendia of public gene expression data.
| S-EPMC5245962 | biostudies-other