Unknown

Dataset Information

0

A Multilevel Bayesian Approach to Improve Effect Size Estimation in Regression Modeling of Metabolomics Data Utilizing Imputation with Uncertainty.


ABSTRACT: To ensure scientific reproducibility of metabolomics data, alternative statistical methods are needed. A paradigm shift away from the p-value toward an embracement of uncertainty and interval estimation of a metabolite's true effect size may lead to improved study design and greater reproducibility. Multilevel Bayesian models are one approach that offer the added opportunity of incorporating imputed value uncertainty when missing data are present. We designed simulations of metabolomics data to compare multilevel Bayesian models to standard logistic regression with corrections for multiple hypothesis testing. Our simulations altered the sample size and the fraction of significant metabolites truly different between two outcome groups. We then introduced missingness to further assess model performance. Across simulations, the multilevel Bayesian approach more accurately estimated the effect size of metabolites that were significantly different between groups. Bayesian models also had greater power and mitigated the false discovery rate. In the presence of increased missing data, Bayesian models were able to accurately impute the true concentration and incorporating the uncertainty of these estimates improved overall prediction. In summary, our simulations demonstrate that a multilevel Bayesian approach accurately quantifies the estimated effect size of metabolite predictors in regression modeling, particularly in the presence of missing data.

SUBMITTER: Gillies CE 

PROVIDER: S-EPMC7465156 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Multilevel Bayesian Approach to Improve Effect Size Estimation in Regression Modeling of Metabolomics Data Utilizing Imputation with Uncertainty.

Gillies Christopher E CE   Jennaro Theodore S TS   Puskarich Michael A MA   Sharma Ruchi R   Ward Kevin R KR   Fan Xudong X   Jones Alan E AE   Stringer Kathleen A KA  

Metabolites 20200806 8


To ensure scientific reproducibility of metabolomics data, alternative statistical methods are needed. A paradigm shift away from the <i>p</i>-value toward an embracement of uncertainty and interval estimation of a metabolite's true effect size may lead to improved study design and greater reproducibility. Multilevel Bayesian models are one approach that offer the added opportunity of incorporating imputed value uncertainty when missing data are present. We designed simulations of metabolomics d  ...[more]

Similar Datasets

| S-EPMC3024253 | biostudies-literature
| S-EPMC4575250 | biostudies-literature
| S-EPMC6874355 | biostudies-literature
| S-EPMC7259793 | biostudies-literature
| S-EPMC7891623 | biostudies-literature
| S-EPMC6028324 | biostudies-literature
| S-EPMC4373540 | biostudies-literature
| S-EPMC4076806 | biostudies-other
| S-EPMC6343124 | biostudies-literature
| S-EPMC9314905 | biostudies-literature