Unknown

Dataset Information

0

A Zero-inflated Beta-binomial Model for Microbiome Data Analysis.


ABSTRACT: The microbiome is increasingly recognized as an important aspect of the health of host species, involved in many biological pathways and processes and potentially useful as health biomarkers. Taking advantage of high-throughput sequencing technologies, modern bacterial microbiome studies are metagenomic, interrogating thousands of taxa simultaneously. Several data analysis frameworks have been proposed for microbiome sequence read count data and determining the most significant features. However, there is still room for improvement. We introduce a zero-inflated beta-binomial (ZIBB) to model the distribution of microbiome count data and to determine association with a continuous or categorical phenotype of interest. The approach can exploit mean-variance relationships to improve power and adjust for covariates. The proposed method is a mixture model with two components: (i) a zero model accounting for excess zeros and (ii) a count model to capture the remaining component by beta-binomial regression, allowing for overdispersion effects. Simulation studies show that our proposed method effectively controls type I error and has higher power than competing methods to detect taxa associated with phenotype. An R package ZIBBSeqDiscovery is available on R CRAN.

SUBMITTER: Hu T 

PROVIDER: S-EPMC6124506 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Zero-inflated Beta-binomial Model for Microbiome Data Analysis.

Hu Tao T   Gallins Paul P   Zhou Yi-Hui YH  

Stat (International Statistical Institute) 20180619 1


The microbiome is increasingly recognized as an important aspect of the health of host species, involved in many biological pathways and processes and potentially useful as health biomarkers. Taking advantage of high-throughput sequencing technologies, modern bacterial microbiome studies are metagenomic, interrogating thousands of taxa simultaneously. Several data analysis frameworks have been proposed for microbiome sequence read count data and determining the most significant features. However  ...[more]

Similar Datasets

| S-EPMC8487663 | biostudies-literature
| S-EPMC7410344 | biostudies-literature
| S-EPMC4692373 | biostudies-literature
| S-EPMC6873424 | biostudies-literature
| S-EPMC7768662 | biostudies-literature
| S-EPMC9011970 | biostudies-literature
| S-EPMC10500952 | biostudies-literature
| S-EPMC6109378 | biostudies-literature
| S-EPMC8892984 | biostudies-literature
| S-EPMC7652264 | biostudies-literature