Unknown

Dataset Information

0

HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity.


ABSTRACT: The human microbiome is a collection of microorganisms. They form complex communities and collectively affect host health. Recently, the advances in next-generation sequencing technology enable the high-throughput profiling of the human microbiome. This calls for a statistical model to construct microbial networks from the microbiome sequencing count data. As microbiome count data are high-dimensional and suffer from uneven sampling depth, over-dispersion, and zero-inflation, these characteristics can bias the network estimation and require specialized analytical tools. Here we propose a general framework, HARMONIES, Hybrid Approach foR MicrobiOme Network Inferences via Exploiting Sparsity, to infer a sparse microbiome network. HARMONIES first utilizes a zero-inflated negative binomial (ZINB) distribution to model the skewness and excess zeros in the microbiome data, as well as incorporates a stochastic process prior for sample-wise normalization. This approach infers a sparse and stable network by imposing non-trivial regularizations based on the Gaussian graphical model. In comprehensive simulation studies, HARMONIES outperformed four other commonly used methods. When using published microbiome data from a colorectal cancer study, it discovered a novel community with disease-enriched bacteria. In summary, HARMONIES is a novel and useful statistical framework for microbiome network inference, and it is available at https://github.com/shuangj00/HARMONIES.

SUBMITTER: Jiang S 

PROVIDER: S-EPMC7283552 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity.

Jiang Shuang S   Xiao Guanghua G   Koh Andrew Y AY   Chen Yingfei Y   Yao Bo B   Li Qiwei Q   Zhan Xiaowei X  

Frontiers in genetics 20200603


The human microbiome is a collection of microorganisms. They form complex communities and collectively affect host health. Recently, the advances in next-generation sequencing technology enable the high-throughput profiling of the human microbiome. This calls for a statistical model to construct microbial networks from the microbiome sequencing count data. As microbiome count data are high-dimensional and suffer from uneven sampling depth, over-dispersion, and zero-inflation, these characteristi  ...[more]

Similar Datasets

| S-EPMC7736637 | biostudies-literature
| S-EPMC3662697 | biostudies-other
| S-EPMC7426627 | biostudies-literature
| S-EPMC4827608 | biostudies-literature
| S-EPMC2781012 | biostudies-literature
| S-EPMC4868975 | biostudies-literature
| PRJEB48705 | ENA
| PRJEB48706 | ENA
| S-EPMC2258286 | biostudies-literature
| S-EPMC2848617 | biostudies-literature