Unknown

Dataset Information

0

GePMI: A statistical model for personal intestinal microbiome identification.


ABSTRACT: Human gut microbiomes consist of a large number of microbial genomes, which vary by diet and health conditions and from individual to individual. In the present work, we asked whether such variation or similarity could be measured and, if so, whether the results could be used for personal microbiome identification (PMI). To address this question, we herein propose a method to estimate the significance of similarity among human gut metagenomic samples based on reference-free, long k-mer features. Using these features, we find that pairwise similarities between the metagenomes of any two individuals obey a beta distribution and that a p value derived accordingly well characterizes whether two samples are from the same individual or not. We develop a computational framework called GePMI (Generating inter-individual similarity distribution for Personal Microbiome Identification) and apply it to several human gut metagenomic datasets (>300 individuals and >600 samples in total). From the results of GePMI, most of the human gut microbiomes can be identified (auROC?=?0.9470, auPRC?=?0.8702). Even after antibiotic treatment or fecal microbiota transplantation, the individual k-mer signature still maintains a certain specificity.

SUBMITTER: Wang Z 

PROVIDER: S-EPMC6123480 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

GePMI: A statistical model for personal intestinal microbiome identification.

Wang Zicheng Z   Lou Huazhe H   Wang Ying Y   Shamir Ron R   Jiang Rui R   Chen Ting T  

NPJ biofilms and microbiomes 20180904


Human gut microbiomes consist of a large number of microbial genomes, which vary by diet and health conditions and from individual to individual. In the present work, we asked whether such variation or similarity could be measured and, if so, whether the results could be used for personal microbiome identification (PMI). To address this question, we herein propose a method to estimate the significance of similarity among human gut metagenomic samples based on reference-free, long <i>k</i>-mer fe  ...[more]

Similar Datasets

| S-EPMC6059399 | biostudies-literature
| S-EPMC7857535 | biostudies-literature
| S-EPMC6959583 | biostudies-literature
| S-EPMC4629800 | biostudies-literature
| S-EPMC9869231 | biostudies-literature
| S-EPMC5532649 | biostudies-other
| S-EPMC10249256 | biostudies-literature
| S-EPMC5612788 | biostudies-other
| S-EPMC9294433 | biostudies-literature
| S-EPMC4148184 | biostudies-literature