Unknown

Dataset Information

0

Pet-Human Gut Microbiome Host Classifier Using Data from Different Studies.


ABSTRACT: (1) Background: microbiome host classification can be used to identify sources of contamination in environmental data. However, there is no ready-to-use host classifier. Here, we aimed to build a model that would be able to discriminate between pet and human microbiomes samples. The challenge of the study was to build a classifier using data solely from publicly available studies that normally contain sequencing data for only one type of host. (2) Results: we have developed a random forest model that distinguishes human microbiota from domestic pet microbiota (cats and dogs) with 97% accuracy. In order to prevent overfitting, samples from several (at least four) different projects were necessary. Feature importance analysis revealed that the model relied on several taxa known to be key components in domestic cat and dog microbiomes (such as Fusobacteriaceae and Peptostreptococcaeae), as well as on some taxa exclusively found in humans (as Akkermansiaceae). (3) Conclusion: we have shown that it is possible to make a reliable pet/human gut microbiome classifier on the basis of the data collected from different studies.

SUBMITTER: Bykova N 

PROVIDER: S-EPMC7602744 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Pet-Human Gut Microbiome Host Classifier Using Data from Different Studies.

Bykova Nadia N   Litovka Nikita N   Popenko Anna A   Musienko Sergey S  

Microorganisms 20201015 10


(1) Background: microbiome host classification can be used to identify sources of contamination in environmental data. However, there is no ready-to-use host classifier. Here, we aimed to build a model that would be able to discriminate between pet and human microbiomes samples. The challenge of the study was to build a classifier using data solely from publicly available studies that normally contain sequencing data for only one type of host. (2) Results: we have developed a random forest model  ...[more]

Similar Datasets

| S-EPMC6522487 | biostudies-other
| S-EPMC6776654 | biostudies-literature
| S-EPMC7909775 | biostudies-literature
| S-EPMC9685977 | biostudies-literature
| S-EPMC7677204 | biostudies-literature
2022-07-29 | PXD024815 | Pride
| S-EPMC8515199 | biostudies-literature
| S-EPMC11307328 | biostudies-literature
| S-EPMC9087276 | biostudies-literature
| S-EPMC6563668 | biostudies-literature