Unknown

Dataset Information

0

Capturing the most wanted taxa through cross-sample correlations.


ABSTRACT: The Human Microbiome Project (HMP) identified the 16S rRNA gene sequences of 'most wanted' taxa-prevalent in the healthy human microbiota but distant from previously known sequences. Since 2012, few of the corresponding genomes have been isolated and sequenced, and only through advanced isolation techniques. We demonstrate that the genomes of the most wanted taxa can be identified computationally through their correlation in abundance across multiple public metagenomic data sets. We link over 200 most wanted sequences with nearly complete genome sequences, including half of the taxa identified as high-priority targets by the HMP. The genomes we identify have strong similarity to genomes reconstructed through expensive isolation techniques, and provide a more complete functional characterization of these organisms than can be extrapolated from their 16S rRNA gene. We also provide insights into the function of organisms for which 16S rRNA gene signatures were recently reported to be associated with health and host genetic factors.

SUBMITTER: Almeida M 

PROVIDER: S-EPMC5030688 | biostudies-literature | 2016 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Capturing the most wanted taxa through cross-sample correlations.

Almeida Mathieu M   Pop Mihai M   Le Chatelier Emmanuelle E   Prifti Edi E   Pons Nicolas N   Ghozlane Amine A   Ehrlich S Dusko SD  

The ISME journal 20160304 10


The Human Microbiome Project (HMP) identified the 16S rRNA gene sequences of 'most wanted' taxa-prevalent in the healthy human microbiota but distant from previously known sequences. Since 2012, few of the corresponding genomes have been isolated and sequenced, and only through advanced isolation techniques. We demonstrate that the genomes of the most wanted taxa can be identified computationally through their correlation in abundance across multiple public metagenomic data sets. We link over 20  ...[more]

Similar Datasets

| S-EPMC3406062 | biostudies-literature
| S-EPMC4103313 | biostudies-literature
| S-EPMC4880562 | biostudies-literature
| S-EPMC4491744 | biostudies-literature
2024-05-30 | GSE226231 | GEO
| S-EPMC4234437 | biostudies-literature
| S-EPMC6121835 | biostudies-other
| S-EPMC3897163 | biostudies-literature
| S-EPMC3525842 | biostudies-literature
2024-05-30 | GSE226229 | GEO