Unknown

Dataset Information

0

A comprehensive evaluation of binning methods to recover human gut microbial species from a non-redundant reference gene catalog.


ABSTRACT: The human gut microbiota performs functions that are essential for the maintenance of the host physiology. However, characterizing the functioning of microbial communities in relation to the host remains challenging in reference-based metagenomic analyses. Indeed, as taxonomic and functional analyses are performed independently, the link between genes and species remains unclear. Although a first set of species-level bins was built by clustering co-abundant genes, no reference bin set is established on the most used gut microbiota catalog, the Integrated Gene Catalog (IGC). With the aim to identify the best suitable method to group the IGC genes, we benchmarked nine taxonomy-independent binners implementing abundance-based, hybrid and integrative approaches. To this purpose, we designed a simulated non-redundant gene catalog (SGC) and computed adapted assessment metrics. Overall, the best trade-off between the main metrics is reached by an integrative binner. For each approach, we then compared the results of the best-performing binner with our expected community structures and applied the method to the IGC. The three approaches are distinguished by specific advantages, and by inherent or scalability limitations. Hybrid and integrative binners show promising and potentially complementary results but require improvements to be used on the IGC to recover human gut microbial species.

SUBMITTER: Borderes M 

PROVIDER: S-EPMC7936653 | biostudies-literature | 2021 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

A comprehensive evaluation of binning methods to recover human gut microbial species from a non-redundant reference gene catalog.

Borderes Marianne M   Gasc Cyrielle C   Prestat Emmanuel E   Galvão Ferrarini Mariana M   Vinga Susana S   Boucinha Lilia L   Sagot Marie-France MF  

NAR genomics and bioinformatics 20210301 1


The human gut microbiota performs functions that are essential for the maintenance of the host physiology. However, characterizing the functioning of microbial communities in relation to the host remains challenging in reference-based metagenomic analyses. Indeed, as taxonomic and functional analyses are performed independently, the link between genes and species remains unclear. Although a first set of species-level bins was built by clustering co-abundant genes, no reference bin set is establi  ...[more]

Similar Datasets

| S-BSST297 | biostudies-other
| S-EPMC7801254 | biostudies-literature
| S-EPMC7044274 | biostudies-literature
| S-EPMC8394144 | biostudies-literature
| S-EPMC7889623 | biostudies-literature
| S-EPMC4563710 | biostudies-other
| S-EPMC4129434 | biostudies-literature
| S-EPMC2940224 | biostudies-literature
2015-04-09 | GSE61564 | GEO
| S-EPMC4534496 | biostudies-literature