
Dataset Information


CAMAMED: a pipeline for composition-aware mapping-based analysis of metagenomic data.


ABSTRACT: Metagenomics is the study of genomic DNA recovered from a microbial community. Both assembly-based and mapping-based methods have been used to analyze metagenomic data. When appropriate gene catalogs are available, mapping-based methods are preferred over assembly-based approaches, especially for analyzing the data at the functional level. In this study, we introduce CAMAMED, a composition-aware, mapping-based pipeline for metagenomic data analysis. The pipeline can analyze metagenomic samples at both the taxonomic and functional profiling levels. Metagenome sequences are mapped to non-redundant gene catalogs to obtain the gene frequencies in each sample. Because metagenomic data are highly compositional, the pipeline applies cumulative sum scaling at both the taxon and gene levels for compositional data analysis. Additionally, by mapping the genes to the KEGG database, annotations for each gene can be extracted at different functional levels, such as KEGG ortholog groups, Enzyme Commission numbers and reactions. Furthermore, the pipeline enables the user to identify potential biomarkers in case-control metagenomic samples by investigating functional differences. The source code for this software is available from https://github.com/mhnb/camamed. Ready-to-use Docker images are also available at https://hub.docker.com.
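
The cumulative sum scaling step mentioned in the abstract can be illustrated with a minimal sketch. This is not CAMAMED's code: the function name `css_normalize`, the fixed `quantile`, the `scale` constant and the toy counts are all assumptions made for illustration, and published CSS implementations typically choose the quantile from the data rather than fixing it.

```python
import math


def css_normalize(counts, quantile=0.5, scale=1000.0):
    """Scale one sample's gene counts by a cumulative-sum factor.

    counts   -- dict mapping gene id to raw mapped-read count (hypothetical input)
    quantile -- fixed quantile of the nonzero counts (assumption; real CSS
                implementations usually estimate it from the data)
    scale    -- arbitrary constant that keeps normalized values readable
    """
    nonzero = sorted(c for c in counts.values() if c > 0)
    if not nonzero:
        # Nothing mapped in this sample, so every normalized value is zero.
        return {gene: 0.0 for gene in counts}
    # Nearest-rank quantile of the nonzero counts.
    rank = max(1, math.ceil(quantile * len(nonzero)))
    threshold = nonzero[rank - 1]
    # Scaling factor: sum of the counts at or below the quantile threshold,
    # so a few very abundant genes do not dominate the normalization.
    factor = sum(c for c in nonzero if c <= threshold)
    return {gene: scale * c / factor for gene, c in counts.items()}


# Toy sample (made-up counts): one highly abundant gene and one unobserved gene.
sample = {"g1": 1, "g2": 2, "g3": 3, "g4": 100, "g5": 0}
normalized = css_normalize(sample)
```

Dividing by the sum of counts up to a quantile, rather than by the total count, keeps the highly abundant gene `g4` from shrinking every other normalized value in the sample.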

SUBMITTER: Norouzi-Beirami MH 

PROVIDER: S-EPMC7787360 | biostudies-literature | 2021 Mar

REPOSITORIES: biostudies-literature


Publications

CAMAMED: a pipeline for composition-aware mapping-based analysis of metagenomic data.

Norouzi-Beirami MH, Marashi SA, Banaei-Moghaddam AM, Kavousi K

NAR Genomics and Bioinformatics, 2021-01-06, issue 1



Similar Datasets

| S-EPMC10072060 | biostudies-literature
| S-EPMC6138922 | biostudies-literature
| S-EPMC8256542 | biostudies-literature
| S-EPMC6543708 | biostudies-literature
2021-07-26 | E-MTAB-9189 | biostudies-arrayexpress
2021-07-26 | E-MTAB-9191 | biostudies-arrayexpress
| S-EPMC6353838 | biostudies-literature
| S-EPMC6676585 | biostudies-literature
| S-EPMC7319573 | biostudies-literature
| S-EPMC5907781 | biostudies-literature