Unknown

Dataset Information

0

Metalign: efficient alignment-based metagenomic profiling via containment min hash.


ABSTRACT: Metagenomic profiling, predicting the presence and relative abundances of microbes in a sample, is a critical first step in microbiome analysis. Alignment-based approaches are often considered accurate yet computationally infeasible. Here, we present a novel method, Metalign, that performs efficient and accurate alignment-based metagenomic profiling. We use a novel containment min hash approach to pre-filter the reference database prior to alignment and then process both uniquely aligned and multi-aligned reads to produce accurate abundance estimates. In performance evaluations on both real and simulated datasets, Metalign is the only method evaluated that maintained high performance and competitive running time across all datasets.

SUBMITTER: LaPierre N 

PROVIDER: S-EPMC7488264 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Metalign: efficient alignment-based metagenomic profiling via containment min hash.

LaPierre Nathan N   Alser Mohammed M   Eskin Eleazar E   Koslicki David D   Mangul Serghei S  

Genome biology 20200910 1


Metagenomic profiling, predicting the presence and relative abundances of microbes in a sample, is a critical first step in microbiome analysis. Alignment-based approaches are often considered accurate yet computationally infeasible. Here, we present a novel method, Metalign, that performs efficient and accurate alignment-based metagenomic profiling. We use a novel containment min hash approach to pre-filter the reference database prior to alignment and then process both uniquely aligned and mul  ...[more]

Similar Datasets

| S-EPMC9027990 | biostudies-literature
| S-EPMC5845352 | biostudies-literature
| S-EPMC4687345 | biostudies-other
| S-EPMC5860611 | biostudies-literature
| S-EPMC9651046 | biostudies-literature
| S-EPMC3315303 | biostudies-literature
| S-EPMC6935933 | biostudies-literature
| S-EPMC8777787 | biostudies-literature
| S-EPMC1579236 | biostudies-literature
| S-EPMC6821304 | biostudies-literature