Unknown

Dataset Information

0

Swarm: robust and fast clustering method for amplicon-based studies.


ABSTRACT: Popular de novo amplicon clustering methods suffer from two fundamental flaws: arbitrary global clustering thresholds, and input-order dependency induced by centroid selection. Swarm was developed to address these issues by first clustering nearly identical amplicons iteratively using a local threshold, and then by using clusters' internal structure and amplicon abundances to refine its results. This fast, scalable, and input-order independent approach reduces the influence of clustering parameters and produces robust operational taxonomic units.

SUBMITTER: Mahe F 

PROVIDER: S-EPMC4178461 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Swarm: robust and fast clustering method for amplicon-based studies.

Mahé Frédéric F   Rognes Torbjørn T   Quince Christopher C   de Vargas Colomban C   Dunthorn Micah M  

PeerJ 20140925


Popular de novo amplicon clustering methods suffer from two fundamental flaws: arbitrary global clustering thresholds, and input-order dependency induced by centroid selection. Swarm was developed to address these issues by first clustering nearly identical amplicons iteratively using a local threshold, and then by using clusters' internal structure and amplicon abundances to refine its results. This fast, scalable, and input-order independent approach reduces the influence of clustering paramet  ...[more]

Similar Datasets

| S-EPMC8696092 | biostudies-literature
| S-EPMC4690345 | biostudies-literature
| S-EPMC5531809 | biostudies-literature
| S-EPMC3465711 | biostudies-literature
| S-EPMC5579835 | biostudies-literature
| S-EPMC6581440 | biostudies-literature
| S-EPMC2648900 | biostudies-literature
| S-EPMC5395793 | biostudies-literature
| S-EPMC7826264 | biostudies-literature
| S-EPMC4979957 | biostudies-literature