Dataset Information

A Clustering Optimization Strategy for Molecular Taxonomy Applied to Planktonic Foraminifera SSU rDNA.


ABSTRACT: Identifying species is challenging in the case of organisms for which primarily molecular data are available. Even if morphological features are available, molecular taxonomy is often necessary to revise taxonomic concepts and to analyze environmental DNA sequences. However, clustering approaches to delineate molecular operational taxonomic units often rely on arbitrary parameter choices. Also, distance calculation is difficult for highly alignment-ambiguous sequences. Here, we applied a recently described clustering optimization method to highly divergent planktonic foraminifera SSU rDNA sequences. We determined the distance function and the clustering setting that result in the highest agreement with morphological reference data. Alignment-free distance calculation, when adapted to the use with partly non-homologous sequences caused by distinct primer pairs, outperformed multiple sequence alignment. Clustering optimization offers new perspectives for the barcoding of species diversity and for environmental sequencing. It bridges the gap between traditional and modern taxonomic disciplines by specifically addressing the issue of how to optimally account for both genetic divergence and given species concepts.
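The optimization idea in the abstract, choosing the clustering setting that agrees best with reference data, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes single-linkage clustering at a distance threshold and uses the adjusted Rand index as the agreement measure; both choices, along with the function names, the toy distance matrix, and the reference labels, are hypothetical and serve only as illustration.

```python
from collections import Counter
from itertools import combinations
from math import comb


def single_linkage(dist, threshold):
    """Link every pair with distance <= threshold; return a cluster id per item."""
    parent = list(range(len(dist)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(dist)), 2):
        if dist[i][j] <= threshold:
            parent[find(i)] = find(j)
    return [find(i) for i in range(len(dist))]


def adjusted_rand(a, b):
    """Adjusted Rand index between two partitions given as label lists."""
    total = comb(len(a), 2)
    if total == 0:
        return 1.0
    sum_ab = sum(comb(c, 2) for c in Counter(zip(a, b)).values())
    sum_a = sum(comb(c, 2) for c in Counter(a).values())
    sum_b = sum(comb(c, 2) for c in Counter(b).values())
    expected = sum_a * sum_b / total
    max_index = (sum_a + sum_b) / 2
    if max_index == expected:  # both partitions are trivial and identical
        return 1.0
    return (sum_ab - expected) / (max_index - expected)


def optimize_threshold(dist, reference, thresholds):
    """Return (threshold, score) with the highest agreement with the reference."""
    scored = [(adjusted_rand(single_linkage(dist, t), reference), t) for t in thresholds]
    best_score, best_t = max(scored, key=lambda pair: pair[0])
    return best_t, best_score


# Toy data (invented): four sequences, two reference morphospecies.
dist = [
    [0.0, 0.1, 0.8, 0.9],
    [0.1, 0.0, 0.7, 0.8],
    [0.8, 0.7, 0.0, 0.2],
    [0.9, 0.8, 0.2, 0.0],
]
reference = ["A", "A", "B", "B"]
best_t, best_score = optimize_threshold(dist, reference, [0.05, 0.3, 0.75])
```

In this toy case the lowest threshold splits every sequence into its own cluster and the highest merges all four, so the middle threshold is the only one that reproduces the reference grouping. The same loop could be repeated over several distance functions to compare them on equal terms.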

SUBMITTER: Göker M

PROVIDER: S-EPMC2964048 | biostudies-literature | 2010 Sep

REPOSITORIES: biostudies-literature

Publications

A Clustering Optimization Strategy for Molecular Taxonomy Applied to Planktonic Foraminifera SSU rDNA.

Göker M, Grimm GW, Auch AF, Aurahs R, Kučera M

Evolutionary Bioinformatics Online, 2010-09-09


Similar Datasets

| S-EPMC4131912 | biostudies-literature
| S-EPMC2808177 | biostudies-literature
| S-EPMC10143585 | biostudies-literature
| S-EPMC2694157 | biostudies-literature
| S-EPMC3361484 | biostudies-literature
| S-EPMC6346091 | biostudies-literature
| S-EPMC4810817 | biostudies-other
| S-EPMC6111889 | biostudies-literature
| S-EPMC6791547 | biostudies-literature
| S-EPMC7562931 | biostudies-literature