Dataset Information

MorphoCluster: Efficient Annotation of Plankton Images by Clustering.


ABSTRACT: In this work, we present MorphoCluster, a software tool for data-driven, fast, and accurate annotation of large image data sets. The volume and complexity of marine data already surpass the annotation rate of human experts and will continue to increase in the coming years. Still, this data requires interpretation. MorphoCluster augments the human ability to discover patterns and perform object classification in large amounts of data by embedding unsupervised clustering in an interactive process. By aggregating similar images into clusters, our novel approach to image annotation increases consistency, multiplies the throughput of an annotator, and allows experts to adapt the granularity of their sorting scheme to the structure in the data. By sorting a set of 1.2 M objects into 280 data-driven classes in 71 h (16 k objects per hour), with 90% of these classes having a precision of 0.889 or higher, we show that MorphoCluster is at the same time fast, accurate, and consistent; provides a fine-grained and data-driven classification; and enables novelty detection.
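
The abstract's central idea, that one annotator decision per cluster labels many objects at once, can be illustrated with a minimal sketch. This is not MorphoCluster's implementation: the function names, the toy cluster assignments, and the labels below are hypothetical and chosen only for illustration, and the clustering step itself is assumed to have happened already.

```python
from collections import defaultdict


def propagate_cluster_labels(cluster_ids, cluster_labels):
    """Give each object the label the annotator assigned to its cluster.

    cluster_ids: one cluster id per object (hypothetical output of any
        unsupervised clustering step).
    cluster_labels: dict mapping cluster id -> label chosen by the annotator.
    Objects in clusters without a label stay None.
    """
    return [cluster_labels.get(cid) for cid in cluster_ids]


def per_class_precision(predicted, truth):
    """Precision per predicted class: correct objects / objects assigned."""
    assigned = defaultdict(int)
    correct = defaultdict(int)
    for p, t in zip(predicted, truth):
        if p is None:
            continue  # unlabeled objects do not count toward any class
        assigned[p] += 1
        if p == t:
            correct[p] += 1
    return {label: correct[label] / assigned[label] for label in assigned}


# Toy data, made up for illustration: 8 objects grouped into 3 clusters.
cluster_ids = [0, 0, 0, 1, 1, 2, 2, 2]
truth = ["copepod", "copepod", "detritus", "diatom", "diatom",
         "detritus", "detritus", "detritus"]

# The annotator makes three decisions, one per cluster.
cluster_labels = {0: "copepod", 1: "diatom", 2: "detritus"}

predicted = propagate_cluster_labels(cluster_ids, cluster_labels)
precision = per_class_precision(predicted, truth)
objects_per_decision = len(predicted) / len(cluster_labels)

print(predicted)
print(precision)
print(objects_per_decision)
```

In this toy case three decisions label eight objects, and the one impure cluster lowers the precision of its class, which is the kind of per-class precision the abstract reports.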

SUBMITTER: Schröder SM

PROVIDER: S-EPMC7308937 | biostudies-literature | 2020 May

REPOSITORIES: biostudies-literature

Publications

MorphoCluster: Efficient Annotation of Plankton Images by Clustering.

Simon-Martin Schröder, Rainer Kiko, Reinhard Koch

Sensors (Basel, Switzerland), 2020-05-28, issue 11

Similar Datasets

| S-EPMC7376023 | biostudies-literature
| S-EPMC1845149 | biostudies-literature
| S-EPMC552314 | biostudies-literature
| S-EPMC10340780 | biostudies-literature
| S-EPMC4803255 | biostudies-literature
| S-EPMC6437941 | biostudies-literature
| S-EPMC7902667 | biostudies-literature
| S-EPMC11326248 | biostudies-literature
| S-EPMC9477531 | biostudies-literature
| S-EPMC4213798 | biostudies-literature