Dataset Information

Solving text clustering problem using a memetic differential evolution algorithm.

ABSTRACT: The text clustering is considered as one of the most effective text document analysis methods, which is applied to cluster documents as a consequence of the expanded big data and online information. Based on the review of the related work of the text clustering algorithms, these algorithms achieved reasonable clustering results for some datasets, while they failed on a wide variety of benchmark datasets. Furthermore, the performance of these algorithms was not robust due to the inefficient balance between the exploitation and exploration capabilities of the clustering algorithm. Accordingly, this research proposes a Memetic Differential Evolution algorithm (MDETC) to solve the text clustering problem, which aims to address the effect of the hybridization between the differential evolution (DE) mutation strategy with the memetic algorithm (MA). This hybridization intends to enhance the quality of text clustering and improve the exploitation and exploration capabilities of the algorithm. Our experimental results based on six standard text clustering benchmark datasets (i.e. the Laboratory of Computational Intelligence (LABIC)) have shown that the MDETC algorithm outperformed other compared clustering algorithms based on AUC metric, F-measure, and the statistical analysis. Furthermore, the MDETC is compared with the state of art text clustering algorithms and obtained almost the best results for the standard benchmark datasets.

SUBMITTER: Mustafa HMJ

PROVIDER: S-EPMC7289410 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Solving text clustering problem using a memetic differential evolution algorithm.

Mustafa Hossam M J HMJ Ayob Masri M Albashish Dheeb D Abu-Taleb Sawsan S

PloS one 20200611 6

The text clustering is considered as one of the most effective text document analysis methods, which is applied to cluster documents as a consequence of the expanded big data and online information. Based on the review of the related work of the text clustering algorithms, these algorithms achieved reasonable clustering results for some datasets, while they failed on a wide variety of benchmark datasets. Furthermore, the performance of these algorithms was not robust due to the inefficient balan ...[more]

PMID: 32525869

Dataset Information

Solving text clustering problem using a memetic differential evolution algorithm.

Publications

Solving text clustering problem using a memetic differential evolution algorithm.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A Neighborhood Grid Clustering Algorithm for Solving Localization Problem in WSN Using Genetic Algorithm.
| S-EPMC9256383 | biostudies-literature

A heuristic algorithm solving the mutual-exclusivity-sorting problem.
| S-EPMC9857977 | biostudies-literature

A hybrid particle swarm optimization algorithm for solving engineering problem.
| S-EPMC11375002 | biostudies-literature

A New Look at Infant Problem-Solving: Using DeepLabCut to Investigate Exploratory Problem-Solving Approaches.
| S-EPMC8606407 | biostudies-literature

Clustering knowledge and dispersing abilities enhances collective problem solving in a network.
| S-EPMC6853876 | biostudies-literature

A whale optimization algorithm based on atom-like structure differential evolution for solving engineering design problems.
| S-EPMC10774322 | biostudies-literature

Preconditioned alternating projection algorithm for solving the penalized-likelihood SPECT reconstruction problem.
| S-EPMC5573596 | biostudies-literature

Complex harmonic regularization with differential evolution in a memetic framework for biomarker selection.
| S-EPMC6375558 | biostudies-literature

Solving optimization problems simultaneously: the variants of the traveling salesman problem with time windows using multifactorial evolutionary algorithm.
| S-EPMC10280250 | biostudies-literature

A novel algorithm combining finite state method and genetic algorithm for solving crude oil scheduling problem.
| S-EPMC3948204 | biostudies-other