Dataset Information

Clustering of fMRI data: the elusive optimal number of clusters.

ABSTRACT: Model-free methods are widely used for the processing of brain fMRI data collected under natural stimulations, sleep, or rest. Among them is the popular fuzzy c-mean algorithm, commonly combined with cluster validity (CV) indices to identify the 'true' number of clusters (components), in an unsupervised way. CV indices may however reveal different optimal c-partitions for the same fMRI data, and their effectiveness can be hindered by the high data dimensionality, the limited signal-to-noise ratio, the small proportion of relevant voxels, and the presence of artefacts or outliers. Here, the author investigated the behaviour of seven robust CV indices. A new CV index that incorporates both compactness and separation measures is also introduced. Using both artificial and real fMRI data, the findings highlight the importance of looking at the behavior of different compactness and separation measures, defined here as building blocks of CV indices, to depict a full description of the data structure, in particular when no agreement is found between CV indices. Overall, for fMRI, it makes sense to relax the assumption that only one unique c-partition exists, and appreciate that different c-partitions (with different optimal numbers of clusters) can be useful explanations of the data, given the hierarchical organization of many brain networks.

SUBMITTER: Seghier ML

PROVIDER: S-EPMC6173948 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Clustering of fMRI data: the elusive optimal number of clusters.

Seghier Mohamed L ML

PeerJ 20181003

Model-free methods are widely used for the processing of brain fMRI data collected under natural stimulations, sleep, or rest. Among them is the popular fuzzy <i>c</i>-mean algorithm, commonly combined with cluster validity (CV) indices to identify the 'true' number of clusters (components), in an unsupervised way. CV indices may however reveal different optimal <i>c</i>-partitions for the same fMRI data, and their effectiveness can be hindered by the high data dimensionality, the limited signal ...[more]

PMID: 30310731

Dataset Information

Clustering of fMRI data: the elusive optimal number of clusters.

Publications

Clustering of fMRI data: the elusive optimal number of clusters.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Penalized model-based clustering of fMRI data.
| S-EPMC9293048 | biostudies-literature

Cross-Clustering: A Partial Clustering Algorithm with Automatic Estimation of the Number of Clusters.
| S-EPMC4807765 | biostudies-literature

Normalized cut group clustering of resting-state FMRI data.
| S-EPMC2291558 | biostudies-literature

Analysis of whole-brain resting-state FMRI data using hierarchical clustering approach.
| S-EPMC3799854 | biostudies-literature

Elusive copy number variation in the mouse genome.
| S-EPMC2943477 | biostudies-literature

Multilook SAR Image Segmentation with an Unknown Number of Clusters Using a Gamma Mixture Model and Hierarchical Clustering.
| S-EPMC5470790 | biostudies-other

Optimal clustering under uncertainty.
| S-EPMC6168142 | biostudies-literature

NIFTI: an evolutionary approach for finding number of clusters in microarray data.
| S-EPMC2669482 | biostudies-literature

Optimal allocation of subjects in a matched pair cluster-randomized trial with fixed number of heterogeneous clusters.
| S-EPMC9097976 | biostudies-literature

Clust: automatic extraction of optimal co-expressed gene clusters from gene expression data.
| S-EPMC6203272 | biostudies-literature