ABSTRACT:
SUBMITTER: Hu CW
PROVIDER: S-EPMC4533525 | biostudies-literature | 2015 Aug
REPOSITORIES: biostudies-literature
Chenyue W. Hu, Steven M. Kornblau, John H. Slater, Amina A. Qutub
Scientific Reports, 2015-08-12
Estimating the optimal number of clusters is a major challenge in applying cluster analysis to any type of dataset, especially to biomedical datasets, which are high-dimensional and complex. Here, we introduce an improved method, Progeny Clustering, which is stability-based and exceptionally efficient in computing, to find the ideal number of clusters. The algorithm employs a novel Progeny Sampling method to reconstruct cluster identity, a co-occurrence probability matrix to assess the clusterin ...