Unknown

Dataset Information

0

Tight clustering for large datasets with an application to gene expression data.


ABSTRACT: This article proposes a practical and scalable version of the tight clustering algorithm. The tight clustering algorithm provides tight and stable relevant clusters as output while leaving a set of points as noise or scattered points, that would not go into any cluster. However, the computational limitation to achieve this precise target of tight clusters prohibits it from being used for large microarray gene expression data or any other large data set, which are common nowadays. We propose a pragmatic and scalable version of the tight clustering method that is applicable to data sets of very large size and deduce the properties of the proposed algorithm. We validate our algorithm with extensive simulation study and multiple real data analyses including analysis of real data on gene expression.

SUBMITTER: Karmakar B 

PROVIDER: S-EPMC6395712 | biostudies-literature | 2019 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Tight clustering for large datasets with an application to gene expression data.

Karmakar Bikram B   Das Sarmistha S   Bhattacharya Sohom S   Sarkar Rohan R   Mukhopadhyay Indranil I  

Scientific reports 20190228 1


This article proposes a practical and scalable version of the tight clustering algorithm. The tight clustering algorithm provides tight and stable relevant clusters as output while leaving a set of points as noise or scattered points, that would not go into any cluster. However, the computational limitation to achieve this precise target of tight clusters prohibits it from being used for large microarray gene expression data or any other large data set, which are common nowadays. We propose a pr  ...[more]

Similar Datasets

| S-EPMC5135122 | biostudies-other
| S-EPMC2492882 | biostudies-literature
| S-EPMC2855327 | biostudies-literature
| S-EPMC1890301 | biostudies-literature
| S-EPMC6138422 | biostudies-literature
| S-EPMC2896182 | biostudies-literature
| S-EPMC6329488 | biostudies-literature
| S-EPMC156590 | biostudies-literature
| S-EPMC4138177 | biostudies-literature
| S-EPMC2672630 | biostudies-literature