Unknown

Dataset Information

0

Approach for text classification based on the similarity measurement between normal cloud models.


ABSTRACT: The similarity between objects is the core research area of data mining. In order to reduce the interference of the uncertainty of nature language, a similarity measurement between normal cloud models is adopted to text classification research. On this basis, a novel text classifier based on cloud concept jumping up (CCJU-TC) is proposed. It can efficiently accomplish conversion between qualitative concept and quantitative data. Through the conversion from text set to text information table based on VSM model, the text qualitative concept, which is extraction from the same category, is jumping up as a whole category concept. According to the cloud similarity between the test text and each category concept, the test text is assigned to the most similar category. By the comparison among different text classifiers in different feature selection set, it fully proves that not only does CCJU-TC have a strong ability to adapt to the different text features, but also the classification performance is also better than the traditional classifiers.

SUBMITTER: Dai J 

PROVIDER: S-EPMC3953649 | biostudies-other | 2014

REPOSITORIES: biostudies-other

Similar Datasets

| S-EPMC7721552 | biostudies-literature
| S-EPMC6550425 | biostudies-literature
| S-EPMC8330432 | biostudies-literature
| S-EPMC2939881 | biostudies-literature
| S-EPMC7285297 | biostudies-literature
| S-EPMC6988483 | biostudies-literature
| S-EPMC4003000 | biostudies-literature
| S-EPMC4738796 | biostudies-other
| S-EPMC2788367 | biostudies-literature
| S-EPMC524507 | biostudies-literature