Unknown

Dataset Information

0

Efficient feature selection and classification for microarray data.


ABSTRACT: Feature selection and classification are the main topics in microarray data analysis. Although many feature selection methods have been proposed and developed in this field, SVM-RFE (Support Vector Machine based on Recursive Feature Elimination) is proved as one of the best feature selection methods, which ranks the features (genes) by training support vector machine classification model and selects key genes combining with recursive feature elimination strategy. The principal drawback of SVM-RFE is the huge time consumption. To overcome this limitation, we introduce a more efficient implementation of linear support vector machines and improve the recursive feature elimination strategy and then combine them together to select informative genes. Besides, we propose a simple resampling method to preprocess the datasets, which makes the information distribution of different kinds of samples balanced and the classification results more credible. Moreover, the applicability of four common classifiers is also studied in this paper. Extensive experiments are conducted on six most frequently used microarray datasets in this field, and the results show that the proposed methods have not only reduced the time consumption greatly but also obtained comparable classification performance.

SUBMITTER: Li Z 

PROVIDER: S-EPMC6101392 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4105478 | biostudies-literature
| S-EPMC1181625 | biostudies-literature
| S-EPMC2789385 | biostudies-literature
| S-EPMC2951666 | biostudies-literature
| S-EPMC3796884 | biostudies-other
| S-EPMC3218317 | biostudies-literature
| S-EPMC4043987 | biostudies-literature
| S-EPMC1363357 | biostudies-literature
| S-EPMC3577111 | biostudies-literature
| S-EPMC3347893 | biostudies-literature