Dataset Information

A comparative study of improvements Pre-filter methods bring on feature selection using microarray data.


ABSTRACT: BACKGROUND: Feature selection techniques have become essential in biomarker discovery with the development of microarrays. However, the high-dimensional nature of microarray data makes feature selection time-consuming. To overcome this difficulty, filtering data according to background knowledge before applying feature selection techniques has become a hot topic in microarray analysis. Different methods may affect the final results greatly, so it is important to evaluate these pre-filter methods systematically. METHODS: In this paper, we compared the performance of statistical-based pre-filter methods, biological-based pre-filter methods, and their combination on microRNA-mRNA parallel expression profiles, using L1 logistic regression as the feature selection technique. Four types of data were built for both microRNA and mRNA expression profiles. RESULTS: Pre-filter methods greatly reduced the number of features for both the mRNA and microRNA expression datasets. The features selected after the pre-filter procedures were shown to be biologically significant at levels such as biological process and microRNA function. Analyses of classification performance based on precision showed that pre-filter methods were necessary when the number of raw features was much larger than the number of samples. Computing time was greatly shortened in all cases after the pre-filter procedures. CONCLUSIONS: With similar or better classification performance and fewer but biologically significant features, pre-filter-based feature selection should be considered when researchers need fast results for complex computing problems in bioinformatics.
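The two-stage idea in the abstract (pre-filter first, then L1 logistic regression) can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration: the abstract does not say which statistical filter was used, so a simple variance ranking stands in for it, the L1 logistic regression is a small proximal-gradient implementation rather than whatever solver the authors used, and the data are synthetic. The function names `variance_prefilter` and `l1_logistic_select` are hypothetical.

```python
import math
import random


def variance_prefilter(X, keep):
    """Assumed stand-in for a statistical pre-filter:
    return the indices of the `keep` highest-variance columns of X."""
    n, p = len(X), len(X[0])
    variances = []
    for j in range(p):
        col = [row[j] for row in X]
        mean = sum(col) / n
        variances.append(sum((v - mean) ** 2 for v in col) / n)
    ranked = sorted(range(p), key=lambda j: variances[j], reverse=True)
    return sorted(ranked[:keep])


def soft_threshold(w, t):
    """Proximal operator of the L1 penalty: shrink w toward zero by t."""
    if w > t:
        return w - t
    if w < -t:
        return w + t
    return 0.0


def l1_logistic_select(X, y, lam=0.1, lr=0.1, epochs=300):
    """Fit L1-penalised logistic regression by proximal gradient descent.
    Returns (indices of features with non-zero weight, weight vector)."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * p, 0.0
        for row, label in zip(X, y):
            z = b + sum(wj * xj for wj, xj in zip(w, row))
            z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
            err = 1.0 / (1.0 + math.exp(-z)) - label
            grad_b += err
            for j in range(p):
                grad_w[j] += err * row[j]
        b -= lr * grad_b / n  # intercept is not penalised
        w = [soft_threshold(wj - lr * gj / n, lr * lam)
             for wj, gj in zip(w, grad_w)]
    return [j for j, wj in enumerate(w) if wj != 0.0], w


# Synthetic data (an assumption): 20 samples, 50 features, far more
# features than informative ones. Only feature 0 carries class signal.
random.seed(0)
y = [i % 2 for i in range(20)]
X = []
for label in y:
    signal = (2.0 if label else -2.0) + random.gauss(0, 0.5)
    X.append([signal] + [random.gauss(0, 0.1) for _ in range(49)])

# Stage 1: pre-filter from 50 features down to 5.
kept = variance_prefilter(X, keep=5)
X_reduced = [[row[j] for j in kept] for row in X]

# Stage 2: L1 logistic regression on the reduced matrix only.
selected_local, weights = l1_logistic_select(X_reduced, y)
selected = [kept[j] for j in selected_local]  # map back to original indices

print("kept after pre-filter:", kept)
print("selected by L1 logistic regression:", selected)
```

The point of the sketch is the cost structure: the inner loop of the L1 fit scales with the number of columns it sees, so running it on 5 pre-filtered columns instead of 50 is what shortens computing time. A biological pre-filter would replace `variance_prefilter` with a lookup against prior knowledge, which this sketch does not attempt.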

SUBMITTER: Wang Y 

PROVIDER: S-EPMC4340279 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

Publications

A comparative study of improvements Pre-filter methods bring on feature selection using microarray data.

Wang Yingying, Fan Xiaomao, Cai Yunpeng

Health Information Science and Systems, 2014-10-16


Similar Datasets

| S-EPMC7860207 | biostudies-literature
| S-EPMC1181625 | biostudies-literature
| S-EPMC6101392 | biostudies-literature
| S-EPMC2951666 | biostudies-literature
| S-EPMC3796884 | biostudies-literature
| S-EPMC4043987 | biostudies-literature
| S-EPMC4105478 | biostudies-literature
| S-EPMC8638022 | biostudies-literature
| S-EPMC2441630 | biostudies-literature
| S-EPMC5581932 | biostudies-literature