Unknown

Dataset Information

0

A voting approach to identify a small number of highly predictive genes using multiple classifiers.


ABSTRACT: BACKGROUND: Microarray gene expression profiling has provided extensive datasets that can describe characteristics of cancer patients. An important challenge for this type of data is the discovery of gene sets which can be used as the basis of developing a clinical predictor for cancer. It is desirable that such gene sets be compact, give accurate predictions across many classifiers, be biologically relevant and have good biological process coverage. RESULTS: By using a new type of multiple classifier voting approach, we have identified gene sets that can predict breast cancer prognosis accurately, for a range of classification algorithms. Unlike a wrapper approach, our method is not specialised towards a single classification technique. Experimental analysis demonstrates higher prediction accuracies for our sets of genes compared to previous work in the area. Moreover, our sets of genes are generally more compact than those previously proposed. Taking a biological viewpoint, from the literature, most of the genes in our sets are known to be strongly related to cancer. CONCLUSION: We show that it is possible to obtain superior classification accuracy with our approach and obtain a compact gene set that is also biologically relevant and has good coverage of different biological processes.

SUBMITTER: Hassan MR 

PROVIDER: S-EPMC2648737 | biostudies-literature | 2009

REPOSITORIES: biostudies-literature

altmetric image

Publications

A voting approach to identify a small number of highly predictive genes using multiple classifiers.

Hassan Md Rafiul MR   Hossain M Maruf MM   Bailey James J   Macintyre Geoff G   Ho Joshua W K JW   Ramamohanarao Kotagiri K  

BMC bioinformatics 20090130


<h4>Background</h4>Microarray gene expression profiling has provided extensive datasets that can describe characteristics of cancer patients. An important challenge for this type of data is the discovery of gene sets which can be used as the basis of developing a clinical predictor for cancer. It is desirable that such gene sets be compact, give accurate predictions across many classifiers, be biologically relevant and have good biological process coverage.<h4>Results</h4>By using a new type of  ...[more]

Similar Datasets

| S-EPMC2920073 | biostudies-literature
| S-EPMC9972901 | biostudies-literature
| S-EPMC10262409 | biostudies-literature
| S-EPMC9296939 | biostudies-literature
| S-EPMC6004057 | biostudies-literature
| S-EPMC3742133 | biostudies-literature
| S-EPMC4157793 | biostudies-literature
| S-EPMC5287955 | biostudies-literature
| S-EPMC2685112 | biostudies-literature
| S-EPMC3541361 | biostudies-literature