Dataset Information

Classification of Microarray Gene Expression Data Using an Infiltration Tactics Optimization (ITO) Algorithm.


ABSTRACT: A number of different feature selection and classification techniques have been proposed in the literature, including parameter-free and parameter-based algorithms. The former are quick but may get stuck in local maxima, while the latter use dataset-specific parameter tuning for higher accuracy. However, higher accuracy does not necessarily mean higher reliability of the model. Thus, generalized optimization is still a challenge open for further research. This paper presents a warzone-inspired "infiltration tactics" based optimization algorithm (ITO), not to be confused with the ITO algorithm based on the Itô process in the field of stochastic calculus. The proposed ITO algorithm combines parameter-free and parameter-based classifiers to produce a high-accuracy-high-reliability (HAHR) binary classifier. The algorithm produces results in two phases: (i) the Lightweight Infantry Group (LIG) converges quickly to find non-local maxima and produces comparable results (i.e., 70 to 88% accuracy); (ii) the Followup Team (FT) uses advanced tuning to enhance the baseline performance (i.e., 75 to 99%). Every soldier of the ITO army is a base model with its own independently chosen subset selection method, pre-processing, validation method, and classifier. The successful soldiers are combined through heterogeneous ensembles for optimal results. The proposed approach addresses a data scarcity problem, is flexible in the choice of heterogeneous base classifiers, and is able to produce HAHR models comparable to the established MAQC-II results.
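The two-phase idea in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation: the synthetic data, the random feature subsets, the nearest-centroid and k-nearest-neighbour base models, the grid of `k` values, and every function name (`make_toy_data`, `run_ito_sketch`, and so on) are assumptions chosen only to show the general shape of "quick untuned base models first, tuning of the successful ones second, then a majority-vote ensemble".

```python
import random
from statistics import mean

def make_toy_data(n=90, n_features=8, seed=0):
    """Synthetic two-class data (assumption); only features 0 and 1 carry signal."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        row[0] += 3.0 * label
        row[1] -= 3.0 * label
        X.append(row)
        y.append(label)
    return X, y

def sq_dist(a, b, feats):
    """Squared Euclidean distance restricted to a feature subset."""
    return sum((a[f] - b[f]) ** 2 for f in feats)

def centroid_model(X, y, feats):
    """Parameter-free base model: nearest class centroid."""
    cents = {}
    for c in set(y):
        rows = [x for x, t in zip(X, y) if t == c]
        cents[c] = [mean(r[f] for r in rows) for f in range(len(X[0]))]
    return lambda x: min(cents, key=lambda c: sq_dist(x, cents[c], feats))

def knn_model(X, y, feats, k):
    """Parameter-based base model: k-nearest neighbours, where k is tuned."""
    def predict(x):
        ranked = sorted(range(len(X)), key=lambda i: sq_dist(x, X[i], feats))
        votes = [y[i] for i in ranked[:k]]
        return max(set(votes), key=votes.count)
    return predict

def accuracy(model, X, y):
    return sum(model(x) == t for x, t in zip(X, y)) / len(y)

def run_ito_sketch(seed=0, n_soldiers=7, n_keep=3):
    X, y = make_toy_data(seed=seed)
    X_tr, y_tr = X[:50], y[:50]
    X_val, y_val = X[50:70], y[50:70]
    X_te, y_te = X[70:], y[70:]
    rng = random.Random(seed)

    # Phase 1 (stand-in for the LIG): untuned soldiers on random feature subsets.
    soldiers = []
    for _ in range(n_soldiers):
        feats = rng.sample(range(len(X[0])), 3)
        model = centroid_model(X_tr, y_tr, feats)
        soldiers.append((accuracy(model, X_val, y_val), feats, model))
    soldiers.sort(key=lambda s: s[0], reverse=True)
    survivors = soldiers[:n_keep]  # the "successful soldiers"

    # Phase 2 (stand-in for the FT): tune k per survivor; keep the
    # phase-1 model if no tuned variant beats it on validation data.
    tuned = []
    for base_acc, feats, base_model in survivors:
        best_acc, best_model = base_acc, base_model
        for k in (1, 3, 5):
            model = knn_model(X_tr, y_tr, feats, k)
            acc = accuracy(model, X_val, y_val)
            if acc > best_acc:
                best_acc, best_model = acc, model
        tuned.append((best_acc, best_model))

    # Ensemble of the tuned survivors by majority vote.
    def ensemble(x):
        votes = [m(x) for _, m in tuned]
        return max(set(votes), key=votes.count)

    return {
        "phase1": [s[0] for s in survivors],
        "phase2": [t[0] for t in tuned],
        "test_acc": accuracy(ensemble, X_te, y_te),
    }
```

Calling `run_ito_sketch()` returns the validation accuracies of the kept base models before and after tuning, plus the ensemble accuracy on held-out data. By construction the phase-2 validation accuracy is never below the phase-1 value for the same soldier, which mirrors the abstract's framing of the second phase as an enhancement of a baseline; the numbers themselves say nothing about the published results.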

SUBMITTER: Zahoor J 

PROVIDER: S-EPMC7397166 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

Publications

Classification of Microarray Gene Expression Data Using an Infiltration Tactics Optimization (ITO) Algorithm.

Zahoor Javed J, Zafar Kashif K

Genes, 2020 Jul 18, issue 7




Similar Datasets

| S-EPMC3033885 | biostudies-literature
| S-EPMC8444075 | biostudies-literature
| S-EPMC1821044 | biostudies-literature
| S-EPMC10194998 | biostudies-literature
| S-EPMC5728509 | biostudies-literature
| S-EPMC1088301 | biostudies-literature
| S-EPMC4439506 | biostudies-other
| S-EPMC8022636 | biostudies-literature
| S-EPMC34421 | biostudies-literature
| S-EPMC10296348 | biostudies-literature