Dataset Information

Individualized markers optimize class prediction of microarray data.


ABSTRACT:

Background

Identification of molecular markers for the classification of microarray data is a challenging task. Despite the evident dissimilarity in various characteristics of biological samples belonging to the same category, most marker-selection and classification methods do not consider this variability. In general, feature selection methods aim at identifying a common set of genes whose combined expression profiles can accurately predict the category of all samples. Here, we argue that this simplified approach is often unable to capture the complexity of a disease phenotype, and we propose an alternative method that takes into account the individuality of each patient sample.

Results

Instead of using the same features for the classification of all samples, the proposed technique starts by creating a pool of informative gene-features. For each sample, the method selects a subset of these features whose expression profiles are most likely to accurately predict the sample's category. Different subsets are utilized for different samples, and the outcomes are combined in a hierarchical framework for the classification of all samples. Moreover, this approach can innately identify subgroups of samples within a given class that share common feature sets, thus highlighting the effect of individuality on gene expression.
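The idea of per-sample feature subsets can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not specify the scoring rule, the selection criterion, or the hierarchical combination step, so everything below is an assumption made for illustration. The hypothetical `build_pool` ranks genes by a simple two-class separation score, and the hypothetical `classify_sample` keeps, for each sample, the `k` pooled genes on which that sample sits most decisively near one class mean, then combines them by a plain majority vote (a simplified stand-in for the hierarchical framework).

```python
from statistics import mean, pstdev

def build_pool(X, y, pool_size):
    """Keep the pool_size genes that best separate class 0 from class 1.

    The score (difference of class means over summed spreads) is an
    assumed stand-in; the source does not say how the pool is built.
    """
    stats = []
    for g in range(len(X[0])):
        a = [row[g] for row, lab in zip(X, y) if lab == 0]
        b = [row[g] for row, lab in zip(X, y) if lab == 1]
        m0, m1 = mean(a), mean(b)
        spread = pstdev(a) + pstdev(b) + 1e-9  # avoid division by zero
        stats.append((abs(m0 - m1) / spread, g, m0, m1, spread))
    stats.sort(reverse=True)
    return [(g, m0, m1, spread) for _, g, m0, m1, spread in stats[:pool_size]]

def classify_sample(sample, pool, k):
    """Pick the k pooled genes most decisive for THIS sample, then vote."""
    def margin(entry):
        g, m0, m1, spread = entry
        # Large when the sample is much closer to one class mean than the other.
        return abs(abs(sample[g] - m0) - abs(sample[g] - m1)) / spread

    chosen = sorted(pool, key=margin, reverse=True)[:k]
    votes_for_1 = sum(
        abs(sample[g] - m1) < abs(sample[g] - m0) for g, m0, m1, _ in chosen
    )
    # Majority vote; ties fall to class 0 (an arbitrary choice in this sketch).
    label = 1 if 2 * votes_for_1 > len(chosen) else 0
    return label, [g for g, *_ in chosen]

# Toy data (invented): rows are samples, columns are genes.
X = [
    [1.0, 5.0, 3.0, 2.0],
    [1.2, 5.1, 2.0, 2.1],
    [0.9, 4.8, 3.1, 1.9],
    [3.0, 1.0, 2.9, 2.0],
    [3.1, 1.2, 2.1, 2.2],
    [2.9, 0.9, 3.0, 1.8],
]
y = [0, 0, 0, 1, 1, 1]

pool = build_pool(X, y, pool_size=2)
label_a, genes_a = classify_sample([1.1, 4.9, 2.5, 2.0], pool, k=1)
label_b, genes_b = classify_sample([3.0, 4.0, 2.5, 2.0], pool, k=1)
```

In this toy setting the two query samples are classified using different genes from the same pool, which is the behavior the paragraph above describes; samples that end up sharing the same selected genes would form the kind of subgroup the abstract mentions.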

Conclusion

In addition to high classification accuracy, the proposed method offers a more individualized approach to the identification of biological markers, which may improve understanding of the molecular background of a disease and underscore the need for more flexible medical interventions.

SUBMITTER: Pavlidis P 

PROVIDER: S-EPMC1569876 | biostudies-literature | 2006 Jul

REPOSITORIES: biostudies-literature

Publications

Individualized markers optimize class prediction of microarray data.

Pavlos Pavlidis, Panayiota Poirazi

BMC Bioinformatics, 2006-07-14


Similar Datasets

| S-EPMC7506836 | biostudies-literature
| S-EPMC3019174 | biostudies-literature
| S-EPMC3098087 | biostudies-literature
| S-EPMC5832435 | biostudies-literature
| S-EPMC101257 | biostudies-literature
| S-EPMC2535775 | biostudies-literature
2015-04-17 | E-GEOD-67979 | biostudies-arrayexpress
2015-04-17 | GSE67979 | GEO
| S-EPMC2825599 | biostudies-literature
| S-EPMC3002369 | biostudies-literature