Unknown

Dataset Information

0

Robust microarray meta-analysis identifies differentially expressed genes for clinical prediction.


ABSTRACT: Combining multiple microarray datasets increases sample size and leads to improved reproducibility in identification of informative genes and subsequent clinical prediction. Although microarrays have increased the rate of genomic data collection, sample size is still a major issue when identifying informative genetic biomarkers. Because of this, feature selection methods often suffer from false discoveries, resulting in poorly performing predictive models. We develop a simple meta-analysis-based feature selection method that captures the knowledge in each individual dataset and combines the results using a simple rank average. In a comprehensive study that measures robustness in terms of clinical application (i.e., breast, renal, and pancreatic cancer), microarray platform heterogeneity, and classifier (i.e., logistic regression, diagonal LDA, and linear SVM), we compare the rank average meta-analysis method to five other meta-analysis methods. Results indicate that rank average meta-analysis consistently performs well compared to five other meta-analysis methods.

SUBMITTER: Phan JH 

PROVIDER: S-EPMC3539384 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

Robust microarray meta-analysis identifies differentially expressed genes for clinical prediction.

Phan John H JH   Young Andrew N AN   Wang May D MD  

TheScientificWorldJournal 20121218


Combining multiple microarray datasets increases sample size and leads to improved reproducibility in identification of informative genes and subsequent clinical prediction. Although microarrays have increased the rate of genomic data collection, sample size is still a major issue when identifying informative genetic biomarkers. Because of this, feature selection methods often suffer from false discoveries, resulting in poorly performing predictive models. We develop a simple meta-analysis-based  ...[more]

Similar Datasets

| S-EPMC1885857 | biostudies-literature
| S-EPMC4526045 | biostudies-literature
2006-10-28 | GSE6140 | GEO
| S-EPMC2098839 | biostudies-literature
| S-EPMC4694224 | biostudies-literature
| S-EPMC3871567 | biostudies-literature
| S-EPMC7022900 | biostudies-literature
| S-EPMC3582450 | biostudies-literature
| S-EPMC4411479 | biostudies-literature
| S-EPMC3654929 | biostudies-literature