Unknown

Dataset Information

0

Maximum predictive power of the microarray-based models for clinical outcomes is limited by correlation between endpoint and gene expression profile.


ABSTRACT:

Background

Microarray data have been used for gene signature selection to predict clinical outcomes. Many studies have attempted to identify factors that affect models' performance with only little success. Fine-tuning of model parameters and optimizing each step of the modeling process often results in over-fitting problems without improving performance.

Results

We propose a quantitative measurement, termed consistency degree, to detect the correlation between disease endpoint and gene expression profile. Different endpoints were shown to have different consistency degrees to gene expression profiles. The validity of this measurement to estimate the consistency was tested with significance at a p-value less than 2.2e-16 for all of the studied endpoints. According to the consistency degree score, overall survival milestone outcome of multiple myeloma was proposed to extend from 730 days to 1561 days, which is more consistent with gene expression profile.

Conclusion

For various clinical endpoints, the maximum predictive powers of different microarray-based models are limited by the correlation between endpoint and gene expression profile of disease samples as indicated by the consistency degree score. In addition, previous defined clinical outcomes can also be reassessed and refined more coherent according to related disease gene expression profile. Our findings point to an entirely new direction for assessing the microarray-based predictive models and provide important information to gene signature based clinical applications.

SUBMITTER: Zhao C 

PROVIDER: S-EPMC3287499 | biostudies-literature | 2011 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Maximum predictive power of the microarray-based models for clinical outcomes is limited by correlation between endpoint and gene expression profile.

Zhao Chen C   Shi Leming L   Tong Weida W   Shaughnessy John D JD   Oberthuer André A   Pusztai Lajos L   Deng Youping Y   Symmans W Fraser WF   Shi Tieliu T  

BMC genomics 20111223


<h4>Background</h4>Microarray data have been used for gene signature selection to predict clinical outcomes. Many studies have attempted to identify factors that affect models' performance with only little success. Fine-tuning of model parameters and optimizing each step of the modeling process often results in over-fitting problems without improving performance.<h4>Results</h4>We propose a quantitative measurement, termed consistency degree, to detect the correlation between disease endpoint an  ...[more]

Similar Datasets

| S-EPMC7217010 | biostudies-literature
| S-EPMC4506430 | biostudies-literature
| S-EPMC5685677 | biostudies-literature
| S-EPMC2823709 | biostudies-literature
| S-EPMC7410259 | biostudies-literature
2024-08-22 | GSE271689 | GEO
| S-EPMC8153045 | biostudies-literature
| S-EPMC2886096 | biostudies-literature
| S-EPMC3315840 | biostudies-literature
| S-EPMC4013760 | biostudies-literature