Unknown

Dataset Information

0

AUC-based biomarker ensemble with an application on gene scores predicting low bone mineral density.


ABSTRACT:

Motivation

The area under the receiver operating characteristic (ROC) curve (AUC), long regarded as a 'golden' measure for the predictiveness of a continuous score, has propelled the need to develop AUC-based predictors. However, the AUC-based ensemble methods are rather scant, largely due to the fact that the associated objective function is neither continuous nor concave. Indeed, there is no reliable numerical algorithm identifying optimal combination of a set of biomarkers to maximize the AUC, especially when the number of biomarkers is large.

Results

We have proposed a novel AUC-based statistical ensemble methods for combining multiple biomarkers to differentiate a binary response of interest. Specifically, we propose to replace the non-continuous and non-convex AUC objective function by a convex surrogate loss function, whose minimizer can be efficiently identified. With the established framework, the lasso and other regularization techniques enable feature selections. Extensive simulations have demonstrated the superiority of the new methods to the existing methods. The proposal has been applied to a gene expression dataset to construct gene expression scores to differentiate elderly women with low bone mineral density (BMD) and those with normal BMD. The AUCs of the resulting scores in the independent test dataset has been satisfactory.

Conclusion

Aiming for directly maximizing AUC, the proposed AUC-based ensemble method provides an efficient means of generating a stable combination of multiple biomarkers, which is especially useful under the high-dimensional settings.

Contact

lutian@stanford.edu.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Zhao XG 

PROVIDER: S-EPMC3198577 | biostudies-literature | 2011 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

AUC-based biomarker ensemble with an application on gene scores predicting low bone mineral density.

Zhao X G XG   Dai W W   Li Y Y   Tian L L  

Bioinformatics (Oxford, England) 20110909 21


<h4>Motivation</h4>The area under the receiver operating characteristic (ROC) curve (AUC), long regarded as a 'golden' measure for the predictiveness of a continuous score, has propelled the need to develop AUC-based predictors. However, the AUC-based ensemble methods are rather scant, largely due to the fact that the associated objective function is neither continuous nor concave. Indeed, there is no reliable numerical algorithm identifying optimal combination of a set of biomarkers to maximize  ...[more]

Similar Datasets

| S-EPMC8849249 | biostudies-literature
| S-EPMC9570421 | biostudies-literature
| S-EPMC7442851 | biostudies-literature
| S-EPMC1124002 | biostudies-literature
| S-EPMC7470426 | biostudies-literature
| S-EPMC4768717 | biostudies-literature
| S-EPMC6975633 | biostudies-literature
| S-EPMC3709009 | biostudies-literature
| S-EPMC3086755 | biostudies-literature
| S-EPMC4788514 | biostudies-literature