Unknown

Dataset Information

0

Study design in high-dimensional classification analysis.


ABSTRACT: Advances in high throughput technology have accelerated the use of hundreds to millions of biomarkers to construct classifiers that partition patients into different clinical conditions. Prior to classifier development in actual studies, a critical need is to determine the sample size required to reach a specified classification precision. We develop a systematic approach for sample size determination in high-dimensional (large [Formula: see text] small [Formula: see text]) classification analysis. Our method utilizes the probability of correct classification (PCC) as the optimization objective function and incorporates the higher criticism thresholding procedure for classifier development. Further, we derive the theoretical bound of maximal PCC gain from feature augmentation (e.g. when molecular and clinical predictors are combined in classifier development). Our methods are motivated and illustrated by a study using proteomics markers to classify post-kidney transplantation patients into stable and rejecting classes.

SUBMITTER: Sanchez BN 

PROVIDER: S-EPMC5031947 | biostudies-literature | 2016 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Study design in high-dimensional classification analysis.

Sánchez Brisa N BN   Wu Meihua M   Song Peter X K PX   Wang Wen W  

Biostatistics (Oxford, England) 20160505 4


Advances in high throughput technology have accelerated the use of hundreds to millions of biomarkers to construct classifiers that partition patients into different clinical conditions. Prior to classifier development in actual studies, a critical need is to determine the sample size required to reach a specified classification precision. We develop a systematic approach for sample size determination in high-dimensional (large [Formula: see text] small [Formula: see text]) classification analys  ...[more]

Similar Datasets

| S-EPMC6980791 | biostudies-literature
2014-10-01 | E-GEOD-51981 | biostudies-arrayexpress
| S-EPMC2861314 | biostudies-literature
| S-EPMC3164859 | biostudies-other
2014-10-01 | GSE51981 | GEO
| S-EPMC3717481 | biostudies-literature
| S-EPMC3577111 | biostudies-literature
| S-EPMC8557345 | biostudies-literature
| S-EPMC5870609 | biostudies-literature
| S-EPMC9574082 | biostudies-literature