Dataset Information

Biases introduced by choosing controls to match risk factors of cases in biomarker research.

ABSTRACT: BACKGROUND: Selecting controls that match cases on risk factors for the outcome is a pervasive practice in biomarker research studies. Such matching, however, biases estimates of biomarker prediction performance. The magnitudes of these biases are unknown. METHODS: We examined the prediction performance of biomarkers and improvements in prediction gained by adding biomarkers to risk factor information. Data simulated from bivariate normal statistical models and data from a study to identify critically ill patients were used. We compared true performance with that estimated from case-control studies that do or do not use matching. ROC curves were used to quantify performance. We propose a new statistical method to estimate prediction performance from matched studies for which data on the matching factors are available for subjects in the population. RESULTS: Performance estimated with standard analyses can be grossly biased by matching, especially when biomarkers are highly correlated with matching risk factors. In our studies, the performance of the biomarker alone was underestimated, whereas the improvement in performance gained by adding the marker to risk factors was overestimated by 2- to 10-fold. We found examples for which the relative ranking of 2 biomarkers for prediction was inappropriately reversed by use of a matched design. The new approach to estimation corrected for bias in matched studies. CONCLUSIONS: To properly gauge prediction performance in the population or the improvement gained by adding a biomarker to known risk factors, matched case-control studies must be supplemented with risk factor information from the population and must be analyzed with nonstandard statistical methods.
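
The underestimation described in the abstract can be illustrated with a small simulation. The following is a minimal sketch, not the authors' code or their proposed correction method: the correlation `rho`, the mean shift `shift`, the sample sizes, the nearest-neighbour matching with replacement, and all function names are assumptions chosen for illustration. It draws a risk factor and a biomarker from bivariate normal distributions for cases and controls, then compares the biomarker's empirical AUC with randomly sampled controls against the AUC with controls matched to cases on the risk factor.

```python
import bisect
import math
import random


def auc(case_scores, control_scores):
    """Empirical AUC: share of (case, control) pairs where the case scores higher."""
    controls = sorted(control_scores)
    wins = sum(bisect.bisect_left(controls, s) for s in case_scores)
    return wins / (len(case_scores) * len(controls))


def draw(n, mean_x, mean_y, rho, rng):
    """Draw n (risk factor, biomarker) pairs with unit variances and correlation rho."""
    pairs = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        e = rng.gauss(0.0, 1.0)
        y = rho * x + math.sqrt(1.0 - rho ** 2) * e
        pairs.append((mean_x + x, mean_y + y))
    return pairs


def simulate(rho=0.7, shift=1.0, n_cases=2000, n_pool=50000, seed=0):
    """Return (AUC with random controls, AUC with risk-factor-matched controls).

    All parameter values are illustrative assumptions, not taken from the paper.
    """
    rng = random.Random(seed)
    cases = draw(n_cases, shift, shift, rho, rng)
    pool = sorted(draw(n_pool, 0.0, 0.0, rho, rng))  # controls, sorted by risk factor
    pool_x = [x for x, _ in pool]

    # Unmatched design: controls sampled at random from the control population.
    unmatched = rng.sample(pool, n_cases)

    # Matched design: for each case, take the control with the nearest risk factor
    # value (with replacement, to keep the sketch short).
    matched = []
    for x, _ in cases:
        i = min(max(bisect.bisect_left(pool_x, x), 1), n_pool - 1)
        if x - pool_x[i - 1] < pool_x[i] - x:
            i -= 1
        matched.append(pool[i])

    case_y = [y for _, y in cases]
    return (auc(case_y, [y for _, y in unmatched]),
            auc(case_y, [y for _, y in matched]))


if __name__ == "__main__":
    unmatched_auc, matched_auc = simulate()
    print(f"biomarker AUC, random controls:  {unmatched_auc:.3f}")
    print(f"biomarker AUC, matched controls: {matched_auc:.3f}")
```

Under these assumed parameters, matching pulls the controls' biomarker values toward those of the cases because the biomarker is correlated with the matching factor, so the matched-design AUC comes out noticeably lower than the AUC with random controls. Setting `rho=0.0` should make the two estimates roughly agree, in line with the abstract's remark that the bias is largest when biomarkers are highly correlated with the matching risk factors.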

SUBMITTER: Pepe MS

PROVIDER: S-EPMC3464972 | biostudies-literature | 2012 Aug

REPOSITORIES: biostudies-literature

Publications

Biases introduced by choosing controls to match risk factors of cases in biomarker research.

Pepe MS, Fan J, Seymour CW, Li C, Huang Y, Feng Z

Clinical Chemistry, 2012-06-22, issue 8

PMID: 22730452

Similar Datasets

From differences in means between cases and controls to risk stratification: a business plan for biomarker development. | S-EPMC3570740 | biostudies-literature

Choosing a Qualitative Research Approach. | S-EPMC4675428 | biostudies-literature

Reducing inherent biases introduced during DNA viral metagenome analyses of municipal wastewater. | S-EPMC5882159 | biostudies-literature

Biases introduced by filtering electronic health records for patients with "complete data". | S-EPMC6080680 | biostudies-literature

Risk factors for UK Plasmodium falciparum cases. | S-EPMC4132200 | biostudies-literature

Risk factors for eight common cancers revealed from a phenome-wide Mendelian randomisation analysis of 378,142 cases and 485,715 controls. | S-EPMC10055507 | biostudies-literature

Factors associated with the uptake of newly introduced childhood vaccinations in Ethiopia: the cases of rotavirus and pneumococcal conjugate vaccines. | S-EPMC6902476 | biostudies-literature

Identification and selection of cases and controls in the Pneumonia Etiology Research for Child Health project. | S-EPMC3297551 | biostudies-literature

Choosing the right time granularity for analysis of digital biomarker trajectories. | S-EPMC7748028 | biostudies-literature

Cryptically patterned moths perceive bark structure when choosing body orientations that match wing color pattern to the bark pattern. | S-EPMC3813426 | biostudies-literature