Dataset Information

Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes.

ABSTRACT: The achievements of genome-wide association studies have suggested ways to predict diseases, such as type 2 diabetes (T2D), using single-nucleotide polymorphisms (SNPs). Most T2D risk prediction models have used SNPs in combination with demographic variables. However, it is difficult to evaluate the pure additive contribution of genetic variants to classically used demographic models. Since prediction models include some heritable traits, such as body mass index, the contribution of SNPs using unmatched case-control samples may be underestimated. In this article, we propose a method that uses propensity score matching to avoid underestimation by matching case and control samples, thereby determining the pure additive contribution of SNPs. To illustrate the proposed propensity score matching method, we used SNP data from the Korea Association Resources project and reported SNPs from the genome-wide association study catalog. We selected various SNP sets via stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and the elastic-net (EN) algorithm. Using these SNP sets, we made predictions using SLR, LASSO, and EN as logistic regression modeling techniques. The accuracy of the predictions was compared in terms of area under the receiver operating characteristic curve (AUC). The contribution of SNPs to T2D was evaluated by the difference in the AUC between models using only demographic variables and models that included the SNPs. The largest difference among our models showed that the AUC of the model using genetic variants with demographic variables could be 0.107 higher than that of the corresponding model using only demographic variables.
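The matching and AUC comparison described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state the matching algorithm, so greedy 1:1 nearest-neighbour matching with a caliper is an assumption, the propensity scores are assumed to be already estimated (for example from the demographic variables), and all names and toy numbers below are hypothetical.

```python
from typing import List, Sequence, Tuple


def greedy_match(case_ps: Sequence[float],
                 control_ps: Sequence[float],
                 caliper: float = 0.05) -> List[Tuple[int, int]]:
    """1:1 greedy nearest-neighbour matching on propensity score.

    Assumption: matching is done without replacement, and a case is
    dropped when no remaining control lies within the caliper.
    """
    available = list(range(len(control_ps)))
    pairs = []
    for i, p in enumerate(case_ps):
        if not available:
            break
        # Closest remaining control for this case.
        j = min(available, key=lambda k: abs(control_ps[k] - p))
        if abs(control_ps[j] - p) <= caliper:
            pairs.append((i, j))
            available.remove(j)
    return pairs


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """AUC as the probability that a random case outscores a random
    control, with ties counted as one half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical propensity scores for three cases and four controls.
pairs = greedy_match([0.30, 0.62, 0.90], [0.28, 0.35, 0.60, 0.10])

# Hypothetical predicted risks on a matched sample (1 = case, 0 = control).
labels = [1, 1, 0, 0]
demo_only = [0.5, 0.4, 0.5, 0.3]   # demographic variables only
demo_snp = [0.7, 0.6, 0.4, 0.3]    # demographic variables plus SNPs

# Contribution of the SNPs, measured as the AUC difference.
delta_auc = auc(demo_snp, labels) - auc(demo_only, labels)
print(pairs, delta_auc)
```

In this toy run the third case is left unmatched because no remaining control falls within the caliper, which is the usual trade-off of caliper matching: closer matches at the cost of a smaller sample.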

SUBMITTER: Park C

PROVIDER: S-EPMC6944048 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature

Publications

Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes.

Park Chanwoo, Jiang Nan, Park Taesung

Genomics & Informatics, 2019-12-23, issue 4

PMID: 31896247

Similar Datasets

Project description:

Background: The diagnostic process is a key element of medicine, but it is complex and prone to errors. Infectious diseases are one of the three categories of diseases in which diagnostic errors can be most harmful to patients. In this study we aimed to estimate the effect of initial misdiagnosis of the source of infection in patients with bacteraemia on 14-day mortality, using propensity score methods to adjust for confounding.

Methods: Data from a previously described longitudinal cohort of patients diagnosed with monobacterial bloodstream infection (BSI) at the Leiden University Medical Centre (LUMC) between 2013 and 2015 were used. Propensity score matching and inverse probability of treatment weighting (IPTW) were applied to correct for confounding. The average treatment effect on the treated (ATT), which in this study was the average effect of initial misdiagnosis on the misdiagnosed (AEMM), was estimated. Methodological issues encountered when applying propensity score methods were addressed by additional sensitivity analyses, which consisted of varying the caliper in propensity score matching and using different truncated weights in IPTW.

Results: Data from 887 patients were included in the study. Propensity scores ranged between 0.015 and 0.999, and 80 patients (9.9%) had a propensity score > 0.95. In the matched analyses, 35 of the 171 misdiagnosed patients died within 14 days (20.5%), versus 10 of the 171 correctly diagnosed patients (5.8%), yielding a difference of 14.6% (7.6%; 21.6%). In the total group of patients, the observed percentage of patients with an incorrect initial diagnosis who died within 14 days was 19.8%, while propensity score reweighting estimated that their probability of dying would have been 6.5% if they had been correctly diagnosed (difference 13.3% (95% CI 6.9%; 19.6%)). After adjustment for all variables that showed imbalance in the propensity score, a difference of 13.7% (7.4%; 19.9%) was estimated. Sensitivity analyses yielded similar results. However, weighted analyses without truncation yielded unstable results.

Conclusion: We observed a substantial increase in 14-day mortality in initially misdiagnosed patients. Furthermore, several patients received propensity scores extremely close to one and were almost sure to be initially misdiagnosed.
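The matched-analysis mortality difference reported above can be checked with a short calculation. This is a sketch under an assumption: the description does not say how the interval was computed, so the code uses a simple unpaired Wald interval for a difference of two proportions, which ignores the pairing induced by matching. The function name is hypothetical. With the reported counts, this interval agrees with the published figures to one decimal place.

```python
import math
from typing import Tuple


def risk_difference(events_a: int, n_a: int,
                    events_b: int, n_b: int,
                    z: float = 1.96) -> Tuple[float, float, float]:
    """Difference of two proportions with an unpaired Wald interval.

    Assumption: the two groups are treated as independent samples;
    a paired analysis of matched data could give a different interval.
    """
    p_a = events_a / n_a
    p_b = events_b / n_b
    diff = p_a - p_b
    se = math.sqrt(p_a * (1 - p_a) / n_a + p_b * (1 - p_b) / n_b)
    return diff, diff - z * se, diff + z * se


# Counts from the matched analysis: 35/171 misdiagnosed patients died
# versus 10/171 correctly diagnosed patients.
diff, lower, upper = risk_difference(35, 171, 10, 171)
print(f"{diff:.1%} ({lower:.1%}; {upper:.1%})")  # prints "14.6% (7.6%; 21.6%)"
```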
