Dataset Information

Pooling individual participant data from randomized controlled trials: Exploring potential loss of information.

ABSTRACT: BACKGROUND:Pooling individual participant data to enable pooled analyses is often complicated by diversity in variables across available datasets. Therefore, recoding original variables is often necessary to build a pooled dataset. We aimed to quantify how much information is lost in this process and to what extent this jeopardizes validity of analyses results. METHODS:Data were derived from a platform that was developed to pool data from three randomized controlled trials on the effect of treatment of cardiovascular risk factors on cognitive decline or dementia. We quantified loss of information using the R-squared of linear regression models with pooled variables as a function of their original variable(s). In case the R-squared was below 0.8, we additionally explored the potential impact of loss of information for future analyses. We did this second step by comparing whether the Beta coefficient of the predictor differed more than 10% when adding original or recoded variables as a confounder in a linear regression model. In a simulation we randomly sampled numbers, recoded those < = 1000 to 0 and those >1000 to 1 and varied the range of the continuous variable, the ratio of recoded zeroes to recoded ones, or both, and again extracted the R-squared from linear models to quantify information loss. RESULTS:The R-squared was below 0.8 for 8 out of 91 recoded variables. In 4 cases this had a substantial impact on the regression models, particularly when a continuous variable was recoded into a discrete variable. Our simulation showed that the least information is lost when the ratio of recoded zeroes to ones is 1:1. CONCLUSIONS:Large, pooled datasets provide great opportunities, justifying the efforts for data harmonization. Still, caution is warranted when using recoded variables which variance is explained limitedly by their original variables as this may jeopardize the validity of study results.

SUBMITTER: van Wanrooij LL

PROVIDER: S-EPMC7217432 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Pooling individual participant data from randomized controlled trials: Exploring potential loss of information.

van Wanrooij Lennard L LL Hoevenaar-Blom Marieke P MP Coley Nicola N Ngandu Tiia T Meiller Yannick Y Guillemont Juliette J Rosenberg Anna A Beishuizen Cathrien R L CRL Moll van Charante Eric P EP Soininen Hilkka H Brayne Carol C Andrieu Sandrine S Kivipelto Miia M Richard Edo E

PloS one 20200512 5

<h4>Background</h4>Pooling individual participant data to enable pooled analyses is often complicated by diversity in variables across available datasets. Therefore, recoding original variables is often necessary to build a pooled dataset. We aimed to quantify how much information is lost in this process and to what extent this jeopardizes validity of analyses results.<h4>Methods</h4>Data were derived from a platform that was developed to pool data from three randomized controlled trials on the ...[more]

PMID: 32396543

Similar Datasets

Project description:Objective To assess the effect of the FTO genotype on weight loss after dietary, physical activity, or drug based interventions in randomised controlled trials.Design Systematic review and random effects meta-analysis of individual participant data from randomised controlled trials.Data sources Ovid Medline, Scopus, Embase, and Cochrane from inception to November 2015.Eligibility criteria for study selection Randomised controlled trials in overweight or obese adults reporting reduction in body mass index, body weight, or waist circumference by FTO genotype (rs9939609 or a proxy) after dietary, physical activity, or drug based interventions. Gene by treatment interaction models were fitted to individual participant data from all studies included in this review, using allele dose coding for genetic effects and a common set of covariates. Study level interactions were combined using random effect models. Metaregression and subgroup analysis were used to assess sources of study heterogeneity.Results We identified eight eligible randomised controlled trials for the systematic review and meta-analysis (n=9563). Overall, differential changes in body mass index, body weight, and waist circumference in response to weight loss intervention were not significantly different between FTO genotypes. Sensitivity analyses indicated that differential changes in body mass index, body weight, and waist circumference by FTO genotype did not differ by intervention type, intervention length, ethnicity, sample size, sex, and baseline body mass index and age category.Conclusions We have observed that carriage of the FTO minor allele was not associated with differential change in adiposity after weight loss interventions. These findings show that individuals carrying the minor allele respond equally well to dietary, physical activity, or drug based weight loss interventions and thus genetic predisposition to obesity associated with the FTO minor allele can be at least partly counteracted through such interventions.Systematic review registration PROSPERO CRD42015015969.

Project description:BackgroundThe clinical benefit of aspirin for the primary prevention of cardiovascular disease (CVD) in diabetes remains uncertain. To evaluate the efficacy and safety of aspirin for the primary prevention of cardiovascular outcomes and all-cause mortality events in people with diabetes, we conducted an updated meta-analysis of published randomised controlled trials (RCTs) and a pooled analysis of individual participant data (IPD) from three trials.MethodsRandomised controlled trials of aspirin compared with placebo (or no treatment) in participants with diabetes with no known CVD were identified from MEDLINE, Embase, Cochrane Library, and manual search of bibliographies to January 2019. Relative risks with 95% confidence intervals were used as the summary measures of associations.ResultsWe included 12 RCTs based on 34,227 participants with a median treatment duration of 5.0 years. Comparing aspirin use with no aspirin, there was a significant reduction in risk of major adverse cardiovascular events (MACE)0.89 (0.83-0.95), with a number needed to treat (NNT)of 95 (95% CI 61 to 208) to prevent one MACE over 5 years average follow-up. Evidence was lacking of heterogeneity and publication bias among contributing trials for MACE. Aspirin use had no effect on other endpoints including all-cause mortality; however, there was a significant reduction in stroke for aspirin dosage ≤ 100 mg/day 0.75 (0.59-0.95). There were no significant effects of aspirin use on major bleeding and other bleeding events, though some of the estimates were imprecise. Pooled IPD from the three trials (2306 participants) showed no significant evidence of an effect of aspirin on any of the outcomes evaluated; however, aspirin reduced the risk of MACE in non-smokers 0.70 (0.51-0.96) with a NNT of 33 (95% CI 20 to 246) to prevent one MACE.ConclusionsAspirin has potential benefits in cardiovascular primary prevention in diabetes. The use of low dose aspirin may need to be individualised and based on each individual's baseline CVD and bleeding risk. Systematic review registration PROSPERO: CRD42019122326.

Project description:BackgroundIndividual patient data (IPD) meta-analysis of existing randomized controlled trials (RCTs) is a promising approach to achieving sufficient statistical power to identify sub-groups. We created a repository of IPD from multiple low back pain (LBP) RCTs to facilitate a study of treatment moderators. Due to sparse heterogeneous data, the repository needed to be robust and flexible to accommodate millions of data points prior to any subsequent analysis.MethodsWe systematically identified RCTs of therapist delivered intervention for inclusion to the repository. Some were obtained through project publicity. We requested both individual items and aggregate scores of all baseline characteristics and outcomes for all available time points. The repository is made up of a hybrid database: entity-attribute-value and relational database which is capable of storing sparse heterogeneous datasets. We developed a bespoke software program to extract, transform and upload the shared data.ResultsThere were 20 datasets with more than 3 million data points from 9328 participants. All trials collected covariates and outcomes data at baseline and follow-ups. The bespoke standardized repository is flexible to accommodate millions of data points without compromising data integrity. Data are easily retrieved for analysis using standard statistical programs.ConclusionsThe bespoke hybrid repository is complex to implement and to query but its flexibility in supporting datasets with varying sets of responses and outcomes with different data types is a worthy trade off. The large standardized LBP dataset is also an important resource useable by other LBP researchers.SignificanceA flexible adaptive database for pain studies that can easily be expanded for future researchers to map, transform and upload their data in a safe and secure environment. The data are standardized and harmonized which will facilitate future requests from other researchers for secondary analyses.

Project description:BackgroundPeople with comorbidities are underrepresented in clinical trials. Empirical estimates of treatment effect modification by comorbidity are lacking, leading to uncertainty in treatment recommendations. We aimed to produce estimates of treatment effect modification by comorbidity using individual participant data (IPD).Methods and findingsWe obtained IPD for 120 industry-sponsored phase 3/4 trials across 22 index conditions (n = 128,331). Trials had to be registered between 1990 and 2017 and have recruited ≥300 people. Included trials were multicentre and international. For each index condition, we analysed the outcome most frequently reported in the included trials. We performed a two-stage IPD meta-analysis to estimate modification of treatment effect by comorbidity. First, for each trial, we modelled the interaction between comorbidity and treatment arm adjusted for age and sex. Second, for each treatment within each index condition, we meta-analysed the comorbidity-treatment interaction terms from each trial. We estimated the effect of comorbidity measured in 3 ways: (i) the number of comorbidities (in addition to the index condition); (ii) presence or absence of the 6 commonest comorbid diseases for each index condition; and (iii) using continuous markers of underlying conditions (e.g., estimated glomerular filtration rate (eGFR)). Treatment effects were modelled on the usual scale for the type of outcome (absolute scale for numerical outcomes, relative scale for binary outcomes). Mean age in the trials ranged from 37.1 (allergic rhinitis trials) to 73.0 (dementia trials) and percentage of male participants range from 4.4% (osteoporosis trials) to 100% (benign prostatic hypertrophy trials). The percentage of participants with 3 or more comorbidities ranged from 2.3% (allergic rhinitis trials) to 57% (systemic lupus erythematosus trials). We found no evidence of modification of treatment efficacy by comorbidity, for any of the 3 measures of comorbidity. This was the case for 20 conditions for which the outcome variable was continuous (e.g., change in glycosylated haemoglobin in diabetes) and for 3 conditions in which the outcomes were discrete events (e.g., number of headaches in migraine). Although all were null, estimates of treatment effect modification were more precise in some cases (e.g., sodium-glucose co-transporter-2 (SGLT2) inhibitors for type 2 diabetes-interaction term for comorbidity count 0.004, 95% CI -0.01 to 0.02) while for others credible intervals were wide (e.g., corticosteroids for asthma-interaction term -0.22, 95% CI -1.07 to 0.54). The main limitation is that these trials were not designed or powered to assess variation in treatment effect by comorbidity, and relatively few trial participants had >3 comorbidities.ConclusionsAssessments of treatment effect modification rarely consider comorbidity. Our findings demonstrate that for trials included in this analysis, there was no empirical evidence of treatment effect modification by comorbidity. The standard assumption used in evidence syntheses is that efficacy is constant across subgroups, although this is often criticised. Our findings suggest that for modest levels of comorbidities, this assumption is reasonable. Thus, trial efficacy findings can be combined with data on natural history and competing risks to assess the likely overall benefit of treatments in the context of comorbidity.

Project description:BackgroundNephrology has a limited number of randomized controlled trials (RCTs). The quality of randomized trials is compromised further when not all participants randomly assigned are accounted for transparently.ObjectivesSystematically evaluate RCTs in individuals with chronic kidney disease regarding reporting and accounting of data missing in outcome analysis.Study designDe novo empirical evaluation.Setting & populationEnglish-language parallel-group design RCTs in adults with chronic kidney disease on dialysis therapy or with a kidney transplant published in MEDLINE in 2007 and 2008.Outcomes & measurements(1) How often was there loss to analysis, defined as not all randomly assigned participants included in primary outcome analysis? (2) How often was intention-to-treat analysis complete; in other words, included all randomly assigned participants in their originally allocated group? (3) How often were methods of data imputation reported?ResultsOf 196 eligible RCTs, 27% did not clearly describe a primary outcome, 5% did not provide numbers of patients randomly assigned and analyzed, and 12% used time-to-event analysis. Of the remaining 110 studies, 58% had some loss to analysis, with a median loss to analysis of 10%. Fifty-four percent of trials claimed to have performed an intention-to-treat analysis, but only 44% of those included all participants randomly assigned. Only 5 of 110 (5%) studies mentioned imputation of missing data.LimitationsEvaluation is restricted to analysis of primary study outcome. Only English-language publications were included. Exclusion of time-to-event analyses.ConclusionsIn variance to the reporting standards of CONSORT (Consolidated Standards of Reporting Trials), we found primary outcome designation missing in one-fourth of trials and poor quality in reporting and accounting of primary outcome data lost to analysis. Greater attention to transparency in handling and reporting loss to analysis will enhance the quality of trials in individuals with chronic kidney disease.

Dataset Information

Pooling individual participant data from randomized controlled trials: Exploring potential loss of information.

Publications

Pooling individual participant data from randomized controlled trials: Exploring potential loss of information.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets