Unknown

Dataset Information

0

The HCUP SID Imputation Project: Improving Statistical Inferences for Health Disparities Research by Imputing Missing Race Data.


ABSTRACT:

Objective

To identify the most appropriate imputation method for missing data in the HCUP State Inpatient Databases (SID) and assess the impact of different missing data methods on racial disparities research.

Data sources/study setting

HCUP SID.

Study design

A novel simulation study compared four imputation methods (random draw, hot deck, joint multiple imputation [MI], conditional MI) for missing values for multiple variables, including race, gender, admission source, median household income, and total charges. The simulation was built on real data from the SID to retain their hierarchical data structures and missing data patterns. Additional predictive information from the U.S. Census and American Hospital Association (AHA) database was incorporated into the imputation.

Principal findings

Conditional MI prediction was equivalent or superior to the best performing alternatives for all missing data structures and substantially outperformed each of the alternatives in various scenarios.

Conclusions

Conditional MI substantially improved statistical inferences for racial health disparities research with the SID.

SUBMITTER: Ma Y 

PROVIDER: S-EPMC5980335 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

The HCUP SID Imputation Project: Improving Statistical Inferences for Health Disparities Research by Imputing Missing Race Data.

Ma Yan Y   Zhang Wei W   Lyman Stephen S   Huang Yihe Y  

Health services research 20170504 3


<h4>Objective</h4>To identify the most appropriate imputation method for missing data in the HCUP State Inpatient Databases (SID) and assess the impact of different missing data methods on racial disparities research.<h4>Data sources/study setting</h4>HCUP SID.<h4>Study design</h4>A novel simulation study compared four imputation methods (random draw, hot deck, joint multiple imputation [MI], conditional MI) for missing values for multiple variables, including race, gender, admission source, med  ...[more]

Similar Datasets

| S-EPMC8580266 | biostudies-literature
| S-EPMC6063210 | biostudies-literature
| S-EPMC6693809 | biostudies-other
| S-EPMC7274349 | biostudies-literature