Dataset Information

Early Stage Machine Learning-Based Prediction of US County Vulnerability to the COVID-19 Pandemic: Machine Learning Approach.

ABSTRACT:

Background

The rapid spread of COVID-19 means that government and health services providers have little time to plan and design effective response policies. It is therefore important to quickly provide accurate predictions of how vulnerable geographic regions such as counties are to the spread of this virus.

Objective

The aim of this study is to develop county-level prediction around near future disease movement for COVID-19 occurrences using publicly available data.

Methods

We estimated county-level COVID-19 occurrences for the period March 14 to 31, 2020, based on data fused from multiple publicly available sources inclusive of health statistics, demographics, and geographical features. We developed a three-stage model using XGBoost, a machine learning algorithm, to quantify the probability of COVID-19 occurrence and estimate the number of potential occurrences for unaffected counties. Finally, these results were combined to predict the county-level risk. This risk was then used as an estimated after-five-day-vulnerability of the county.

Results

The model predictions showed a sensitivity over 71% and specificity over 94% for models built using data from March 14 to 31, 2020. We found that population, population density, percentage of people aged >70 years, and prevalence of comorbidities play an important role in predicting COVID-19 occurrences. We observed a positive association at the county level between urbanicity and vulnerability to COVID-19.

Conclusions

The developed model can be used for identification of vulnerable counties and potential data discrepancies. Limited testing facilities and delayed results introduce significant variation in reported cases, which produces a bias in the model.

SUBMITTER: Mehta M

PROVIDER: S-EPMC7490002 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Early Stage Machine Learning-Based Prediction of US County Vulnerability to the COVID-19 Pandemic: Machine Learning Approach.

Mehta Mihir M Julaiti Juxihong J Griffin Paul P Kumara Soundar S

JMIR public health and surveillance 20200911 3

<h4>Background</h4>The rapid spread of COVID-19 means that government and health services providers have little time to plan and design effective response policies. It is therefore important to quickly provide accurate predictions of how vulnerable geographic regions such as counties are to the spread of this virus.<h4>Objective</h4>The aim of this study is to develop county-level prediction around near future disease movement for COVID-19 occurrences using publicly available data.<h4>Methods</h ...[more]

PMID: 32784193

Dataset Information

Early Stage Machine Learning-Based Prediction of US County Vulnerability to the COVID-19 Pandemic: Machine Learning Approach.

Background

Objective

Methods

Results

Conclusions

Publications

Early Stage Machine Learning-Based Prediction of US County Vulnerability to the COVID-19 Pandemic: Machine Learning Approach.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Machine learning assisted breathomic approach for early-stage thoracic cancer detection.
| S-EPMC12483886 | biostudies-literature

Prediction of Breast Cancer Estrogen Receptor Status using Machine Learning
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress

Personalized prediction of early childhood asthma persistence: A machine learning approach.
| S-EPMC7920380 | biostudies-literature

A machine learning approach to predict self-protecting behaviors during the early wave of the COVID-19 pandemic.
| S-EPMC10103659 | biostudies-literature

Twitter Discussions and Emotions About the COVID-19 Pandemic: Machine Learning Approach.
| S-EPMC7690968 | biostudies-literature

Population cohort-validated PM2.5-induced gene signatures: A Machine Learning Approach to Individual Exposure Prediction
2026-05-01 | GSE298585 | GEO

COVID-19 ICU mortality prediction: a machine learning approach using SuperLearner algorithm.
| S-EPMC8413709 | biostudies-literature

Early prediction of end-stage kidney disease using electronic health record data: a machine learning approach with a 2-year horizon.
| S-EPMC10898824 | biostudies-literature

Clinical Features of Emergency Department Patients from Early COVID-19 Pandemic that Predict SARS-CoV-2 Infection: Machine-learning Approach.
| S-EPMC7972393 | biostudies-literature

Machine Learning Based Clinical Decision Support System for Early COVID-19 Mortality Prediction.
| S-EPMC8149622 | biostudies-literature