Ontology highlight
ABSTRACT: Objective
Although lower respiratory infections (LRI) are among the leading causes of mortality in the US, their association with underlying factors and geographic variation have not been adequately examined.Methods
In this study, explanatory variables (n = 46) including climatic, topographic, socio-economic, and demographic factors were compiled at the county level across the continentalUS.Machine learning algorithms - logistic regression (LR), random forest (RF), gradient boosting decision trees (GBDT), k-nearest neighbors (KNN), and support vector machine (SVM) - were employed to predict the presence/absence of hotspots (P < 0.05) for elevated age-adjusted LRI mortality rates in a geographic information system framework.Results
Overall, there was a historical shift in hotspots away from the western US into the southeastern parts of the country and they were highly localized in a few counties. The two decision tree methods (RF and GBDT) outperformed the other algorithms (accuracies: 0.92; F1-scores: 0.85 and 0.84; area under the precision-recall curve: 0.84 and 0.83, respectively). Moreover, the results of the RF and GBDT indicated that higher spring minimum temperature, increased winter precipitation, and higher annual median household income were among the most substantial factors in predicting the hotspots.Conclusions
This study helps raise awareness of public health decision-makers to develop and target LRI prevention programs.
SUBMITTER: Mollalo A
PROVIDER: S-EPMC7442929 | biostudies-literature | 2020 Oct
REPOSITORIES: biostudies-literature
Mollalo Abolfazl A Vahedi Behrooz B Bhattarai Shreejana S Hopkins Laura C LC Banik Swagata S Vahedi Behzad B
International journal of medical informatics 20200822
<h4>Objective</h4>Although lower respiratory infections (LRI) are among the leading causes of mortality in the US, their association with underlying factors and geographic variation have not been adequately examined.<h4>Methods</h4>In this study, explanatory variables (n = 46) including climatic, topographic, socio-economic, and demographic factors were compiled at the county level across the continentalUS.Machine learning algorithms - logistic regression (LR), random forest (RF), gradient boost ...[more]