Unknown

Dataset Information

0

An infodemiological framework for tracking the spread of SARS-CoV-2 using integrated public data.


ABSTRACT: The outbreak of the SARS-CoV-2 novel coronavirus has caused a health crisis of immeasurable magnitude. Signals from heterogeneous public data sources could serve as early predictors for infection waves of the pandemic, particularly in its early phases, when infection data was scarce. In this article, we characterize temporal pandemic indicators by leveraging an integrated set of public data and apply them to a Prophet model to predict COVID-19 trends. An effective natural language processing pipeline was first built to extract time-series signals of specific articles from a news corpus. Bursts of these temporal signals were further identified with Kleinberg's burst detection algorithm. Across different US states, correlations for Google Trends of COVID-19 related terms, COVID-19 news volume, and publicly available wastewater SARS-CoV-2 measurements with weekly COVID-19 case numbers were generally high with lags ranging from 0 to 3 weeks, indicating them as strong predictors of viral spread. Incorporating time-series signals of these effective predictors significantly improved the performance of the Prophet model, which was able to predict the COVID-19 case numbers between one and two weeks with average mean absolute error rates of 0.38 and 0.46 respectively across different states.

SUBMITTER: Liu Z 

PROVIDER: S-EPMC9040481 | biostudies-literature | 2022 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

An infodemiological framework for tracking the spread of SARS-CoV-2 using integrated public data.

Liu Zhimin Z   Jiang Zuodong Z   Kip Geoffrey G   Snigdha Kirti K   Xu Jennings J   Wu Xiaoying X   Khan Najat N   Schultz Timothy T  

Pattern recognition letters 20220426


The outbreak of the SARS-CoV-2 novel coronavirus has caused a health crisis of immeasurable magnitude. Signals from heterogeneous public data sources could serve as early predictors for infection waves of the pandemic, particularly in its early phases, when infection data was scarce. In this article, we characterize temporal pandemic indicators by leveraging an integrated set of public data and apply them to a Prophet model to predict COVID-19 trends. An effective natural language processing pip  ...[more]

Similar Datasets

| S-EPMC5964307 | biostudies-literature
| S-EPMC10569657 | biostudies-literature
| S-EPMC8355311 | biostudies-literature
| S-EPMC9156676 | biostudies-literature
| S-EPMC9765449 | biostudies-literature
| S-EPMC10913390 | biostudies-literature
| S-EPMC7543729 | biostudies-literature
| S-EPMC8111700 | biostudies-literature
| S-EPMC9349760 | biostudies-literature