Unknown

Dataset Information

0

Use of Twitter data to improve Zika virus surveillance in the United States during the 2016 epidemic.


ABSTRACT:

Background

Zika virus (ZIKV) is an emerging mosquito-borne arbovirus that can produce serious public health consequences. In 2016, ZIKV caused an epidemic in many countries around the world, including the United States. ZIKV surveillance and vector control is essential to combating future epidemics. However, challenges relating to the timely publication of case reports significantly limit the effectiveness of current surveillance methods. In many countries with poor infrastructure, established systems for case reporting often do not exist. Previous studies investigating the H1N1 pandemic, general influenza and the recent Ebola outbreak have demonstrated that time- and geo-tagged Twitter data, which is immediately available, can be utilized to overcome these limitations.

Methods

In this study, we employed a recently developed system called Cloudberry to filter a random sample of Twitter data to investigate the feasibility of using such data for ZIKV epidemic tracking on a national and state (Florida) level. Two auto-regressive models were calibrated using weekly ZIKV case counts and zika tweets in order to estimate weekly ZIKV cases 1 week in advance.

Results

While models tended to over-predict at low case counts and under-predict at extreme high counts, a comparison of predicted versus observed weekly ZIKV case counts following model calibration demonstrated overall reasonable predictive accuracy, with an R2 of 0.74 for the Florida model and 0.70 for the U.S.

Model

Time-series analysis of predicted and observed ZIKV cases following internal cross-validation exhibited very similar patterns, demonstrating reasonable model performance. Spatially, the distribution of cumulative ZIKV case counts (local- & travel-related) and zika tweets across all 50?U.S. states showed a high correlation (r?=?0.73) after adjusting for population.

Conclusions

This study demonstrates the value of utilizing Twitter data for the purposes of disease surveillance. This is of high value to epidemiologist and public health officials charged with protecting the public during future outbreaks.

SUBMITTER: Masri S 

PROVIDER: S-EPMC6570872 | biostudies-literature | 2019 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Use of Twitter data to improve Zika virus surveillance in the United States during the 2016 epidemic.

Masri Shahir S   Jia Jianfeng J   Li Chen C   Zhou Guofa G   Lee Ming-Chieh MC   Yan Guiyun G   Wu Jun J  

BMC public health 20190614 1


<h4>Background</h4>Zika virus (ZIKV) is an emerging mosquito-borne arbovirus that can produce serious public health consequences. In 2016, ZIKV caused an epidemic in many countries around the world, including the United States. ZIKV surveillance and vector control is essential to combating future epidemics. However, challenges relating to the timely publication of case reports significantly limit the effectiveness of current surveillance methods. In many countries with poor infrastructure, estab  ...[more]

Similar Datasets

| S-EPMC7067377 | biostudies-literature
| S-EPMC7454085 | biostudies-literature
| S-EPMC10015296 | biostudies-literature
| S-EPMC8363233 | biostudies-literature
| S-EPMC5952990 | biostudies-literature
| S-EPMC8030855 | biostudies-literature
| S-EPMC7181904 | biostudies-literature
| S-EPMC8935355 | biostudies-literature
| S-EPMC8025225 | biostudies-literature
| S-EPMC6578581 | biostudies-literature