Unknown

Dataset Information

0

Influenza-like illness surveillance on Twitter through automated learning of naive language.


ABSTRACT: Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algorithm that exploits the abundance of health-related web pages to identify all jargon expressions related to a specific technical term. We then translated an influenza case definition into a Boolean query, each symptom being described by a technical term and all related jargon expressions, as identified by the algorithm. Subsequently, we monitored all tweets that reported a combination of symptoms satisfying the case definition query. In order to geolocalize messages, we defined 3 localization strategies based on codes associated with each tweet. We found a high correlation coefficient between the trend of our influenza-positive tweets and ILI trends identified by US traditional surveillance systems.

SUBMITTER: Gesualdo F 

PROVIDER: S-EPMC3853203 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Influenza-like illness surveillance on Twitter through automated learning of naïve language.

Gesualdo Francesco F   Stilo Giovanni G   Agricola Eleonora E   Gonfiantini Michaela V MV   Pandolfi Elisabetta E   Velardi Paola P   Tozzi Alberto E AE  

PloS one 20131204 12


Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algorithm that exploits the abundance of health-related web pages to identify all jargon expressions related to a specific technical term. We then translated an influenza case definition into a Boolean que  ...[more]

Similar Datasets

| S-EPMC4959719 | biostudies-literature
| S-EPMC3739526 | biostudies-literature
| S-EPMC7450367 | biostudies-literature
| S-EPMC6839169 | biostudies-literature
| S-EPMC6478375 | biostudies-literature
| S-EPMC5617904 | biostudies-literature
| S-EPMC8392992 | biostudies-literature
| S-EPMC2988734 | biostudies-literature
| S-EPMC5533314 | biostudies-literature
| S-EPMC5226807 | biostudies-literature