Unknown

Dataset Information

0

Filtering Entities to Optimize Identification of Adverse Drug Reaction From Social Media: How Can the Number of Words Between Entities in the Messages Help?


ABSTRACT:

Background

With the increasing popularity of Web 2.0 applications, social media has made it possible for individuals to post messages on adverse drug reactions. In such online conversations, patients discuss their symptoms, medical history, and diseases. These disorders may correspond to adverse drug reactions (ADRs) or any other medical condition. Therefore, methods must be developed to distinguish between false positives and true ADR declarations.

Objective

The aim of this study was to investigate a method for filtering out disorder terms that did not correspond to adverse events by using the distance (as number of words) between the drug term and the disorder or symptom term in the post. We hypothesized that the shorter the distance between the disorder name and the drug, the higher the probability to be an ADR.

Methods

We analyzed a corpus of 648 messages corresponding to a total of 1654 (drug and disorder) pairs from 5 French forums using Gaussian mixture models and an expectation-maximization (EM) algorithm .

Results

The distribution of the distances between the drug term and the disorder term enabled the filtering of 50.03% (733/1465) of the disorders that were not ADRs. Our filtering strategy achieved a precision of 95.8% and a recall of 50.0%.

Conclusions

This study suggests that such distance between terms can be used for identifying false positives, thereby improving ADR detection in social media.

SUBMITTER: Abdellaoui R 

PROVIDER: S-EPMC5500778 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Filtering Entities to Optimize Identification of Adverse Drug Reaction From Social Media: How Can the Number of Words Between Entities in the Messages Help?

Abdellaoui Redhouane R   Schück Stéphane S   Texier Nathalie N   Burgun Anita A  

JMIR public health and surveillance 20170622 2


<h4>Background</h4>With the increasing popularity of Web 2.0 applications, social media has made it possible for individuals to post messages on adverse drug reactions. In such online conversations, patients discuss their symptoms, medical history, and diseases. These disorders may correspond to adverse drug reactions (ADRs) or any other medical condition. Therefore, methods must be developed to distinguish between false positives and true ADR declarations.<h4>Objective</h4>The aim of this study  ...[more]

Similar Datasets

| S-EPMC8422061 | biostudies-literature
| S-EPMC4143012 | biostudies-other
| S-EPMC6837202 | biostudies-literature
| S-EPMC5784043 | biostudies-literature
| S-EPMC6315252 | biostudies-literature
| S-EPMC4457113 | biostudies-literature
| S-EPMC8053157 | biostudies-literature
| S-EPMC8276184 | biostudies-literature
| S-EPMC2842424 | biostudies-other
| S-EPMC3105296 | biostudies-literature