Dataset Information

Two Wrongs Make a Right: Addressing Underreporting in Binary Data from Multiple Sources.


ABSTRACT: Media-based event data (i.e., data compiled from reporting by media outlets) are widely used in political science research. However, events of interest (e.g., strikes, protests, conflict) are often underreported by these primary and secondary sources, producing incomplete data that risks inconsistency and bias in subsequent analysis. While general strategies exist to help ameliorate this bias, these methods do not make full use of the information often available to researchers. Specifically, much of the event data used in the social sciences is drawn from multiple, overlapping news sources (e.g., Agence France-Presse, Reuters). Therefore, we propose a novel maximum likelihood estimator that corrects for misclassification in data arising from multiple sources. In the most general formulation of our estimator, researchers can specify separate sets of predictors for the true-event model and each of the misclassification models characterizing whether a source fails to report on an event. As such, researchers are able to accurately test theories on both the causes of and reporting on an event of interest. Simulations show that our technique regularly outperforms current strategies that neglect misclassification, the unique features of the data-generating process, or both. We also illustrate the utility of this method with a model of repression using the Social Conflict in Africa Database.
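To make the idea in the abstract concrete, the following is a minimal sketch of a likelihood for binary event reports from two sources. It is not the authors' estimator. It assumes a logistic true-event model with one covariate, constant per-source reporting probabilities, no false reports (a source never reports an event that did not occur), and sources that report independently given the true event. All function and parameter names (cell_probability, log_likelihood, beta0, beta1, r1, r2) are hypothetical.

```python
import math


def logistic(z):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-z))


def cell_probability(s1, s2, x, beta0, beta1, r1, r2):
    """P(source 1 reports s1, source 2 reports s2 | covariate x).

    Assumptions (illustrative, not taken from the paper):
    - the true event occurs with probability logistic(beta0 + beta1 * x);
    - given a true event, source j reports it with probability rj;
    - given no event, neither source reports one (underreporting only);
    - the two sources report independently given the true event.
    """
    p = logistic(beta0 + beta1 * x)
    # Probability of the observed report pattern when the event did occur.
    report = (r1 if s1 else 1.0 - r1) * (r2 if s2 else 1.0 - r2)
    # When no event occurred, the only possible pattern is (0, 0).
    no_event = 1.0 if (s1 == 0 and s2 == 0) else 0.0
    return p * report + (1.0 - p) * no_event


def log_likelihood(data, beta0, beta1, r1, r2):
    """Sum of log cell probabilities over (s1, s2, x) observations.

    Reporting probabilities must lie strictly between 0 and 1 so that
    every observed pattern has positive probability.
    """
    return sum(
        math.log(cell_probability(s1, s2, x, beta0, beta1, r1, r2))
        for s1, s2, x in data
    )


# Hypothetical toy data: (report by source 1, report by source 2, covariate).
toy_data = [(1, 0, 0.5), (0, 0, -1.0), (1, 1, 2.0), (0, 1, 0.0)]
toy_ll = log_likelihood(toy_data, beta0=-1.0, beta1=2.0, r1=0.6, r2=0.3)
```

Under these assumptions, maximizing log_likelihood over all four parameters with a numerical optimizer would estimate the true-event model and the reporting rates jointly. Coding an event as present whenever any source reports it instead targets p * (1 - (1 - r1) * (1 - r2)), which understates the true-event probability p whenever both reporting rates are below one.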

SUBMITTER: Cook SJ 

PROVIDER: S-EPMC5667662 | biostudies-literature | 2017 Apr

REPOSITORIES: biostudies-literature

Publications

Two Wrongs Make a Right: Addressing Underreporting in Binary Data from Multiple Sources.

Cook, Scott J.; Blas, Betsabe; Carroll, Raymond J.; Sinha, Samiran

Political Analysis: an annual publication of the Methodology Section of the American Political Science Association, 2017-04-11, issue 2


Similar Datasets

| S-EPMC8861112 | biostudies-literature
| S-EPMC6697782 | biostudies-literature
| S-EPMC7589942 | biostudies-literature
| S-EPMC7476641 | biostudies-literature
| S-EPMC5634537 | biostudies-literature
| S-EPMC3123338 | biostudies-literature
| S-EPMC7571608 | biostudies-literature
| S-EPMC8110123 | biostudies-literature
| S-EPMC7789687 | biostudies-literature
| S-EPMC3596844 | biostudies-literature