Unknown

Dataset Information

0

Aggregating and Predicting Sequence Labels from Crowd Annotations.


ABSTRACT: Despite sequences being core to NLP, scant work has considered how to handle noisy sequence labels from multiple annotators for the same text. Given such annotations, we consider two complementary tasks: (1) aggregating sequential crowd labels to infer a best single set of consensus annotations; and (2) using crowd annotations as training data for a model that can predict sequences in unannotated text. For aggregation, we propose a novel Hidden Markov Model variant. To predict sequences in unannotated text, we propose a neural approach using Long Short Term Memory. We evaluate a suite of methods across two different applications and text genres: Named-Entity Recognition in news articles and Information Extraction from biomedical abstracts. Results show improvement over strong baselines. Our source code and data are available online.

SUBMITTER: Nguyen AT 

PROVIDER: S-EPMC5662012 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

Aggregating and Predicting Sequence Labels from Crowd Annotations.

Nguyen An T AT   Wallace Byron C BC   Li Junyi Jessy JJ   Nenkova Ani A   Lease Matthew M  

Proceedings of the conference. Association for Computational Linguistics. Meeting 20170101


Despite sequences being core to NLP, scant work has considered how to handle noisy sequence labels from multiple annotators for the same text. Given such annotations, we consider two complementary tasks: (1) aggregating sequential crowd labels to infer a best single set of consensus annotations; and (2) using crowd annotations as training data for a model that can predict sequences in unannotated text. For aggregation, we propose a novel Hidden Markov Model variant. To predict sequences in unann  ...[more]

Similar Datasets

| S-EPMC7549292 | biostudies-literature
| S-EPMC4384381 | biostudies-literature
| S-EPMC5963392 | biostudies-literature
| S-EPMC4108922 | biostudies-literature
| S-EPMC10640596 | biostudies-literature
| S-EPMC2025603 | biostudies-literature
| S-EPMC3712327 | biostudies-literature
| S-EPMC6309178 | biostudies-literature
| S-EPMC5521088 | biostudies-literature
| S-EPMC430176 | biostudies-literature