ABSTRACT:
SUBMITTER: Nguyen AT
PROVIDER: S-EPMC5662012 | biostudies-literature | 2017
REPOSITORIES: biostudies-literature
Nguyen An T, Wallace Byron C, Li Junyi Jessy, Nenkova Ani, Lease Matthew
Proceedings of the conference. Association for Computational Linguistics. Meeting, 2017-01-01
Despite sequences being core to NLP, scant work has considered how to handle noisy sequence labels from multiple annotators for the same text. Given such annotations, we consider two complementary tasks: (1) aggregating sequential crowd labels to infer a best single set of consensus annotations; and (2) using crowd annotations as training data for a model that can predict sequences in unannotated text. For aggregation, we propose a novel Hidden Markov Model variant. To predict sequences in unann ...