Semi-supervised Extractive Question Summarization Using Question-Answer Pairs
Ontology highlight
ABSTRACT: Neural extractive summarization methods often require much labeled training data, for which headlines or lead summaries of news articles can sometimes be used. Such directly useful summaries are not always available, however, especially for user-generated content, such as questions posted on community question answering services. In this paper, we address an extractive summarization (i.e., headline extraction) task for such questions as a case study and consider how to alleviate the problem by using question-answer pairs, instead of missing-headline pairs. To this end, we propose a framework to examine how to use such unlabeled paired data from the viewpoint of training methods. Experimental results show that multi-task training performs well with undersampling and distant supervision.
SUBMITTER: Jose J
PROVIDER: S-EPMC7148067 | biostudies-literature | 2020 Mar
REPOSITORIES: biostudies-literature
ACCESS DATA