Unknown

Dataset Information

0

Figure-associated text summarization and evaluation.


ABSTRACT: Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered throughout its full-text article and contains redundant information content. In this paper, we report the continued development and evaluation of several figure summarization systems, the FigSum+ systems, that automatically identify associated texts, remove redundant information, and generate a text summary for every figure in an article. Using a set of 94 annotated figures selected from 19 different journals, we conducted an intrinsic evaluation of FigSum+. We evaluate the performance by precision, recall, F1, and ROUGE scores. The best FigSum+ system is based on an unsupervised method, achieving F1 score of 0.66 and ROUGE-1 score of 0.97. The annotated data is available at figshare.com (http://figshare.com/articles/Figure_Associated_Text_Summarization_and_Evaluation/858903).

SUBMITTER: Polepalli Ramesh B 

PROVIDER: S-EPMC4313946 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

Figure-associated text summarization and evaluation.

Polepalli Ramesh Balaji B   Sethi Ricky J RJ   Yu Hong H  

PloS one 20150202 2


Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered th  ...[more]

Similar Datasets

| S-EPMC10280265 | biostudies-literature
| S-EPMC8449627 | biostudies-literature
| S-EPMC8521877 | biostudies-literature
| S-EPMC4261035 | biostudies-literature
| S-EPMC10280405 | biostudies-literature
| S-EPMC9265758 | biostudies-literature
| S-EPMC6454593 | biostudies-literature
| S-EPMC7647812 | biostudies-literature
| S-EPMC2217662 | biostudies-literature
| S-EPMC8011436 | biostudies-literature