Unknown

Dataset Information

0

Extracting and modeling geographic information from scientific articles.


ABSTRACT: Scientific articles often contain relevant geographic information such as where field work was performed or where patients were treated. Most often, this information appears in the full-text article contents as a description in natural language including place names, with no accompanying machine-readable geographic metadata. Automatically extracting this geographic information could help conduct meta-analyses, find geographical research gaps, and retrieve articles using spatial search criteria. Research on this problem is still in its infancy, with many works manually processing corpora for locations and few cross-domain studies. In this paper, we develop a fully automatic pipeline to extract and represent relevant locations from scientific articles, applying it to two varied corpora. We obtain good performance, with full pipeline precision of 0.84 for an environmental corpus, and 0.78 for a biomedical corpus. Our results can be visualized as simple global maps, allowing human annotators to both explore corpus patterns in space and triage results for downstream analysis. Future work should not only focus on improving individual pipeline components, but also be informed by user needs derived from the potential spatial analysis and exploration of such corpora.

SUBMITTER: Acheson E 

PROVIDER: S-EPMC7787447 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

altmetric image

Publications

Extracting and modeling geographic information from scientific articles.

Acheson Elise E   Purves Ross S RS  

PloS one 20210106 1


Scientific articles often contain relevant geographic information such as where field work was performed or where patients were treated. Most often, this information appears in the full-text article contents as a description in natural language including place names, with no accompanying machine-readable geographic metadata. Automatically extracting this geographic information could help conduct meta-analyses, find geographical research gaps, and retrieve articles using spatial search criteria.  ...[more]

Similar Datasets

| S-EPMC7302801 | biostudies-literature
| S-EPMC10210167 | biostudies-literature
| S-EPMC5970438 | biostudies-literature
| S-EPMC3475109 | biostudies-literature
| S-EPMC2995677 | biostudies-literature
| S-EPMC2995679 | biostudies-literature
| S-EPMC4798794 | biostudies-literature
| S-EPMC3653959 | biostudies-literature
| S-EPMC10087802 | biostudies-literature
| S-EPMC3441580 | biostudies-literature