Unknown

Dataset Information

0

Large-scale extraction of brain connectivity from the neuroscientific literature.


ABSTRACT: MOTIVATION: In neuroscience, as in many other scientific domains, the primary form of knowledge dissemination is through published articles. One challenge for modern neuroinformatics is finding methods to make the knowledge from the tremendous backlog of publications accessible for search, analysis and the integration of such data into computational models. A key example of this is metascale brain connectivity, where results are not reported in a normalized repository. Instead, these experimental results are published in natural language, scattered among individual scientific publications. This lack of normalization and centralization hinders the large-scale integration of brain connectivity results. In this article, we present text-mining models to extract and aggregate brain connectivity results from 13.2 million PubMed abstracts and 630?216 full-text publications related to neuroscience. The brain regions are identified with three different named entity recognizers (NERs) and then normalized against two atlases: the Allen Brain Atlas (ABA) and the atlas from the Brain Architecture Management System (BAMS). We then use three different extractors to assess inter-region connectivity. RESULTS: NERs and connectivity extractors are evaluated against a manually annotated corpus. The complete in litero extraction models are also evaluated against in vivo connectivity data from ABA with an estimated precision of 78%. The resulting database contains over 4 million brain region mentions and over 100?000 (ABA) and 122?000 (BAMS) potential brain region connections. This database drastically accelerates connectivity literature review, by providing a centralized repository of connectivity data to neuroscientists.

SUBMITTER: Richardet R 

PROVIDER: S-EPMC4426844 | biostudies-other | 2015 May

REPOSITORIES: biostudies-other

altmetric image

Publications

Large-scale extraction of brain connectivity from the neuroscientific literature.

Richardet Renaud R   Chappelier Jean-Cédric JC   Telefont Martin M   Hill Sean S  

Bioinformatics (Oxford, England) 20150120 10


<h4>Motivation</h4>In neuroscience, as in many other scientific domains, the primary form of knowledge dissemination is through published articles. One challenge for modern neuroinformatics is finding methods to make the knowledge from the tremendous backlog of publications accessible for search, analysis and the integration of such data into computational models. A key example of this is metascale brain connectivity, where results are not reported in a normalized repository. Instead, these expe  ...[more]

Similar Datasets

| S-EPMC3629104 | biostudies-literature
| S-EPMC8436785 | biostudies-literature
| S-EPMC4681986 | biostudies-literature
| S-EPMC4601701 | biostudies-literature
| S-EPMC6155203 | biostudies-literature
| S-EPMC7701248 | biostudies-literature
| S-EPMC6374165 | biostudies-literature
| S-EPMC7267969 | biostudies-literature
| S-EPMC5318284 | biostudies-literature
| S-EPMC5840443 | biostudies-literature