Dataset Information

Gaps within the Biomedical Literature: Initial Characterization and Assessment of Strategies for Discovery.


ABSTRACT: Within well-established fields of biomedical science, we identify "gaps": topical areas of investigation that might be expected to occur but are missing. We define a field by carrying out a topical PubMed query and analyzing the Medical Subject Headings (MeSH terms) by which the set of retrieved articles is indexed. MeSH terms that occur in >1% of the articles are examined pairwise to see how often they are predicted to co-occur within individual articles, assuming that they are independent of each other. A pair of MeSH terms that is predicted to co-occur in at least 10 articles, yet is not observed to co-occur in any article, is a "gap". Such gaps were studied further in a corpus of 10 disease-related article sets and 10 article sets related to biological processes. Overall, articles that filled gaps were cited more heavily than non-gap-filling articles and were 61% more likely to be published in multidisciplinary high-impact journals. Nine different features of these "gaps" were characterized and tested to learn which, if any, correlate with the appearance of one or more articles containing both MeSH terms within the next five years. Several different types of gaps were identified, each having distinct combinations of predictive features: a) those arising as a byproduct of MeSH indexing rules; b) those having little biological meaning; c) those representing "low hanging fruit" for immediate exploitation; and d) those representing gaps across disciplines or sub-disciplines that do not talk to each other or work together. We have built a free, open tool called "Mine the Gap!" that identifies and characterizes the "gaps" for any PubMed query, which can be accessed via the Anne O'Tate value-added PubMed search interface (http://arrowsmith.psych.uic.edu/cgi-bin/arrowsmith_uic/AnneOTate.cgi).
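
The gap criterion described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' "Mine the Gap!" implementation. The function name `find_gaps`, its parameters, and the input layout (one set of MeSH terms per article) are hypothetical. The sketch also assumes that the predicted co-occurrence under independence is count(A) × count(B) / N for an article set of size N, which is the standard estimate but may differ in detail from the paper's calculation.

```python
from collections import Counter
from itertools import combinations


def find_gaps(articles, min_term_fraction=0.01, min_expected=10.0):
    """Return (term_a, term_b, expected) for frequent term pairs never seen together.

    articles: list of sets of MeSH terms, one set per retrieved article
    (a hypothetical input layout chosen for this sketch).
    """
    n = len(articles)
    if n == 0:
        return []

    # Number of articles indexed with each term.
    term_counts = Counter(term for terms in articles for term in set(terms))

    # Keep only terms that occur in more than 1% of the articles.
    frequent = sorted(t for t, c in term_counts.items() if c / n > min_term_fraction)
    keep = set(frequent)

    # Observed co-occurrence counts, keyed by alphabetically ordered pair.
    observed = Counter()
    for terms in articles:
        present = sorted(set(terms) & keep)
        observed.update(combinations(present, 2))

    gaps = []
    for a, b in combinations(frequent, 2):
        # Expected co-occurrences if the two terms were independent (assumed formula).
        expected = term_counts[a] * term_counts[b] / n
        if expected >= min_expected and observed[(a, b)] == 0:
            gaps.append((a, b, expected))
    return gaps
```

Calling `find_gaps` on the MeSH term sets of the articles retrieved by a PubMed query returns the candidate gap pairs together with their predicted co-occurrence counts. The two thresholds default to the values given in the abstract (>1% of articles, at least 10 predicted co-occurrences).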

SUBMITTER: Peng Y 

PROVIDER: S-EPMC5736374 | biostudies-literature | 2017 May

REPOSITORIES: biostudies-literature

Publications

Gaps within the Biomedical Literature: Initial Characterization and Assessment of Strategies for Discovery.

Peng Y, Bonifield G, Smalheiser NR

Frontiers in Research Metrics and Analytics, 2017-05-22


Similar Datasets

| S-EPMC5070740 | biostudies-literature
2013-12-23 | E-GEOD-53091 | biostudies-arrayexpress
2013-12-23 | GSE53091 | GEO
| S-EPMC5741828 | biostudies-literature
| S-EPMC7951980 | biostudies-literature
| S-EPMC3106317 | biostudies-literature
| S-EPMC7001095 | biostudies-literature
| S-EPMC3541249 | biostudies-literature
| S-EPMC2703945 | biostudies-literature
| S-EPMC2655927 | biostudies-literature