Ontology highlight
ABSTRACT: Background
Concept recognition is a term that corresponds to the two sequential steps of named entity recognition and named entity normalization, and plays an essential role in the field of bioinformatics. However, the conventional dictionary-based methods did not sufficiently addressed the variation of the concepts in actual use in literature, resulting in the particularly degraded performances in recognition of multi-token concepts.Results
In this paper, we propose a concept recognition method of multi-token biological entities using neural models combined with literature contexts. The key aspect of our method is utilizing the contextual information from the biological knowledge-bases for concept normalization, which is followed by named entity recognition procedure. The model showed improved performances over conventional methods, particularly for multi-token concepts with higher variations.Conclusions
We expect that our model can be utilized for effective concept recognition and variety of natural language processing tasks on bioinformatics.
SUBMITTER: Kim K
PROVIDER: S-EPMC8529713 | biostudies-literature | 2021 Oct
REPOSITORIES: biostudies-literature
BMC bioinformatics 20211021 Suppl 11
<h4>Background</h4>Concept recognition is a term that corresponds to the two sequential steps of named entity recognition and named entity normalization, and plays an essential role in the field of bioinformatics. However, the conventional dictionary-based methods did not sufficiently addressed the variation of the concepts in actual use in literature, resulting in the particularly degraded performances in recognition of multi-token concepts.<h4>Results</h4>In this paper, we propose a concept re ...[more]