Unknown

Dataset Information

0

Keyphrase Identification Using Minimal Labeled Data with Hierarchical Context and Transfer Learning.


ABSTRACT: Interoperable clinical decision support system (CDSS) rules provide a pathway to interoperability, a well-recognized challenge in health information technology. Building an ontology facilitates creating interoperable CDSS rules, which can be achieved by identifying the keyphrases (KP) from the existing literature. However, KP identification for data labeling requires human expertise, consensus, and contextual understanding. This paper aims to present a semi-supervised KP identification framework using minimal labeled data based on hierarchical attention over the documents and domain adaptation. Our method outperforms the prior neural architectures by learning through synthetic labels for initial training, document-level contextual learning, language modeling, and fine-tuning with limited gold standard label data. To the best of our knowledge, this is the first functional framework for the CDSS sub-domain to identify KPs, which is trained on limited labeled data. It contributes to the general natural language processing (NLP) architectures in areas such as clinical NLP, where manual data labeling is challenging, and light-weighted deep learning models play a role in real-time KP identification as a complementary approach to human experts' effort.

SUBMITTER: Goli R 

PROVIDER: S-EPMC10246160 | biostudies-literature | 2023 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Keyphrase Identification Using Minimal Labeled Data with Hierarchical Contexts and Transfer Learning.

Goli Rohan R   Komatineni Keerthana K   Alluri Shailesh S   Hubig Nina N   Min Hua H   Gong Yang Y   Sittig Dean F DF   Rennert Lior L   Robinson David D   Biondich Paul P   Wright Adam A   Nøhr Christian C   Law Timothy T   Faxvaag Arild A   Weaver Aneesa A   Gimbel Ronald R   Jing Xia X  

medRxiv : the preprint server for health sciences 20241118


<h4>Background</h4>Interoperable clinical decision support system (CDSS) rules provide a pathway to interoperability, a well-recognized challenge in health information technology. Building an ontology facilitates creating interoperable CDSS rules, which can be achieved by identifying the keyphrases (KP) from the existing literature. Ontology construction is traditionally a manual effort by human domain experts, and the newly advanced natural language processing techniques, such as KP identificat  ...[more]

Similar Datasets

| S-EPMC5070523 | biostudies-literature
| S-EPMC11782500 | biostudies-literature
| S-EPMC6813555 | biostudies-literature
| S-EPMC9222348 | biostudies-literature
| S-EPMC7921640 | biostudies-literature
2024-03-28 | GSE227087 | GEO
| S-EPMC9011006 | biostudies-literature
2021-04-07 | GSE171636 | GEO
| S-EPMC10280182 | biostudies-literature
| S-EPMC9677468 | biostudies-literature