Dataset Information

Explainable Prediction of Medical Codes With Knowledge Graphs.

ABSTRACT: International Classification of Diseases (ICD) is an authoritative health care classification system of different diseases. It is widely used for disease and health records, assisted medical reimbursement decisions, and collecting morbidity and mortality statistics. The most existing ICD coding models only translate the simple diagnosis descriptions into ICD codes. And it obscures the reasons and details behind specific diagnoses. Besides, the label (code) distribution is uneven. And there is a dependency between labels. Based on the above considerations, the knowledge graph and attention mechanism were expanded into medical code prediction to improve interpretability. In this study, a new method called G_Coder was presented, which mainly consists of Multi-CNN, graph presentation, attentional matching, and adversarial learning. The medical knowledge graph was constructed by extracting entities related to ICD-9 from freebase. Ontology contains 5 entity classes, which are disease, symptom, medicine, surgery, and examination. The result of G_Coder on the MIMIC-III dataset showed that the micro-F1 score is 69.2% surpassing the state of art. The following conclusions can be obtained through the experiment: G_Coder integrates information across medical records using Multi-CNN and embeds knowledge into ICD codes. Adversarial learning is used to generate the adversarial samples to reconcile the writing styles of doctor. With the knowledge graph and attention mechanism, most relevant segments of medical codes can be explained. This suggests that the knowledge graph significantly improves the precision of code prediction and reduces the working pressure of the human coders.

SUBMITTER: Teng F

PROVIDER: S-EPMC7456905 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Explainable Prediction of Medical Codes With Knowledge Graphs.

Teng Fei F Yang Wei W Chen Li L Huang LuFei L Xu Qiang Q

Frontiers in bioengineering and biotechnology 20200814

International Classification of Diseases (ICD) is an authoritative health care classification system of different diseases. It is widely used for disease and health records, assisted medical reimbursement decisions, and collecting morbidity and mortality statistics. The most existing ICD coding models only translate the simple diagnosis descriptions into ICD codes. And it obscures the reasons and details behind specific diagnoses. Besides, the label (code) distribution is uneven. And there is a ...[more]

PMID: 32923430

Dataset Information

Explainable Prediction of Medical Codes With Knowledge Graphs.

Publications

Explainable Prediction of Medical Codes With Knowledge Graphs.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Accurate Prediction of Kinase-Substrate Networks Using Knowledge Graphs
2022-02-20 | PXD018905 | Pride

Accurate prediction of kinase-substrate networks using knowledge graphs.
| S-EPMC7738173 | biostudies-literature

KR4SL: knowledge graph reasoning for explainable prediction of synthetic lethality.
| S-EPMC10311291 | biostudies-literature

A knowledge empowered explainable gene ontology fingerprint approach to improve gene functional explication and prediction.
| S-EPMC10119605 | biostudies-literature

Disease ontologies for knowledge graphs.
| S-EPMC8296689 | biostudies-literature

Knowledge Graphs of Kawasaki Disease.
| S-EPMC7910781 | biostudies-literature

Constructing knowledge graphs and their biomedical applications.
| S-EPMC7327409 | biostudies-literature

Neuro-symbolic representation learning on biological knowledge graphs.
| S-EPMC5860058 | biostudies-literature

Chemical reaction network knowledge graphs: the OntoRXN ontology.
| S-EPMC9153116 | biostudies-literature

KG-Hub-building and exchanging biological knowledge graphs.
| S-EPMC10336030 | biostudies-literature