Unknown

Dataset Information

0

Using domain knowledge and domain-inspired discourse model for coreference resolution for clinical narratives.


ABSTRACT:

Objective

This paper presents a coreference resolution system for clinical narratives. Coreference resolution aims at clustering all mentions in a single document to coherent entities.

Materials and methods

A knowledge-intensive approach for coreference resolution is employed. The domain knowledge used includes several domain-specific lists, a knowledge intensive mention parsing, and task informed discourse model. Mention parsing allows us to abstract over the surface form of the mention and represent each mention using a higher-level representation, which we call the mention's semantic representation (SR). SR reduces the mention to a standard form and hence provides better support for comparing and matching. Existing coreference resolution systems tend to ignore discourse aspects and rely heavily on lexical and structural cues in the text. The authors break from this tradition and present a discourse model for "person" type mentions in clinical narratives, which greatly simplifies the coreference resolution.

Results

This system was evaluated on four different datasets which were made available in the 2011 i2b2/VA coreference challenge. The unweighted average of F1 scores (over B-cubed, MUC and CEAF) varied from 84.2% to 88.1%. These experiments show that domain knowledge is effective for different mention types for all the datasets.

Discussion

Error analysis shows that most of the recall errors made by the system can be handled by further addition of domain knowledge. The precision errors, on the other hand, are more subtle and indicate the need to understand the relations in which mentions participate for building a robust coreference system.

Conclusion

This paper presents an approach that makes an extensive use of domain knowledge to significantly improve coreference resolution. The authors state that their system and the knowledge sources developed will be made publicly available.

SUBMITTER: Jindal P 

PROVIDER: S-EPMC3638172 | biostudies-literature | 2013 Mar-Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using domain knowledge and domain-inspired discourse model for coreference resolution for clinical narratives.

Jindal Prateek P   Roth Dan D  

Journal of the American Medical Informatics Association : JAMIA 20120710 2


<h4>Objective</h4>This paper presents a coreference resolution system for clinical narratives. Coreference resolution aims at clustering all mentions in a single document to coherent entities.<h4>Materials and methods</h4>A knowledge-intensive approach for coreference resolution is employed. The domain knowledge used includes several domain-specific lists, a knowledge intensive mention parsing, and task informed discourse model. Mention parsing allows us to abstract over the surface form of the  ...[more]

Similar Datasets

| S-EPMC5508069 | biostudies-literature
| S-EPMC3422821 | biostudies-literature
| S-EPMC7043320 | biostudies-literature
| S-EPMC4962668 | biostudies-literature
| S-EPMC4976126 | biostudies-literature
| S-EPMC3422831 | biostudies-other
| S-EPMC3422835 | biostudies-literature
| S-EPMC6506541 | biostudies-other
| S-EPMC3422837 | biostudies-other
| S-EPMC3855581 | biostudies-literature