Dataset Information

Anaphoric relations in the clinical narrative: corpus creation.

ABSTRACT:

Objective

The long-term goal of this work is the automated discovery of anaphoric relations from the clinical narrative. The creation of a gold standard set from a cross-institutional corpus of clinical notes and high-level characteristics of that gold standard are described.

Methods

A standard methodology for annotation guideline development, gold standard annotations, and inter-annotator agreement (IAA) was used.

Results

The gold standard annotations resulted in 7214 markables, 5992 pairs, and 1304 chains. Each report averaged 40 anaphoric markables, 33 pairs, and seven chains. The overall IAA is high on the Mayo dataset (0.6607), and moderate on the University of Pittsburgh Medical Center (UPMC) dataset (0.4072). The IAA between each annotator and the gold standard is high (Mayo: 0.7669, 0.7697, and 0.9021; UPMC: 0.6753 and 0.7138). These results imply a quality corpus feasible for system development. They also suggest the complementary nature of the annotations performed by the experts and the importance of an annotator team with diverse knowledge backgrounds.

Limitations

Only one of the annotators had the linguistic background necessary for annotation of the linguistic attributes. The overall generalizability of the guidelines will be further strengthened by annotations of data from additional sites. This will increase the overall corpus size and the representation of each relation type.

Conclusion

The first step toward the development of an anaphoric relation resolver as part of a comprehensive natural language processing system geared specifically for the clinical narrative in the electronic medical record is described. The deidentified annotated corpus will be available to researchers.

SUBMITTER: Savova GK

PROVIDER: S-EPMC3128403 | biostudies-literature | 2011 Jul-Aug

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Anaphoric relations in the clinical narrative: corpus creation.

Savova Guergana K GK Chapman Wendy W WW Zheng Jiaping J Crowley Rebecca S RS

Journal of the American Medical Informatics Association : JAMIA 20110401 4

<h4>Objective</h4>The long-term goal of this work is the automated discovery of anaphoric relations from the clinical narrative. The creation of a gold standard set from a cross-institutional corpus of clinical notes and high-level characteristics of that gold standard are described.<h4>Methods</h4>A standard methodology for annotation guideline development, gold standard annotations, and inter-annotator agreement (IAA) was used.<h4>Results</h4>The gold standard annotations resulted in 7214 mark ...[more]

PMID: 21459927

Similar Datasets

Project description:BackgroundMany new medicines have been derived from natural sources such as plants, which have a long history of being used for disease treatment. Thus, their benefits and side effects have been studied, and plant-related information including plant and disease relations have been accumulated in Medline articles. Because numerous articles are available in Medline and are written in natural language, text-mining is important. However, a corpus of plant and disease relations is not available yet. Thus, we aimed to construct such a corpus.Methods and resultsIn this study, we designed and annotated a plant-disease relations corpus, and proposed a computational model to predict plant-disease relations using the corpus. We categorized plant and disease relations into four types: treatments of diseases, causes of diseases, associations, and negative relations. To construct a corpus of plant-disease relations, we first created its annotation guidelines and randomly selected 200 Medline abstracts. From these abstracts, we identified 1,405 and 1,755 plant and disease mentions, annotated to 105 and 237 unique plant and disease identifiers, respectively. When we selected sentences containing at least one plant and one disease mention, we extracted 878 plant and 1,077 disease entities, which finally generated a corpus of plant-disease relations including 1,309 relations from 199 abstracts. To verify the effectiveness of the corpus, we proposed a convolutional neural network model with the shortest dependency path (SDP-CNN) and applied it to the constructed corpus. The micro F-score with ten-fold cross-validation was found to be 0.764. We also applied the proposed SDP-CNN model to all Medline abstracts. When we measured its performance for 483 randomly selected plant-disease co-occurring sentences, the model showed a precision of 0.707.ConclusionThe plant-disease relations corpus is unique and represents an important resource for biomedical text-mining. The corpus of plant and disease relations is available at http://gcancer.org/pdr/.

Project description:Background and objectiveSurgical creation of arteriovenous fistulas (AVF) and grafts (AVG) continues to be the mainstay access for hemodialysis (HD). Avoidance of dependence on dialysis catheters continues to be a worldwide mission in dialysis access. Importantly, there is no one-size-fits-all approach to hemodialysis access and each patient should undergo access creation that is patient-centered. The aim of this paper is to review the literature, current guidelines, and discuss the common types of upper extremity hemodialysis access and their reported outcomes. We will also share our institutional experience regarding the surgical creation of upper extremity hemodialysis access.MethodsThe literature review incorporates twenty-seven relevant articles from 1997 to present and one case report series from 1966. Sources were gathered from electronic databases including PubMed, EMBASE, Medline, and Google Scholar. Only articles written in the English language were considered and study designs varied from current clinical guidelines, systematic and meta-analyses, randomized controlled trials, observational studies, and two main vascular surgery textbooks.Key content and findingsThis review exclusively focuses on the surgical creation of upper extremity hemodialysis accesses. Creating a graft versus fistula ultimately is decided by the existing anatomy, and is centered around the need of the patient. Preoperatively, the patient should undergo a thorough history and physical exam, with special attention to any previous central venous access, as well as, delineating the vascular anatomy with ultrasound imaging. The major tenets of access creation are choosing the most distal site of the non-dominant upper extremity whenever possible; and ideally creation of an autogenous access is preferred over a prosthetic graft. Described in this review are multiple surgical approaches for upper extremity hemodialysis access creation and associated institutional practices performed by the surgeon author. In the postoperative period, follow up care and surveillance are imperative to preserve a functioning access.ConclusionsThe most recent guidelines regarding hemodialysis access still favor arteriovenous fistula as the primary goal for patients with suitable anatomy. Preoperative evaluation including patient education, intraoperative ultrasound assessment, meticulous technique, and careful postoperative management are all paramount for successful access surgery. Dialysis access remains quite challenging, but with diligence the great majority of patients can be dialyzed without catheter dependence.

Project description:Previous research has shown that humor and self-presentation are linked in several ways. With regard to individual differences, it turned out that gelotophilia (the joy of being laughed at) and katagelasticism (the joy of laughing at others) are substantially associated with the histrionic self-presentation style that is characterized by performing explicit As-If-behaviors (e.g., irony, parodying others) in everyday interactions. By contrast, gelotophobia (the fear of being laughed at) shows a negative correlation with histrionic self-presentation. In order to further contribute to the nomological network, we have explored whether the three dispositions toward ridicule and laughter as well as histrionic self-presentation are related to humor creation abilities. In doing so, we have assessed the four constructs in a study with 337 participants that also completed the Cartoon Punch line Production Test (CPPT, Köhler and Ruch, 1993, unpublished). In the CPPT, subjects were asked to generate as many funny punch lines as possible for six caption-removed cartoons. The created punch lines were then analyzed with regard to quantitative (e.g., number of punch lines) and qualitative (e.g., wittiness of the punch lines and overall wittiness of the person as evaluated by three independent raters) humor creation abilities. Results show that both gelotophilia and histrionic self-presentation were positively correlated with quantitative and qualitative humor creation abilities. By contrast, gelotophobia showed slightly negative and katagelasticism no associations with the assessed humor creation abilities. These findings especially apply to the subgroup of participants that created punch lines for each of the six cartoons and partly replicate and extend the results of a previous study by Ruch et al. (2009). Altogether, the results of our study show that individual differences in humor-related traits are associated with the quantity and quality of humorous punch lines. It is argued that behavior-related or performative humor creation tasks should be considered in addition to the CPPT in order to open up new avenues that can cross-fertilize research on individual differences in humor and self-presentation.

Project description:Objective: The resource group method intends to promote patients' agency and self-management and to organize meaningful partnerships between patients and their informal and formal support systems. The aim of this study was to enhance the understanding of interpersonal dynamics that arise within resource groups for people with severe mental illness. Insight into these unfolding processes would enable improved implementation of the resource group method so that it contributes to establishing a positive social environment, which can lead to more enduring recovery. Methodology: We performed a narrative analysis of transcripts and field notes obtained in a longitudinal, qualitative study on the resource group method. The stories of four different resource groups were reconstructed and analyzed in depth. Data included a total of 36 interviews (with patients, significant others, and mental health professionals) and 18 observations of resource group meetings. Results: The degree to which the resource group method actually contributes to recovery was based on the extent to which the existing roles of and patterns between the patient and his/her resource group members were altered. Breaking through old patterns of inequality and the joint search for a new balance in relationships proved to be crucial processes for establishing an empowering resource group. The four cases showed that it takes time, patience, and small steps back and forth to overcome the struggles and fears related to finding new ways of relating to each other. An honest and reflective atmosphere in which all participants are encouraged to participate and be curious about themselves and each other is essential for changes in interpersonal dynamics to emerge. Such changes pave the way for individuals with SMI to find their own voices and pursue their unique recovery journeys. Conclusions: The functioning of the resource group and the ability of the involved members to respond in new ways are important when working toward the patient's recovery goals. The resource group method should therefore not be considered an intervention to organize informal support for the patient, but a platform to expose and adjust the functioning of the patient's social network as a whole.

Dataset Information

Anaphoric relations in the clinical narrative: corpus creation.

Objective

Methods

Results

Limitations

Conclusion

Publications

Anaphoric relations in the clinical narrative: corpus creation.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets