Unknown

Dataset Information

0

Towards a framework for developing semantic relatedness reference standards.


ABSTRACT: Our objective is to develop a framework for creating reference standards for functional testing of computerized measures of semantic relatedness. Currently, research on computerized approaches to semantic relatedness between biomedical concepts relies on reference standards created for specific purposes using a variety of methods for their analysis. In most cases, these reference standards are not publicly available and the published information provided in manuscripts that evaluate computerized semantic relatedness measurement approaches is not sufficient to reproduce the results. Our proposed framework is based on the experiences of medical informatics and computational linguistics communities and addresses practical and theoretical issues with creating reference standards for semantic relatedness. We demonstrate the use of the framework on a pilot set of 101 medical term pairs rated for semantic relatedness by 13 medical coding experts. While the reliability of this particular reference standard is in the "moderate" range; we show that using clustering and factor analyses offers a data-driven approach to finding systematic differences among raters and identifying groups of potential outliers. We test two ontology-based measures of relatedness and provide both the reference standard containing individual ratings and the R program used to analyze the ratings as open-source. Currently, these resources are intended to be used to reproduce and compare results of studies involving computerized measures of semantic relatedness. Our framework may be extended to the development of reference standards in other research areas in medical informatics including automatic classification, information retrieval from medical records and vocabulary/ontology development.

SUBMITTER: Pakhomov SV 

PROVIDER: S-EPMC3063326 | biostudies-literature | 2011 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Towards a framework for developing semantic relatedness reference standards.

Pakhomov Serguei V S SV   Pedersen Ted T   McInnes Bridget B   Melton Genevieve B GB   Ruggieri Alexander A   Chute Christopher G CG  

Journal of biomedical informatics 20101031 2


Our objective is to develop a framework for creating reference standards for functional testing of computerized measures of semantic relatedness. Currently, research on computerized approaches to semantic relatedness between biomedical concepts relies on reference standards created for specific purposes using a variety of methods for their analysis. In most cases, these reference standards are not publicly available and the published information provided in manuscripts that evaluate computerized  ...[more]

Similar Datasets

| S-EPMC6526012 | biostudies-literature
| S-EPMC8277378 | biostudies-literature
| S-EPMC7403759 | biostudies-literature
| S-EPMC7403758 | biostudies-literature
| S-EPMC4943322 | biostudies-literature
| S-EPMC5489662 | biostudies-literature
| S-EPMC9203053 | biostudies-literature
| S-EPMC4255738 | biostudies-literature
| PRJNA1007404 | ENA
| PRJNA1007405 | ENA