Unknown

Dataset Information

0

Using contextual and lexical features to restructure and validate the classification of biomedical concepts.


ABSTRACT:

Background

Biomedical ontologies are critical for integration of data from diverse sources and for use by knowledge-based biomedical applications, especially natural language processing as well as associated mining and reasoning systems. The effectiveness of these systems is heavily dependent on the quality of the ontological terms and their classifications. To assist in developing and maintaining the ontologies objectively, we propose automatic approaches to classify and/or validate their semantic categories. In previous work, we developed an approach using contextual syntactic features obtained from a large domain corpus to reclassify and validate concepts of the Unified Medical Language System (UMLS), a comprehensive resource of biomedical terminology. In this paper, we introduce another classification approach based on words of the concept strings and compare it to the contextual syntactic approach.

Results

The string-based approach achieved an error rate of 0.143, with a mean reciprocal rank of 0.907. The context-based and string-based approaches were found to be complementary, and the error rate was reduced further by applying a linear combination of the two classifiers. The advantage of combining the two approaches was especially manifested on test data with sufficient contextual features, achieving the lowest error rate of 0.055 and a mean reciprocal rank of 0.969.

Conclusion

The lexical features provide another semantic dimension in addition to syntactic contextual features that support the classification of ontological concepts. The classification errors of each dimension can be further reduced through appropriate combination of the complementary classifiers.

SUBMITTER: Fan JW 

PROVIDER: S-EPMC2014782 | biostudies-literature | 2007 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using contextual and lexical features to restructure and validate the classification of biomedical concepts.

Fan Jung-Wei JW   Xu Hua H   Friedman Carol C  

BMC bioinformatics 20070724


<h4>Background</h4>Biomedical ontologies are critical for integration of data from diverse sources and for use by knowledge-based biomedical applications, especially natural language processing as well as associated mining and reasoning systems. The effectiveness of these systems is heavily dependent on the quality of the ontological terms and their classifications. To assist in developing and maintaining the ontologies objectively, we propose automatic approaches to classify and/or validate the  ...[more]

Similar Datasets

| S-EPMC11244985 | biostudies-literature
| S-EPMC8579614 | biostudies-literature
| S-EPMC9911740 | biostudies-literature
| S-EPMC8236058 | biostudies-literature
| S-EPMC5890980 | biostudies-literature
| S-EPMC3282273 | biostudies-literature
| S-EPMC1810343 | biostudies-literature
| S-EPMC2572701 | biostudies-literature
| S-EPMC6764083 | biostudies-literature
| S-EPMC3623732 | biostudies-literature