Unknown

Dataset Information

0

Semi-automated ontology generation within OBO-Edit.


ABSTRACT: MOTIVATION: Ontologies and taxonomies have proven highly beneficial for biocuration. The Open Biomedical Ontology (OBO) Foundry alone lists over 90 ontologies mainly built with OBO-Edit. Creating and maintaining such ontologies is a labour-intensive, difficult, manual process. Automating parts of it is of great importance for the further development of ontologies and for biocuration. RESULTS: We have developed the Dresden Ontology Generator for Directed Acyclic Graphs (DOG4DAG), a system which supports the creation and extension of OBO ontologies by semi-automatically generating terms, definitions and parent-child relations from text in PubMed, the web and PDF repositories. DOG4DAG is seamlessly integrated into OBO-Edit. It generates terms by identifying statistically significant noun phrases in text. For definitions and parent-child relations it employs pattern-based web searches. We systematically evaluate each generation step using manually validated benchmarks. The term generation leads to high-quality terms also found in manually created ontologies. Up to 78% of definitions are valid and up to 54% of child-ancestor relations can be retrieved. There is no other validated system that achieves comparable results. By combining the prediction of high-quality terms, definitions and parent-child relations with the ontology editor OBO-Edit we contribute a thoroughly validated tool for all OBO ontology engineers. AVAILABILITY: DOG4DAG is available within OBO-Edit 2.1 at http://www.oboedit.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

SUBMITTER: Wachter T 

PROVIDER: S-EPMC2881373 | biostudies-literature | 2010 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Semi-automated ontology generation within OBO-Edit.

Wächter Thomas T   Schroeder Michael M  

Bioinformatics (Oxford, England) 20100601 12


<h4>Motivation</h4>Ontologies and taxonomies have proven highly beneficial for biocuration. The Open Biomedical Ontology (OBO) Foundry alone lists over 90 ontologies mainly built with OBO-Edit. Creating and maintaining such ontologies is a labour-intensive, difficult, manual process. Automating parts of it is of great importance for the further development of ontologies and for biocuration.<h4>Results</h4>We have developed the Dresden Ontology Generator for Directed Acyclic Graphs (DOG4DAG), a s  ...[more]

Similar Datasets

| S-EPMC3105495 | biostudies-literature
| S-EPMC3477006 | biostudies-literature
| S-EPMC3376233 | biostudies-literature
| S-EPMC2684543 | biostudies-literature
| S-EPMC9360406 | biostudies-literature
| S-EPMC7249269 | biostudies-literature
| S-EPMC2719631 | biostudies-literature
| S-EPMC2796125 | biostudies-literature
| S-EPMC3547776 | biostudies-literature
| S-EPMC9897586 | biostudies-literature