Unknown

Dataset Information

0

GOChase-II: correcting semantic inconsistencies from Gene Ontology-based annotations for gene products.


ABSTRACT: BACKGROUND: The Gene Ontology (GO) provides a controlled vocabulary for describing genes and gene products. In spite of the undoubted importance of GO, several drawbacks associated with GO and GO-based annotations have been introduced. We identified three types of semantic inconsistencies in GO-based annotations; semantically redundant, biological-domain inconsistent and taxonomy inconsistent annotations. METHODS: To determine the semantic inconsistencies in GO annotation, we used the hierarchical structure of GO graph and tree structure of NCBI taxonomy. Twenty seven biological databases were collected for finding semantic inconsistent annotation. RESULTS: The distributions and possible causes of the semantic inconsistencies were investigated using twenty seven biological databases with GO-based annotations. We found that some evidence codes of annotation were associated with the inconsistencies. The numbers of gene products and species in a database that are related to the complexity of database management are also in correlation with the inconsistencies. Consequently, numerous annotation errors arise and are propagated throughout biological databases and GO-based high-level analyses. GOChase-II is developed to detect and correct both syntactic and semantic errors in GO-based annotations. CONCLUSIONS: We identified some inconsistencies in GO-based annotation and provided software, GOChase-II, for correcting these semantic inconsistencies in addition to the previous corrections for the syntactic errors by GOChase-I.

SUBMITTER: Park YR 

PROVIDER: S-EPMC3044297 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

GOChase-II: correcting semantic inconsistencies from Gene Ontology-based annotations for gene products.

Park Yu Rang YR   Kim Jihun J   Lee Hye Won HW   Yoon Young Jo YJ   Kim Ju Han JH  

BMC bioinformatics 20110215


<h4>Background</h4>The Gene Ontology (GO) provides a controlled vocabulary for describing genes and gene products. In spite of the undoubted importance of GO, several drawbacks associated with GO and GO-based annotations have been introduced. We identified three types of semantic inconsistencies in GO-based annotations; semantically redundant, biological-domain inconsistent and taxonomy inconsistent annotations.<h4>Methods</h4>To determine the semantic inconsistencies in GO annotation, we used t  ...[more]

Similar Datasets

| S-EPMC2655092 | biostudies-literature
| S-EPMC3712327 | biostudies-literature
| S-EPMC4339680 | biostudies-literature
| S-EPMC5429001 | biostudies-literature
| S-EPMC3337258 | biostudies-literature
| S-EPMC2686450 | biostudies-literature
| S-EPMC2652876 | biostudies-literature
| S-EPMC3178059 | biostudies-literature
| S-EPMC4966780 | biostudies-literature
| S-EPMC4416163 | biostudies-literature