Unknown

Dataset Information

0

GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms.


ABSTRACT: Measuring the semantic similarity between Gene Ontology (GO) terms is an essential step in functional bioinformatics research. We implemented a software named GOGO for calculating the semantic similarity between GO terms. GOGO has the advantages of both information-content-based and hybrid methods, such as Resnik's and Wang's methods. Moreover, GOGO is relatively fast and does not need to calculate information content (IC) from a large gene annotation corpus but still has the advantage of using IC. This is achieved by considering the number of children nodes in the GO directed acyclic graphs when calculating the semantic contribution of an ancestor node giving to its descendent nodes. GOGO can calculate functional similarities between genes and then cluster genes based on their functional similarities. Evaluations performed on multiple pathways retrieved from the saccharomyces genome database (SGD) show that GOGO can accurately and robustly cluster genes based on functional similarities. We release GOGO as a web server and also as a stand-alone tool, which allows convenient execution of the tool for a small number of GO terms or integration of the tool into bioinformatics pipelines for large-scale calculations. GOGO can be freely accessed or downloaded from http://dna.cs.miami.edu/GOGO/ .

SUBMITTER: Zhao C 

PROVIDER: S-EPMC6180005 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms.

Zhao Chenguang C   Wang Zheng Z  

Scientific reports 20181010 1


Measuring the semantic similarity between Gene Ontology (GO) terms is an essential step in functional bioinformatics research. We implemented a software named GOGO for calculating the semantic similarity between GO terms. GOGO has the advantages of both information-content-based and hybrid methods, such as Resnik's and Wang's methods. Moreover, GOGO is relatively fast and does not need to calculate information content (IC) from a large gene annotation corpus but still has the advantage of using  ...[more]

Similar Datasets

| S-EPMC6350067 | biostudies-literature
| S-EPMC2935448 | biostudies-literature
| S-EPMC4966780 | biostudies-literature
| S-EPMC2655092 | biostudies-literature
| S-EPMC3314586 | biostudies-literature
| S-EPMC3599846 | biostudies-literature
| S-EPMC2518162 | biostudies-literature
| S-EPMC3904913 | biostudies-literature