Unknown

Dataset Information

0

Gene Ontology Enrichment Improves Performances of Functional Similarity of Genes.


ABSTRACT: There exists a plethora of measures to evaluate functional similarity (FS) between genes, which is a widely used in many bioinformatics applications including detecting molecular pathways, identifying co-expressed genes, predicting protein-protein interactions, and prioritization of disease genes. Measures of FS between genes are mostly derived from Information Contents (IC) of Gene Ontology (GO) terms annotating the genes. However, existing measures evaluating IC of terms based either on the representations of terms in the annotating corpus or on the knowledge embedded in the GO hierarchy do not consider the enrichment of GO terms by the querying pair of genes. The enrichment of a GO term by a pair of gene is dependent on whether the term is annotated by one gene (i.e., partial annotation) or by both genes (i.e. complete annotation) in the pair. In this paper, we propose a method that incorporate enrichment of GO terms by a gene pair in computing their FS and show that GO enrichment improves the performances of 46 existing FS measures in the prediction of sequence homologies, gene expression correlations, protein-protein interactions, and disease associated genes.

SUBMITTER: Liu W 

PROVIDER: S-EPMC6092333 | biostudies-literature | 2018 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Gene Ontology Enrichment Improves Performances of Functional Similarity of Genes.

Liu Wenting W   Liu Jianjun J   Rajapakse Jagath C JC  

Scientific reports 20180814 1


There exists a plethora of measures to evaluate functional similarity (FS) between genes, which is a widely used in many bioinformatics applications including detecting molecular pathways, identifying co-expressed genes, predicting protein-protein interactions, and prioritization of disease genes. Measures of FS between genes are mostly derived from Information Contents (IC) of Gene Ontology (GO) terms annotating the genes. However, existing measures evaluating IC of terms based either on the re  ...[more]

Similar Datasets

| S-EPMC2518162 | biostudies-literature
| S-EPMC6760551 | biostudies-literature
| S-EPMC1559652 | biostudies-literature
| S-EPMC5006308 | biostudies-literature
| S-EPMC1940007 | biostudies-literature
| S-EPMC5359872 | biostudies-literature
| S-EPMC9251090 | biostudies-literature
| S-EPMC4140130 | biostudies-literature
| S-EPMC7347127 | biostudies-literature
| S-EPMC4966780 | biostudies-literature