Unknown

Dataset Information

0

Predicting gene ontology biological process from temporal gene expression patterns.


ABSTRACT: The aim of the present study was to generate hypotheses on the involvement of uncharacterized genes in biological processes. To this end, supervised learning was used to analyze microarray-derived time-series gene expression data. Our method was objectively evaluated on known genes using cross-validation and provided high-precision Gene Ontology biological process classifications for 211 of the 213 uncharacterized genes in the data set used. In addition, new roles in biological process were hypothesized for known genes. Our method uses biological knowledge expressed by Gene Ontology and generates a rule model associating this knowledge with minimal characteristic features of temporal gene expression profiles. This model allows learning and classification of multiple biological process roles for each gene and can predict participation of genes in a biological process even though the genes of this class exhibit a wide variety of gene expression profiles including inverse coregulation. A considerable number of the hypothesized new roles for known genes were confirmed by literature search. In addition, many biological process roles hypothesized for uncharacterized genes were found to agree with assumptions based on homology information. To our knowledge, a gene classifier of similar scope and functionality has not been reported earlier.

SUBMITTER: Lagreid A 

PROVIDER: S-EPMC430886 | biostudies-literature | 2003 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting gene ontology biological process from temporal gene expression patterns.

Lagreid Astrid A   Hvidsten Torgeir R TR   Midelfart Herman H   Komorowski Jan J   Sandvik Arne K AK  

Genome research 20030414 5


The aim of the present study was to generate hypotheses on the involvement of uncharacterized genes in biological processes. To this end, supervised learning was used to analyze microarray-derived time-series gene expression data. Our method was objectively evaluated on known genes using cross-validation and provided high-precision Gene Ontology biological process classifications for 211 of the 213 uncharacterized genes in the data set used. In addition, new roles in biological process were hypo  ...[more]

Similar Datasets

| S-EPMC2982154 | biostudies-literature
| S-EPMC2233648 | biostudies-literature
| S-EPMC4542782 | biostudies-literature
| S-EPMC3712327 | biostudies-literature
| S-EPMC2719670 | biostudies-literature
| S-EPMC10459906 | biostudies-literature
| S-EPMC2246263 | biostudies-literature
| S-EPMC187511 | biostudies-literature
| S-EPMC1360676 | biostudies-literature
| S-EPMC10996487 | biostudies-literature