Unknown

Dataset Information

0

DIAN: a novel algorithm for genome ontological classification.


ABSTRACT: Faced with the determination of many completely sequenced genomes, computational biology is now faced with the challenge of interpreting the significance of these data sets. A multiplicity of data-related problems impedes this goal: Biological annotations associated with raw data are often not normalized, and the data themselves are often poorly interrelated and their interpretation unclear. All of these problems make interpretation of genomic databases increasingly difficult. With the current explosion of sequences now available from the human genome as well as from model organisms, the importance of sorting this vast amount of conceptually unstructured source data into a limited universe of genes, proteins, functions, structures, and pathways has become a bottleneck for the field. To address this problem, we have developed a method of interrelating data sources by applying a novel method of associating biological objects to ontologies. We have developed an intelligent knowledge-based algorithm, to support biological knowledge mapping, and, in particular, to facilitate the interpretation of genomic data. In this respect, the method makes it possible to inventory genomes by collapsing multiple types of annotations and normalizing them to various ontologies. By relying on a conceptual view of the genome, researchers can now easily navigate the human genome in a biologically intuitive, scientifically accurate manner.

SUBMITTER: Pouliot Y 

PROVIDER: S-EPMC311153 | biostudies-literature | 2001 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

DIAN: a novel algorithm for genome ontological classification.

Pouliot Y Y   Gao J J   Su Q J QJ   Liu G G GG   Ling X B XB  

Genome research 20011001 10


Faced with the determination of many completely sequenced genomes, computational biology is now faced with the challenge of interpreting the significance of these data sets. A multiplicity of data-related problems impedes this goal: Biological annotations associated with raw data are often not normalized, and the data themselves are often poorly interrelated and their interpretation unclear. All of these problems make interpretation of genomic databases increasingly difficult. With the current e  ...[more]

Similar Datasets

| S-EPMC4040517 | biostudies-literature
| S-EPMC3125788 | biostudies-literature
| S-EPMC4537225 | biostudies-literature
| S-EPMC3973683 | biostudies-literature
| S-EPMC8444070 | biostudies-literature
| S-EPMC9137854 | biostudies-literature
| S-EPMC4814818 | biostudies-literature
| S-EPMC7785992 | biostudies-literature
2015-04-29 | GSE57162 | GEO
2005-05-01 | GSE2187 | GEO