Unknown

Dataset Information

0

I-Genome: a database to summarize oligonucleotide data in genomes.


ABSTRACT:

Background

Information on the occurrence of sequence features in genomes is crucial to comparative genomics, evolutionary analysis, the analyses of regulatory sequences and the quantitative evaluation of sequences. Computing the frequencies and the occurrences of a pattern in complete genomes is time-consuming.

Results

The proposed database provides information about sequence features generated by exhaustively computing the sequences of the complete genome. The repetitive elements in the eukaryotic genomes, such as LINEs, SINEs, Alu and LTR, are obtained from Repbase. The database supports various complete genomes including human, yeast, worm, and 128 microbial genomes.

Conclusions

This investigation presents and implements an efficiently computational approach to accumulate the occurrences of the oligonucleotides or patterns in complete genomes. A database is established to maintain the information of the sequence features, including the distributions of oligonucleotide, the gene distribution, the distribution of repetitive elements in genomes and the occurrences of the oligonucleotides. The database can provide more effective and efficient way to access the repetitive features in genomes.

SUBMITTER: Lin FM 

PROVIDER: S-EPMC526275 | biostudies-literature | 2004 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

i-Genome: a database to summarize oligonucleotide data in genomes.

Lin Feng-Mao FM   Huang Hsien-Da HD   Chang Yu-Chung YC   Horng Jorng-Tzong JT  

BMC genomics 20041009


<h4>Background</h4>Information on the occurrence of sequence features in genomes is crucial to comparative genomics, evolutionary analysis, the analyses of regulatory sequences and the quantitative evaluation of sequences. Computing the frequencies and the occurrences of a pattern in complete genomes is time-consuming.<h4>Results</h4>The proposed database provides information about sequence features generated by exhaustively computing the sequences of the complete genome. The repetitive elements  ...[more]

Similar Datasets

| S-EPMC5502365 | biostudies-literature
| S-EPMC5210664 | biostudies-literature
| S-EPMC9903328 | biostudies-literature
| S-EPMC3965049 | biostudies-literature
| S-EPMC7703759 | biostudies-literature
| S-EPMC10767852 | biostudies-literature
| S-EPMC539972 | biostudies-literature
| S-EPMC9976658 | biostudies-literature
| S-EPMC6444255 | biostudies-literature
| S-EPMC1274298 | biostudies-literature