Unknown

Dataset Information

0

Algorithm for large-scale clustering across multiple genomes.


ABSTRACT: Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational methods that have been proposed defining gene clusters to identify gene families and operons. However, most of those algorithms are only available on a data set of small size. We developed an efficient gene clustering algorithm that can be applied on hundreds of genomes at the same time. This approach allows for large-scale study of evolutionary relationships of gene clusters and study of operon formation and destruction. An analysis of proposed algorithms shows that more biological insight can be obtained by analyzing gene clusters across hundreds of genomes, which can help us understand operon occurrences, gene orientations and gene rearrangements.

SUBMITTER: Yi G 

PROVIDER: S-EPMC3218420 | biostudies-other | 2011

REPOSITORIES: biostudies-other

altmetric image

Publications

Algorithm for large-scale clustering across multiple genomes.

Yi Gangman G   Jung Jaehee J  

Bioinformation 20111031 5


Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational methods that have been proposed defining gene clusters to identify gene families and operons. However, most of those algorithms are only available on a data set of small size. We developed an efficien  ...[more]

Similar Datasets

| S-EPMC3976248 | biostudies-literature
| S-EPMC1351371 | biostudies-literature
| S-EPMC6179193 | biostudies-literature
| S-EPMC2853685 | biostudies-other
| S-EPMC547898 | biostudies-literature
| S-EPMC1890301 | biostudies-literature
| S-EPMC7550569 | biostudies-literature
| S-BSST207 | biostudies-other