Dataset Information

A new method to cluster DNA sequences using Fourier power spectrum.


ABSTRACT: A novel clustering method is proposed to classify genes and genomes. For a given DNA sequence, a binary indicator sequence is constructed for each nucleotide, and the discrete Fourier transform is applied to these four sequences to obtain their respective power spectra. Mathematical moments are built from these spectra, and multidimensional vectors of real numbers are constructed from these moments. Cluster analysis is then performed to determine the evolutionary relationship between DNA sequences. The novelty of this method is that sequences of different lengths can be compared easily through their power spectra and moments. Experimental results on various datasets show that the proposed method provides an efficient tool to classify genes and genomes. It gives results comparable to those of multiple sequence alignment and other alignment-free methods while being remarkably faster.
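The pipeline described in the abstract (indicator sequences, power spectra, moments, fixed-length vectors) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not give the moment formula, so the normalization in `moment_vector` (mean of `(PS(k)/N)**j` over `k = 1..N-1`), the choice of moment orders, the Euclidean distance, and all function names are assumptions made here. The transform is a naive O(N²) DFT to keep the sketch dependency-free; a real analysis would use an FFT routine.

```python
import cmath
import math

NUCLEOTIDES = "ACGT"


def indicator(seq, nucleotide):
    """Binary indicator sequence: 1 where seq has the nucleotide, else 0."""
    return [1 if base == nucleotide else 0 for base in seq.upper()]


def power_spectrum(signal):
    """Power spectrum |U(k)|^2 from a naive O(N^2) discrete Fourier transform."""
    n = len(signal)
    spectrum = []
    for k in range(n):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n)
            for t, x in enumerate(signal)
        )
        spectrum.append(abs(coeff) ** 2)
    return spectrum


def moment_vector(seq, orders=(1, 2, 3)):
    """Concatenate per-nucleotide spectral moments into one feature vector.

    ASSUMPTION: the j-th moment is the mean of (PS(k) / N)**j over
    k = 1..N-1. This normalization is illustrative only; the abstract
    does not state the formula used in the paper.
    """
    n = len(seq)
    if n < 2:
        raise ValueError("sequence must contain at least two bases")
    vector = []
    for nucleotide in NUCLEOTIDES:
        ps = power_spectrum(indicator(seq, nucleotide))[1:]  # skip k = 0
        for j in orders:
            vector.append(sum((p / n) ** j for p in ps) / len(ps))
    return vector


def pairwise_distances(sequences, orders=(1, 2, 3)):
    """Euclidean distance matrix between the moment vectors of the sequences."""
    vectors = [moment_vector(s, orders) for s in sequences]
    return [[math.dist(a, b) for b in vectors] for a in vectors]
```

Because every sequence maps to a vector of the same dimension (four nucleotides times the number of moment orders), sequences of different lengths can be compared directly. The resulting distance matrix could then be passed to any standard clustering routine; which one the authors use is not stated in the abstract.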

SUBMITTER: Hoang T 

PROVIDER: S-EPMC7094126 | biostudies-literature | 2015 May

REPOSITORIES: biostudies-literature

Publications

A new method to cluster DNA sequences using Fourier power spectrum.

Hoang Tung T, Yin Changchuan C, Zheng Hui H, Yu Chenglong C, Lucy He Rong R, Yau Stephen S-T SS

Journal of Theoretical Biology, 2015-03-05


Similar Datasets

| S-EPMC7094093 | biostudies-literature
| S-EPMC9997877 | biostudies-literature
| S-EPMC4383417 | biostudies-literature
| S-EPMC7094107 | biostudies-literature
| S-EPMC1618414 | biostudies-literature
| S-EPMC7094160 | biostudies-literature
| S-EPMC2661002 | biostudies-literature
| S-EPMC3447367 | biostudies-literature
| S-EPMC8639721 | biostudies-literature
| S-EPMC3290115 | biostudies-literature