Unknown

Dataset Information

0

Similarity Estimation Between DNA Sequences Based on Local Pattern Histograms of Binary Images.


ABSTRACT: Graphical representation of DNA sequences is one of the most popular techniques for alignment-free sequence comparison. Here, we propose a new method for the feature extraction of DNA sequences represented by binary images, by estimating the similarity between DNA sequences using the frequency histograms of local bitmap patterns of images. Our method shows linear time complexity for the length of DNA sequences, which is practical even when long sequences, such as whole genome sequences, are compared. We tested five distance measures for the estimation of sequence similarities, and found that the histogram intersection and Manhattan distance are the most appropriate ones for phylogenetic analyses.

SUBMITTER: Kobori Y 

PROVIDER: S-EPMC4880953 | biostudies-literature | 2016 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Similarity Estimation Between DNA Sequences Based on Local Pattern Histograms of Binary Images.

Kobori Yusei Y   Mizuta Satoshi S  

Genomics, proteomics & bioinformatics 20160427 2


Graphical representation of DNA sequences is one of the most popular techniques for alignment-free sequence comparison. Here, we propose a new method for the feature extraction of DNA sequences represented by binary images, by estimating the similarity between DNA sequences using the frequency histograms of local bitmap patterns of images. Our method shows linear time complexity for the length of DNA sequences, which is practical even when long sequences, such as whole genome sequences, are comp  ...[more]

Similar Datasets

| S-EPMC5513428 | biostudies-other
| S-EPMC10627294 | biostudies-literature
| S-EPMC5526147 | biostudies-other
2011-11-23 | E-GEOD-33899 | biostudies-arrayexpress
2011-11-23 | GSE33899 | GEO
2010-06-20 | E-GEOD-12542 | biostudies-arrayexpress
| S-EPMC2238722 | biostudies-literature
| S-EPMC10280544 | biostudies-literature
2008-09-02 | GSE12542 | GEO
| S-EPMC4331808 | biostudies-literature