
Dataset Information


Numerical characterization of DNA sequence based on dinucleotides.


ABSTRACT: Sequence comparison is a primary technique for the analysis of DNA sequences. In order to make quantitative comparisons, one devises mathematical descriptors that capture the essence of the base composition and distribution of the sequence. Alignment methods and graphical techniques (where each sequence is represented by a curve in high-dimensional Euclidean space) have long been widely used. In this contribution we introduce a new nongraphical and nonalignment approach based on the frequencies of the dinucleotide XY in DNA sequences. The most important feature of this method is that it identifies not only adjacent XY pairs but also nonadjacent XY pairs, where X and Y are separated by some number of nucleotides. This methodology preserves information in the DNA sequence that is ignored by other methods. We test our method on the coding regions of exon-1 of β-globin for 11 species, and the utility of this new method is demonstrated.
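
The idea of counting both adjacent and nonadjacent XY pairs can be illustrated with a short Python sketch. This is not the authors' published descriptor: the function names `gapped_dinucleotide_freqs` and `descriptor`, the `gap` and `max_gap` parameters, and the normalization by the total number of counted pairs are all assumptions made here for illustration. A gap of 0 means X and Y are adjacent; a gap of k means k nucleotides sit between them.

```python
from collections import Counter
from itertools import product


def gapped_dinucleotide_freqs(seq, gap=0, alphabet="ACGT"):
    """Relative frequency of each pair XY where Y follows X after `gap` bases.

    Hypothetical helper: positions holding symbols outside `alphabet`
    (for example N) are skipped rather than counted.
    """
    seq = seq.upper()
    step = gap + 1  # distance between the positions of X and Y
    counts = Counter(
        seq[i] + seq[i + step]
        for i in range(len(seq) - step)
        if seq[i] in alphabet and seq[i + step] in alphabet
    )
    total = sum(counts.values())
    # Report all 16 pairs, including those that never occur.
    return {
        x + y: (counts[x + y] / total if total else 0.0)
        for x, y in product(alphabet, repeat=2)
    }


def descriptor(seq, max_gap=2):
    """Concatenate the 16 pair frequencies for gaps 0..max_gap (assumed layout)."""
    vec = []
    for gap in range(max_gap + 1):
        freqs = gapped_dinucleotide_freqs(seq, gap)
        vec.extend(freqs[pair] for pair in sorted(freqs))
    return vec


if __name__ == "__main__":
    # Toy sequence, not taken from the dataset.
    print(gapped_dinucleotide_freqs("ACGT", gap=1)["AG"])
    print(len(descriptor("ACGTACGT", max_gap=2)))
```

Two sequences could then be compared by any vector distance between their descriptors; which distance and which range of gaps the paper actually uses is not stated in this record.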

SUBMITTER: Qi X 

PROVIDER: S-EPMC3349307 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature


Publications

Numerical characterization of DNA sequence based on dinucleotides.

Xingqin Qi, Edgar Fuller, Qin Wu, Cun-Quan Zhang

TheScientificWorldJournal, 2012-04-24



Similar Datasets

| S-EPMC9344474 | biostudies-literature
| S-EPMC8412381 | biostudies-literature
| S-EPMC6442381 | biostudies-literature
2016-07-22 | GSE84696 | GEO
2019-01-20 | GSE116704 | GEO
| S-EPMC10100885 | biostudies-literature
| S-EPMC2689654 | biostudies-literature
| S-EPMC5054945 | biostudies-literature
| S-EPMC7913701 | biostudies-literature
| S-EPMC2292415 | biostudies-literature