Unknown

Dataset Information

0

New powerful statistics for alignment-free sequence comparison under a pattern transfer model.


ABSTRACT: Alignment-free sequence comparison is widely used for comparing gene regulatory regions and for identifying horizontally transferred genes. Recent studies on the power of a widely used alignment-free comparison statistic D2 and its variants D*2 and D(s)2 showed that their power approximates a limit smaller than 1 as the sequence length tends to infinity under a pattern transfer model. We develop new alignment-free statistics based on D2, D*2 and D(s)2 by comparing local sequence pairs and then summing over all the local sequence pairs of certain length. We show that the new statistics are much more powerful than the corresponding statistics and the power tends to 1 as the sequence length tends to infinity under the pattern transfer model.

SUBMITTER: Liu X 

PROVIDER: S-EPMC3146591 | biostudies-literature | 2011 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

New powerful statistics for alignment-free sequence comparison under a pattern transfer model.

Liu Xuemei X   Wan Lin L   Li Jing J   Reinert Gesine G   Waterman Michael S MS   Sun Fengzhu F  

Journal of theoretical biology 20110625 1


Alignment-free sequence comparison is widely used for comparing gene regulatory regions and for identifying horizontally transferred genes. Recent studies on the power of a widely used alignment-free comparison statistic D2 and its variants D*2 and D(s)2 showed that their power approximates a limit smaller than 1 as the sequence length tends to infinity under a pattern transfer model. We develop new alignment-free statistics based on D2, D*2 and D(s)2 by comparing local sequence pairs and then s  ...[more]

Similar Datasets

| S-EPMC2818754 | biostudies-literature
| S-EPMC3123933 | biostudies-literature
| S-EPMC4017329 | biostudies-literature
| S-EPMC3799466 | biostudies-literature
| S-EPMC6659240 | biostudies-literature
| S-EPMC4929450 | biostudies-literature
| S-EPMC5627421 | biostudies-literature
| S-EPMC6355110 | biostudies-literature
| S-EPMC3704055 | biostudies-literature
| S-EPMC4080745 | biostudies-literature