Unknown

Dataset Information

0

Studying the evolution of promoter sequences: a waiting time problem.


ABSTRACT: To gain a better understanding of the evolutionary dynamics of regulatory DNA sequences, we address the following questions: (1) How long does it take until a given transcription factor (TF) binding site emerges at random in a promoter sequence? and (2) How does the composition of a TF binding site affect this waiting time? Using two different probabilistic models (an i.i.d. model and a neighbor dependent model), we can compute the expected waiting time for every k-mer, k ranging from 5 to 10, until it appears in a promoter of a species. Our findings indicate that new TF binding sites can be created on a short evolutionary time scale, i.e. in a time span below the speciation time of human and chimp. Furthermore, one can conclude that the composition of a TF binding site plays a crucial role concerning the waiting time until it appears and that the CpG methylation-deamination substitution process probably accelerates the creation of new TF binding sites. A screening of existing TF binding sites moreover reveals that k-mers predicted to have short waiting times occur more frequently than others. Supplementary Material is available at www.libertonline.com/cmb .

SUBMITTER: Behrens S 

PROVIDER: S-EPMC3119604 | biostudies-literature | 2010 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Studying the evolution of promoter sequences: a waiting time problem.

Behrens Sarah S   Vingron Martin M  

Journal of computational biology : a journal of computational molecular cell biology 20101201 12


To gain a better understanding of the evolutionary dynamics of regulatory DNA sequences, we address the following questions: (1) How long does it take until a given transcription factor (TF) binding site emerges at random in a promoter sequence? and (2) How does the composition of a TF binding site affect this waiting time? Using two different probabilistic models (an i.i.d. model and a neighbor dependent model), we can compute the expected waiting time for every k-mer, k ranging from 5 to 10, u  ...[more]

Similar Datasets

| S-EPMC8791639 | biostudies-literature
| S-EPMC6546376 | biostudies-literature
| S-EPMC5095216 | biostudies-literature
| S-EPMC6546378 | biostudies-literature
| S-EPMC6693195 | biostudies-literature
| S-EPMC2065895 | biostudies-literature
| S-EPMC9625104 | biostudies-literature
| S-EPMC7706098 | biostudies-literature
| S-EPMC1764036 | biostudies-literature
| S-EPMC106926 | biostudies-literature