Unknown

Dataset Information

0

A stitch in time: efficient computation of genomic DNA melting bubbles.


ABSTRACT: BACKGROUND:It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic. RESULTS:An efficient algorithm is described, which shows that the time complexity of the task is O(NlogN) rather than quadratic. The algorithm exploits that bubble lengths may be limited, but without a prior assumption of a maximal bubble length. No approximations, such as windowing, have been introduced to reduce the time complexity. More than just finding the bubbles, the algorithm produces a stitch profile, which is a probabilistic graphical model of bubbles and helical regions. The algorithm applies a probability peak finding method based on a hierarchical analysis of the energy barriers in the Poland-Scheraga model. CONCLUSION:Exact and fast computation of genomic stitch profiles is thus feasible. Sequences of several megabases have been computed, only limited by computer memory. Possible applications are the genome-wide comparisons of bubbles with promotors, TSS, viral integration sites, and other melting-related regions.

SUBMITTER: Tostesen E 

PROVIDER: S-EPMC2553405 | biostudies-literature | 2008 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

A stitch in time: efficient computation of genomic DNA melting bubbles.

Tøstesen Eivind E  

Algorithms for molecular biology : AMB 20080717


<h4>Background</h4>It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic.<h4>Results</h4>An efficient algorithm is described, which shows that the time complexity of the task is O(NlogN) rather than quadratic. The algorithm exploits that bubble lengths may be limited, but without a prior  ...[more]

Similar Datasets

| S-EPMC2375138 | biostudies-literature
| S-EPMC5360225 | biostudies-literature
| S-EPMC5292573 | biostudies-literature
| S-EPMC4775097 | biostudies-literature
| S-EPMC3741346 | biostudies-literature
| S-EPMC8186806 | biostudies-literature
| S-EPMC3392737 | biostudies-literature
| S-EPMC2705278 | biostudies-literature