Unknown

Dataset Information

0

A stitch in time: efficient computation of genomic DNA melting bubbles.


ABSTRACT:

Background

It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic.

Results

An efficient algorithm is described, which shows that the time complexity of the task is O(NlogN) rather than quadratic. The algorithm exploits that bubble lengths may be limited, but without a prior assumption of a maximal bubble length. No approximations, such as windowing, have been introduced to reduce the time complexity. More than just finding the bubbles, the algorithm produces a stitch profile, which is a probabilistic graphical model of bubbles and helical regions. The algorithm applies a probability peak finding method based on a hierarchical analysis of the energy barriers in the Poland-Scheraga model.

Conclusion

Exact and fast computation of genomic stitch profiles is thus feasible. Sequences of several megabases have been computed, only limited by computer memory. Possible applications are the genome-wide comparisons of bubbles with promotors, TSS, viral integration sites, and other melting-related regions.

SUBMITTER: Tostesen E 

PROVIDER: S-EPMC2553405 | biostudies-literature | 2008 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

A stitch in time: efficient computation of genomic DNA melting bubbles.

Tøstesen Eivind E  

Algorithms for molecular biology : AMB 20080717


<h4>Background</h4>It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic.<h4>Results</h4>An efficient algorithm is described, which shows that the time complexity of the task is O(NlogN) rather than quadratic. The algorithm exploits that bubble lengths may be limited, but without a prior  ...[more]

Similar Datasets

| S-EPMC2375138 | biostudies-literature
| S-EPMC5360225 | biostudies-literature
| S-EPMC10555835 | biostudies-literature
| S-EPMC5292573 | biostudies-literature
| S-EPMC3593895 | biostudies-other
| S-EPMC3593865 | biostudies-other
| S-EPMC4775097 | biostudies-literature
| S-EPMC10199258 | biostudies-literature
| S-EPMC3741346 | biostudies-literature
| S-EPMC8186806 | biostudies-literature