Unknown

Dataset Information

0

Seeds for effective oligonucleotide design.


ABSTRACT:

Background

DNA oligonucleotides are a very useful tool in biology. The best algorithms for designing good DNA oligonucleotides are filtering out unsuitable regions using a seeding approach. Determining the quality of the seeds is crucial for the performance of these algorithms.

Results

We present a sound framework for evaluating the quality of seeds for oligonucleotide design. The F - score is used to measure the accuracy of each seed. A number of natural candidates are tested: contiguous (BLAST-like), spaced, transitions-constrained, and multiple spaced seeds. Multiple spaced seeds are the best, with more seeds providing better accuracy. Single spaced and transition seeds are very close whereas, as expected, contiguous seeds come last. Increased accuracy comes at the price of reduced efficiency. An exception is that single spaced and transitions-constrained seeds are both more accurate and more efficient than contiguous ones.

Conclusions

Our work confirms another application where multiple spaced seeds perform the best. It will be useful in improving the algorithms for oligonucleotide design.

SUBMITTER: Ilie L 

PROVIDER: S-EPMC3128067 | biostudies-literature | 2011 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Seeds for effective oligonucleotide design.

Ilie Lucian L   Ilie Silvana S   Khoshraftar Shima S   Bigvand Anahita Mansouri AM  

BMC genomics 20110601


<h4>Background</h4>DNA oligonucleotides are a very useful tool in biology. The best algorithms for designing good DNA oligonucleotides are filtering out unsuitable regions using a seeding approach. Determining the quality of the seeds is crucial for the performance of these algorithms.<h4>Results</h4>We present a sound framework for evaluating the quality of seeds for oligonucleotide design. The F - score is used to measure the accuracy of each seed. A number of natural candidates are tested: co  ...[more]

Similar Datasets

| S-EPMC2788383 | biostudies-literature
| S-EPMC3648450 | biostudies-other
| S-EPMC1169010 | biostudies-literature
| S-EPMC6030895 | biostudies-literature
| S-EPMC8098674 | biostudies-literature
| S-EPMC7139664 | biostudies-literature
| S-EPMC2887943 | biostudies-literature
| S-EPMC441539 | biostudies-literature
| S-EPMC549428 | biostudies-literature
2008-07-26 | GSE12208 | GEO