Unknown

Dataset Information

0

Improved lower bounds of DNA tags based on a modified genetic algorithm.


ABSTRACT: The well-known massively parallel sequencing method is efficient and it can obtain sequence data from multiple individual samples. In order to ensure that sequencing, replication, and oligonucleotide synthesis errors do not result in tags (or barcodes) that are unrecoverable or confused, the tag sequences should be abundant and sufficiently different. Recently, many design methods have been proposed for correcting errors in data using error-correcting codes. The existing tag sets contain small tag sequences, so we used a modified genetic algorithm to improve the lower bound of the tag sets in this study. Compared with previous research, our algorithm is effective for designing sets of DNA tags. Moreover, the GC content determined by existing methods includes an imprecise range. Thus, we improved the GC content determination method to obtain tag sets that control the GC content in a more precise range. Finally, previous studies have only considered perfect self-complementarity. Thus, we considered the crossover between different tags and introduced an improved constraint into the design of tag sets.

SUBMITTER: Wang B 

PROVIDER: S-EPMC4332859 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improved lower bounds of DNA tags based on a modified genetic algorithm.

Wang Bin B   Wei Xiaopeng X   Dong Jing J   Zhang Qiang Q  

PloS one 20150218 2


The well-known massively parallel sequencing method is efficient and it can obtain sequence data from multiple individual samples. In order to ensure that sequencing, replication, and oligonucleotide synthesis errors do not result in tags (or barcodes) that are unrecoverable or confused, the tag sequences should be abundant and sufficiently different. Recently, many design methods have been proposed for correcting errors in data using error-correcting codes. The existing tag sets contain small t  ...[more]

Similar Datasets

| S-EPMC8796283 | biostudies-literature
| S-EPMC8658624 | biostudies-literature
| S-EPMC4778922 | biostudies-literature
| S-EPMC2576724 | biostudies-literature
| S-EPMC4427117 | biostudies-literature
| S-EPMC6409685 | biostudies-literature
| S-EPMC1904243 | biostudies-literature
| S-EPMC4690601 | biostudies-literature
| S-EPMC4158716 | biostudies-other
| S-EPMC4153912 | biostudies-literature