Unknown

Dataset Information

0

Complexity: an internet resource for analysis of DNA sequence complexity.


ABSTRACT: The search for DNA regions with low complexity is one of the pivotal tasks of modern structural analysis of complete genomes. The low complexity may be preconditioned by strong inequality in nucleotide content (biased composition), by tandem or dispersed repeats or by palindrome-hairpin structures, as well as by a combination of all these factors. Several numerical measures of textual complexity, including combinatorial and linguistic ones, together with complexity estimation using a modified Lempel-Ziv algorithm, have been implemented in a software tool called 'Complexity' (http://wwwmgs.bionet.nsc.ru/mgs/programs/low_complexity/). The software enables a user to search for low-complexity regions in long sequences, e.g. complete bacterial genomes or eukaryotic chromosomes. In addition, it estimates the complexity of groups of aligned sequences.

SUBMITTER: Orlov YL 

PROVIDER: S-EPMC441604 | biostudies-literature | 2004 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Complexity: an internet resource for analysis of DNA sequence complexity.

Orlov Y L YL   Potapov V N VN  

Nucleic acids research 20040701 Web Server issue


The search for DNA regions with low complexity is one of the pivotal tasks of modern structural analysis of complete genomes. The low complexity may be preconditioned by strong inequality in nucleotide content (biased composition), by tandem or dispersed repeats or by palindrome-hairpin structures, as well as by a combination of all these factors. Several numerical measures of textual complexity, including combinatorial and linguistic ones, together with complexity estimation using a modified Le  ...[more]

Similar Datasets

| S-EPMC6044401 | biostudies-literature
| S-EPMC5343954 | biostudies-literature
| S-EPMC2953079 | biostudies-literature
| S-EPMC7770047 | biostudies-literature
| S-EPMC6933851 | biostudies-literature
2022-04-25 | GSE196696 | GEO
| S-EPMC9798383 | biostudies-literature
| S-EPMC4491172 | biostudies-literature
| PRJNA806895 | ENA
| S-EPMC3531071 | biostudies-literature