Ontology highlight
ABSTRACT:
SUBMITTER: Li B
PROVIDER: S-EPMC5172548 | biostudies-literature | 2005 Nov
REPOSITORIES: biostudies-literature
Li Bin B Li Yi Bing YB He Hong Bo HB
Genomics, proteomics & bioinformatics 20051101 4
DNA sequences can be treated as finite-length symbol strings over a four-letter alphabet (A, C, T, G). As a universal and computable complexity measure, LZ complexity is valid to describe the complexity of DNA sequences. In this study, a concept of conditional LZ complexity between two sequences is proposed according to the principle of LZ complexity measure. An LZ complexity distance metric between two nonnull sequences is defined by utilizing conditional LZ complexity. Based on LZ complexity d ...[more]