Ontology highlight
ABSTRACT:
SUBMITTER: Bussi Y
PROVIDER: S-EPMC8516232 | biostudies-literature | 2021
REPOSITORIES: biostudies-literature
Bussi Yuval Y Kapon Ruti R Reich Ziv Z
PloS one 20211014 10
Information theoretic approaches are ubiquitous and effective in a wide variety of bioinformatics applications. In comparative genomics, alignment-free methods, based on short DNA words, or k-mers, are particularly powerful. We evaluated the utility of varying k-mer lengths for genome comparisons by analyzing their sequence space coverage of 5805 genomes in the KEGG GENOME database. In subsequent analyses on four k-mer lengths spanning the relevant range (11, 21, 31, 41), hierarchical clustering ...[more]