Ontology highlight
ABSTRACT:
SUBMITTER: Wright E
PROVIDER: S-EPMC11001989 | biostudies-literature | 2024 Apr
REPOSITORIES: biostudies-literature
Nature communications 20240408 1
Clustering biological sequences into similar groups is an increasingly important task as the number of available sequences continues to grow exponentially. Search-based approaches to clustering scale super-linearly with the number of input sequences, making it impractical to cluster very large sets of sequences. Approaches to clustering sequences in linear time currently lack the accuracy of super-linear approaches. Here, I set out to develop and characterize a strategy for clustering with linea ...[more]