Ontology highlight
ABSTRACT:
SUBMITTER: Li R
PROVIDER: S-EPMC7056916 | biostudies-literature | 2019 Oct
REPOSITORIES: biostudies-literature

Genomics, proteomics & bioinformatics 20191001 5
The accelerating growth of the public microbial genomic data imposes substantial burden on the research community that uses such resources. Building databases for non-redundant reference sequences from massive microbial genomic data based on clustering analysis is essential. However, existing clustering algorithms perform poorly on long genomic sequences. In this article, we present Gclust, a parallel program for clustering complete or draft genomic sequences, where clustering is accelerated wit ...[more]