Ontology highlight
ABSTRACT:
SUBMITTER: Li R
PROVIDER: S-EPMC7056916 | biostudies-literature | 2019 Oct
REPOSITORIES: biostudies-literature
Li Ruilin R He Xiaoyu X Dai Chuangchuang C Zhu Haidong H Lang Xianyu X Chen Wei W Li Xiaodong X Zhao Dan D Zhang Yu Y Han Xinyin X Niu Tie T Zhao Yi Y Cao Rongqiang R He Rong R Lu Zhonghua Z Chi Xuebin X Li Weizhong W Niu Beifang B
Genomics, proteomics & bioinformatics 20191001 5
The accelerating growth of the public microbial genomic data imposes substantial burden on the research community that uses such resources. Building databases for non-redundant reference sequences from massive microbial genomic data based on clustering analysis is essential. However, existing clustering algorithms perform poorly on long genomic sequences. In this article, we present Gclust, a parallel program for clustering complete or draft genomic sequences, where clustering is accelerated wit ...[more]