ABSTRACT:
SUBMITTER: Fu L
PROVIDER: S-EPMC3516142 | biostudies-literature | 2012 Dec
REPOSITORIES: biostudies-literature
Limin Fu, Beifang Niu, Zhengwei Zhu, Sitao Wu, Weizhong Li
Bioinformatics (Oxford, England), 2012-10-11, issue 23
Summary: CD-HIT is a widely used program for clustering biological sequences to reduce sequence redundancy and improve the performance of other sequence analyses. In response to the rapid increase in the amount of sequencing data produced by next-generation sequencing technologies, we have developed a new CD-HIT program, accelerated with a novel parallelization strategy and other techniques, to allow efficient clustering of such datasets. Our tests demonstrated very good speedup de ...