ABSTRACT:
SUBMITTER: Hooper SD
PROVIDER: S-EPMC2673424 | biostudies-literature | 2009 Apr
REPOSITORIES: biostudies-literature
Sean D Hooper, Iain J Anderson, Amrita Pati, Daniel Dalevi, Konstantinos Mavromatis, Nikos C Kyrpides
Nucleic Acids Research, 2009-02-17, issue 7
In order to simplify and meaningfully categorize large sets of protein sequence data, it is commonplace to cluster proteins based on the similarity of those sequences. However, it quickly becomes clear that the sequence flexibility allowed a given protein varies significantly among different protein families. The degree to which sequences are conserved not only differs for each protein family, but also is affected by the phylogenetic divergence of the source organisms. Clustering techniques that ...