Ontology highlight
ABSTRACT:
SUBMITTER: Rappoport N
PROVIDER: S-EPMC3245180 | biostudies-literature | 2012 Jan
REPOSITORIES: biostudies-literature
Rappoport Nadav N Karsenty Solange S Stern Amos A Linial Nathan N Linial Michal M
Nucleic acids research 20111125 Database issue
ProtoNet 6.0 (http://www.protonet.cs.huji.ac.il) is a data structure of protein families that cover the protein sequence space. These families are generated through an unsupervised bottom-up clustering algorithm. This algorithm organizes large sets of proteins in a hierarchical tree that yields high-quality protein families. The 2012 ProtoNet (Version 6.0) tree includes over 9 million proteins of which 5.5% come from UniProtKB/SwissProt and the rest from UniProtKB/TrEMBL. The hierarchical tree s ...[more]