Dataset Information

The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver.


ABSTRACT: Here, we present a major update to the SUPERFAMILY database and its webserver. We describe the addition of a new SUPERFAMILY 2.0 profile HMM library containing a total of 27 623 HMMs. The database now includes Superfamily domain annotations for millions of protein sequences taken from the Universal Protein Resource Knowledgebase (UniProtKB) and the National Center for Biotechnology Information (NCBI). This addition comprises about 51 million and 45 million distinct protein sequences obtained from UniProtKB and NCBI, respectively. Currently, the database contains annotations for 63 244 and 102 151 complete genomes taken from UniProtKB and NCBI, respectively. The current sequence collection and genome update is the largest so far in the history of SUPERFAMILY updates. To deal with this wealth of information, we introduce a new SUPERFAMILY 2.0 webserver (http://supfam.org). Currently, the webserver mainly focuses on the search, retrieval and display of Superfamily annotations for the entire sequence and genome collection in the database.

SUBMITTER: Pandurangan AP 

PROVIDER: S-EPMC6324026 | biostudies-literature | 2019 Jan

REPOSITORIES: biostudies-literature

Publications

The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver.

Arun Prasad Pandurangan, Jonathan Stahlhacke, Matt E. Oates, Ben Smithers, Julian Gough

Nucleic Acids Research, 2019-01-01, issue D1


Similar Datasets

| S-EPMC8728302 | biostudies-literature
| S-EPMC4064129 | biostudies-literature
| S-EPMC5679007 | biostudies-literature
| S-EPMC5515284 | biostudies-literature
| S-EPMC3653121 | biostudies-literature
| S-EPMC4691341 | biostudies-literature
| S-EPMC5210635 | biostudies-literature
| S-EPMC3468816 | biostudies-literature
| S-EPMC7145695 | biostudies-literature
| S-EPMC3362740 | biostudies-literature