Unknown

Dataset Information

0

InterPro in 2019: improving coverage, classification and access to protein sequence annotations.


ABSTRACT: The InterPro database (http://www.ebi.ac.uk/interpro/) classifies protein sequences into families and predicts the presence of functionally important domains and sites. Here, we report recent developments with InterPro (version 70.0) and its associated software, including an 18% growth in the size of the database in terms on new InterPro entries, updates to content, the inclusion of an additional entry type, refined modelling of discontinuous domains, and the development of a new programmatic interface and website. These developments extend and enrich the information provided by InterPro, and provide greater flexibility in terms of data access. We also show that InterPro's sequence coverage has kept pace with the growth of UniProtKB, and discuss how our evaluation of residue coverage may help guide future curation activities.

SUBMITTER: Mitchell AL 

PROVIDER: S-EPMC6323941 | biostudies-literature | 2019 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

InterPro in 2019: improving coverage, classification and access to protein sequence annotations.

Mitchell Alex L AL   Attwood Teresa K TK   Babbitt Patricia C PC   Blum Matthias M   Bork Peer P   Bridge Alan A   Brown Shoshana D SD   Chang Hsin-Yu HY   El-Gebali Sara S   Fraser Matthew I MI   Gough Julian J   Haft David R DR   Huang Hongzhan H   Letunic Ivica I   Lopez Rodrigo R   Luciani AurĂ©lien A   Madeira Fabio F   Marchler-Bauer Aron A   Mi Huaiyu H   Natale Darren A DA   Necci Marco M   Nuka Gift G   Orengo Christine C   Pandurangan Arun P AP   Paysan-Lafosse Typhaine T   Pesseat Sebastien S   Potter Simon C SC   Qureshi Matloob A MA   Rawlings Neil D ND   Redaschi Nicole N   Richardson Lorna J LJ   Rivoire Catherine C   Salazar Gustavo A GA   Sangrador-Vegas Amaia A   Sigrist Christian J A CJA   Sillitoe Ian I   Sutton Granger G GG   Thanki Narmada N   Thomas Paul D PD   Tosatto Silvio C E SCE   Yong Siew-Yit SY   Finn Robert D RD  

Nucleic acids research 20190101 D1


The InterPro database (http://www.ebi.ac.uk/interpro/) classifies protein sequences into families and predicts the presence of functionally important domains and sites. Here, we report recent developments with InterPro (version 70.0) and its associated software, including an 18% growth in the size of the database in terms on new InterPro entries, updates to content, the inclusion of an additional entry type, refined modelling of discontinuous domains, and the development of a new programmatic in  ...[more]

Similar Datasets

| S-EPMC5210578 | biostudies-literature
| S-EPMC3170169 | biostudies-literature
| S-EPMC4383996 | biostudies-literature
| S-EPMC165493 | biostudies-literature
| S-EPMC5963392 | biostudies-literature
| S-EPMC4799721 | biostudies-literature
2022-12-14 | PXD032786 | Pride
| S-EPMC9134455 | biostudies-literature
| S-EPMC3405096 | biostudies-literature
| S-EPMC6612146 | biostudies-literature