Dataset Information

iProClass: an integrated database of protein family, function and structure information.


ABSTRACT: The iProClass database provides comprehensive, value-added descriptions of proteins and serves as a framework for data integration in a distributed networking environment. The protein information in iProClass includes family relationships as well as structural and functional classifications and features. The current version consists of about 830 000 non-redundant PIR-PSD, SWISS-PROT, and TrEMBL proteins organized into more than 36 000 PIR superfamilies, 145 000 families, 4000 domains, 1300 motifs and 550 000 FASTA similarity clusters. It provides rich links to over 50 databases of protein sequences, families, functions and pathways, protein-protein interactions, post-translational modifications, protein expression, structures and structural classifications, genes and genomes, ontologies, literature and taxonomy. Protein and superfamily summary reports present extensive annotation information and include membership statistics and graphical displays of domains and motifs. iProClass employs an open and modular architecture for interoperability and scalability. It is implemented in the Oracle object-relational database system and is updated biweekly. The database is freely accessible from the web site at http://pir.georgetown.edu/iproclass/ and is searchable by sequence or text string. The data integration in iProClass supports exploration of protein relationships. Such knowledge is fundamental to the understanding of protein evolution, structure and function, and crucial to functional genomic and proteomic research.

SUBMITTER: Huang H 

PROVIDER: S-EPMC165491 | biostudies-literature | 2003 Jan

REPOSITORIES: biostudies-literature

Publications

iProClass: an integrated database of protein family, function and structure information.

Hongzhan Huang, Winona C. Barker, Yongxing Chen, Cathy H. Wu

Nucleic Acids Research, 1 January 2003, issue 1


Similar Datasets

| S-EPMC3000042 | biostudies-literature
| S-EPMC8652034 | biostudies-literature
| S-EPMC6928464 | biostudies-literature
| S-EPMC5998897 | biostudies-literature
| S-EPMC1217580 | biostudies-other
| S-EPMC5444566 | biostudies-literature
| S-EPMC6897987 | biostudies-literature
| S-EPMC4806539 | biostudies-literature
| S-EPMC147243 | biostudies-other
| S-EPMC102450 | biostudies-literature