Dataset Information

PASS2: an automated database of protein alignments organised as structural superfamilies.

ABSTRACT:

Background

The functional selection and three-dimensional structural constraints of proteins in nature often relate to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments for distantly related proteins provides a map of the conserved and critical regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination. The Protein Alignment organised as Structural Superfamily (PASS2) database represents continuously updated structural alignments for evolutionarily related, sequentially distant proteins.

Description

An automated and updated version of PASS2 is in direct correspondence with SCOP 1.63 and consists of sequences with less than 40% identity among themselves. Protein domains have been grouped into 628 multi-member superfamilies and 566 single-member superfamilies. Structure-based sequence alignments for the superfamilies have been obtained using COMPARER, while initial equivalences have been derived from a preliminary superposition using LSQMAN or STAMP 4.0. The final sequence alignments have been annotated for structural features using JOY4.0. The database is supplemented with sequence relatives belonging to different genomes, conserved spatially interacting and structural motifs, probabilistic hidden Markov models of superfamilies based on the alignments, and useful links to other databases. Probabilistic models and sensitive position-specific profiles obtained from reliable superfamily alignments aid annotation of remote homologues and are useful tools in structural and functional genomics. PASS2 presents the phylogeny of its members based on both sequence and structural dissimilarities. Clustering of members allows us to understand the diversification of the family members. The search engine has been improved for simpler browsing of the database.
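The 40% identity criterion mentioned above can be illustrated with a minimal sketch. It assumes that identity is counted over alignment columns where both sequences carry a residue; the record does not state which definition PASS2 uses, so this denominator is an assumption. The function names and the toy sequences are hypothetical and are not taken from the database.

```python
from itertools import combinations


def percent_identity(a: str, b: str, gap: str = "-") -> float:
    """Percent identity over columns where both aligned sequences have a residue.

    Assumption: gap-containing columns are excluded from the denominator.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not pairs:
        return 0.0
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs)


def pairs_at_or_above(alignment: dict[str, str], cutoff: float = 40.0):
    """Return (name1, name2, identity) for member pairs at or above the cutoff."""
    flagged = []
    for (n1, s1), (n2, s2) in combinations(alignment.items(), 2):
        identity = percent_identity(s1, s2)
        if identity >= cutoff:
            flagged.append((n1, n2, identity))
    return flagged


# Toy aligned sequences, invented for illustration only.
toy = {
    "domA": "MKV-LTA",
    "domB": "MKVALSA",
    "domC": "QRT-WPE",
}

for name1, name2, identity in pairs_at_or_above(toy):
    print(f"{name1} vs {name2}: {identity:.1f}% identity")
```

In a set filtered as the description states, no pair of members would be flagged by such a check; the toy data deliberately includes one pair above the cutoff to show what would be reported.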

Conclusions

The database resolves alignments among the structural domains consisting of an evolutionarily diverged set of sequences. The availability of reliable sequence alignments of distantly related proteins, despite poor sequence identity, together with single-member superfamilies, permits better sampling of structures in libraries for fold recognition of new sequences and for the understanding of protein structure-function relationships of individual superfamilies. PASS2 is accessible at http://www.ncbs.res.in/~faculty/mini/campass/pass2.html

SUBMITTER: Bhaduri A

PROVIDER: S-EPMC407847 | biostudies-literature | 2004 Apr

REPOSITORIES: biostudies-literature

Publications

PASS2: an automated database of protein alignments organised as structural superfamilies.

Bhaduri A, Pugalenthi G, Sowdhamini R

BMC Bioinformatics, 2004 Apr 2

PMID: 15059245

Similar Datasets

PASS2: a semi-automated database of protein alignments organised as structural superfamilies. | S-EPMC99156 | biostudies-literature

PASS2 version 6: a database of structure-based sequence alignments of protein domain superfamilies in accordance with SCOPe. | S-EPMC6395796 | biostudies-literature

PASS2: update of database of structure-based sequence alignments. | S-EPMC12612674 | biostudies-literature

Flexible Structural Neighborhood--a database of protein structural similarities and alignments. | S-EPMC1347486 | biostudies-literature

DoSA: Database of Structural Alignments. | S-EPMC3708618 | biostudies-literature

PASS2 database for the structure-based sequence alignment of distantly related SCOP domain superfamilies: update to version 5 and added features. | S-EPMC4702857 | biostudies-literature

TOPOFIT-DB, a database of protein structural alignments based on the TOPOFIT method. | S-EPMC1635338 | biostudies-literature

DMAPS: a database of multiple alignments for protein structures. | S-EPMC1347381 | biostudies-literature

Automated identification of RNA 3D modules with discriminative power in RNA structural alignments. | S-EPMC3905863 | biostudies-literature

GeMMA: functional subfamily classification within superfamilies of predicted protein structural domains. | S-EPMC2817468 | biostudies-literature