Unknown

Dataset Information

0

PALI-a database of Phylogeny and ALIgnment of homologous protein structures.


ABSTRACT: PALI (release 1.2) contains three-dimensional (3-D) structure-dependent sequence alignments as well as structure-based phylogenetic trees of homologous protein domains in various families. The data set of homologous protein structures has been derived by consulting the SCOP database (release 1.50) and the data set comprises 604 families of homologous proteins involving 2739 protein domain structures with each family made up of at least two members. Each member in a family has been structurally aligned with every other member in the same family (pairwise alignment) and all the members in the family are also aligned using simultaneous super-position (multiple alignment). The structural alignments are performed largely automatically, with manual interventions especially in the cases of distantly related proteins, using the program STAMP (version 4.2). Every family is also associated with two dendrograms, calculated using PHYLIP (version 3.5), one based on a structural dissimilarity metric defined for every pairwise alignment and the other based on similarity of topologically equivalent residues. These dendrograms enable easy comparison of sequence and structure-based relationships among the members in a family. Structure-based alignments with the details of structural and sequence similarities, superposed coordinate sets and dendrograms can be accessed conveniently using a web interface. The database can be queried for protein pairs with sequence or structural similarities falling within a specified range. Thus PALI forms a useful resource to help in analysing the relationship between sequence and structure variation at a given level of sequence similarity. PALI also contains over 653 'orphans' (single member families). Using the web interface involving PSI_BLAST and PHYLIP it is possible to associate the sequence of a new protein with one of the families in PALI and generate a phylogenetic tree combining the query sequence and proteins of known 3-D structure. The database with the web interfaced search and dendrogram generation tools can be accessed at http://pauling.mbu.iisc.ernet. in/ approximately pali.

SUBMITTER: Balaji S 

PROVIDER: S-EPMC29825 | biostudies-literature | 2001 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

PALI-a database of Phylogeny and ALIgnment of homologous protein structures.

Balaji S S   Sujatha S S   Kumar S S SS   Srinivasan N N  

Nucleic acids research 20010101 1


PALI (release 1.2) contains three-dimensional (3-D) structure-dependent sequence alignments as well as structure-based phylogenetic trees of homologous protein domains in various families. The data set of homologous protein structures has been derived by consulting the SCOP database (release 1.50) and the data set comprises 604 families of homologous proteins involving 2739 protein domain structures with each family made up of at least two members. Each member in a family has been structurally a  ...[more]

Similar Datasets

| S-EPMC308761 | biostudies-literature
| S-EPMC2279924 | biostudies-literature
| S-EPMC3697813 | biostudies-literature
| S-EPMC3645717 | biostudies-literature
| S-EPMC1182382 | biostudies-literature
| S-EPMC4029036 | biostudies-literature
| S-EPMC2535786 | biostudies-literature
| S-EPMC3166271 | biostudies-literature
| S-EPMC3964956 | biostudies-literature
| S-EPMC2238833 | biostudies-literature