Dataset Information

ASH structure alignment package: sensitivity and selectivity in domain classification.

ABSTRACT:

Background

Structure alignment methods offer the possibility of measuring distant evolutionary relationships between proteins that are not visible by sequence-based analysis. However, the question of how structural differences and similarities ought to be quantified in this regard remains open. In this study we construct a training set of sequence-unique CATH and SCOP domains, from which we develop a scoring function that can reliably identify domains with the same CATH topology and SCOP fold classification. The score is implemented in the ASH structure alignment package, for which the source code and a web service are freely available from the PDBj website http://www.pdbj.org/ASH/.

Results

The new ASH score shows increased selectivity and sensitivity compared with values reported for several popular programs using the same test set of 4,298,905 structure pairs, yielding an area of 0.96 under the receiver operating characteristic (ROC) curve. In addition, weak sequence homologies between similar domains are revealed that could not be detected by BLAST sequence alignment. Also, a subset of domain pairs is identified that exhibits high similarity even though their CATH and SCOP classifications differ. Finally, we show that the ranking of alignment programs based solely on geometric measures depends on the choice of the quality measure.
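For readers unfamiliar with the ROC figure quoted above, the following is a minimal sketch of how an area under the ROC curve can be computed from a list of alignment scores and binary labels. It is not the authors' evaluation code: the function name roc_auc, the toy scores, and the labels (True meaning "same CATH topology and SCOP fold") are all assumptions made for illustration. The sketch uses the standard rank-sum (Mann-Whitney U) identity, under which the area equals the probability that a randomly chosen positive pair scores higher than a randomly chosen negative pair, with ties counted as one half.

```python
from typing import Sequence


def roc_auc(scores: Sequence[float], labels: Sequence[bool]) -> float:
    """Area under the ROC curve via the rank-sum (Mann-Whitney U) identity.

    Higher scores are assumed to indicate a more likely positive pair.
    Tied scores receive the average of the ranks they span.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")

    # Sort pairs by ascending score so that ranks can be assigned in order.
    pairs = sorted(zip(scores, labels), key=lambda p: p[0])
    n_pos = sum(1 for _, is_pos in pairs if is_pos)
    n_neg = len(pairs) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative label")

    rank_sum_pos = 0.0
    i = 0
    while i < len(pairs):
        # Find the end (exclusive) of the run of tied scores starting at i.
        j = i
        while j < len(pairs) and pairs[j][0] == pairs[i][0]:
            j += 1
        # The run occupies 1-based ranks i+1 .. j; use their average.
        avg_rank = (i + 1 + j) / 2.0
        positives_in_run = sum(1 for k in range(i, j) if pairs[k][1])
        rank_sum_pos += avg_rank * positives_in_run
        i = j

    # U statistic for the positive class, normalised by the number of
    # positive/negative pairings.
    u = rank_sum_pos - n_pos * (n_pos + 1) / 2.0
    return u / (n_pos * n_neg)


# Hypothetical toy data: six structure pairs with made-up scores.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3]
toy_labels = [True, True, False, True, False, False]
toy_auc = roc_auc(toy_scores, toy_labels)
```

In the toy data, eight of the nine positive/negative pairings are ordered correctly, so toy_auc is 8/9. A value of 1.0 would mean the score separates the two classes perfectly, and 0.5 would mean it does no better than chance.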

Conclusion

ASH shows high selectivity and sensitivity with regard to domain classification, an important step in defining distantly related protein sequence families. Moreover, the CPU cost per alignment is competitive with the fastest programs, making ASH a practical option for large-scale structure classification studies.

SUBMITTER: Standley DM

PROVIDER: S-EPMC1955748 | biostudies-literature | 2007 Apr

REPOSITORIES: biostudies-literature

Publications

ASH structure alignment package: sensitivity and selectivity in domain classification.

Standley DM, Toh H, Nakamura H

BMC Bioinformatics, 2007 Apr 4

PMID: 17407606

Similar Datasets

STOPGAP, an open-source package for template matching, subtomogram alignment, and classification.
| S-EPMC10769363 | biostudies-literature

The FSSP database: fold classification based on structure-structure alignment of proteins.
| S-EPMC145583 | biostudies-other

Protein structure alignment by Reseek improves sensitivity to remote homologs.
| S-EPMC11601161 | biostudies-literature

Functional Classification and Interaction Selectivity Landscape of the Human SH3 Domain Superfamily.
| S-EPMC10814857 | biostudies-literature

A Treatment Package without Escape Extinction to Address Food Selectivity.
| S-EPMC4692558 | biostudies-other

Classification of protein quaternary structure by functional domain composition.
| S-EPMC1450311 | biostudies-literature

CroMaSt: a workflow for assessing protein domain classification by cross-mapping of structural instances between domain databases and structural alignment.
| S-EPMC10329740 | biostudies-literature

UniProt-DAAC: domain architecture alignment and classification, a new method for automatic functional annotation in UniProtKB.
| S-EPMC4965628 | biostudies-literature

Human fatty acid synthase: structure and substrate selectivity of the thioesterase domain.
| S-EPMC524853 | biostudies-literature

A perl package and an alignment tool for phylogenetic networks.
| S-EPMC2330044 | biostudies-literature