Dataset Information

Systematic comparison of SCOP and CATH: a new gold standard for protein structure analysis.

ABSTRACT: BACKGROUND: SCOP and CATH are widely used as gold standards to benchmark novel protein structure comparison methods and to train machine learning approaches for protein structure classification and prediction. The two hierarchies are built with different protocols, which can lead to differing classifications of the same protein. Ignoring such differences causes problems when the hierarchies are used to train or benchmark automatic structure classification methods. Here, we propose a method to compare SCOP and CATH in detail and discuss possible applications of this analysis.

RESULTS: We create a new mapping between SCOP and CATH and define a consistent benchmark set, which is shown to substantially reduce errors made by structure comparison methods such as TM-Align and has further applications, e.g. for training machine learning methods for protein structure classification. In addition, we extract further connections in the topology of the protein fold space from the orthogonal features contained in SCOP and CATH.

CONCLUSION: Via an all-to-all comparison, we find large and unexpected differences between SCOP and CATH with respect to their domain definitions as well as their hierarchical partitioning of the fold space on every level of the two classifications. A consistent mapping of SCOP and CATH can be exploited for automated structure comparison and classification.

AVAILABILITY: Benchmark sets and an interactive SCOP-CATH browser are available at http://www.bio.ifi.lmu.de/SCOPCath.
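The notion of a consistent benchmark set mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each domain already carries one SCOP fold label and one CATH topology label, and it calls a pair of domains consistent when both hierarchies agree on whether the two domains belong together. This is not the authors' mapping procedure, which also has to reconcile differing domain definitions; the domain identifiers, the label assignments, and the function name `classify_pairs` are hypothetical and chosen only for illustration.

```python
from itertools import combinations

# Hypothetical toy data: domain id -> (SCOP fold label, CATH topology label).
# These assignments are illustrative placeholders, not data from the paper.
domains = {
    "d1": ("a.1", "1.10.490"),
    "d2": ("a.1", "1.10.490"),
    "d3": ("b.1", "2.60.40"),
    "d4": ("b.1", "2.60.120"),
}

def classify_pairs(domains):
    """Split all domain pairs by whether the two hierarchies agree on them.

    A pair is 'consistent' if it is grouped together in both hierarchies
    or separated in both; otherwise it is 'inconsistent'.
    """
    consistent, inconsistent = [], []
    for a, b in combinations(sorted(domains), 2):
        same_scop = domains[a][0] == domains[b][0]
        same_cath = domains[a][1] == domains[b][1]
        target = consistent if same_scop == same_cath else inconsistent
        target.append((a, b))
    return consistent, inconsistent

consistent, inconsistent = classify_pairs(domains)
print("consistent pairs:", consistent)
print("inconsistent pairs:", inconsistent)
```

In this toy input, only the pair (d3, d4) is flagged as inconsistent, because the two domains share a SCOP label but not a CATH label. Under this assumption, a benchmark would keep only the consistent pairs.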

SUBMITTER: Csaba G

PROVIDER: S-EPMC2678134 | biostudies-literature | 2009

REPOSITORIES: biostudies-literature

Publications

Systematic comparison of SCOP and CATH: a new gold standard for protein structure analysis.

Gergely Csaba, Fabian Birzele, Ralf Zimmer

BMC Structural Biology, 2009-04-17

PMID: 19374763

Similar Datasets

Systematic comparison of variant calling pipelines using gold standard personal exome variants. | S-EPMC4671096 | biostudies-literature

A consensus view of fold space: combining SCOP, CATH, and the Dali Domain Dictionary. | S-EPMC2366924 | biostudies-literature

Extending CATH: increasing coverage of the protein structure universe and linking structure with function. | S-EPMC3013636 | biostudies-literature

Genome3D: a UK collaborative project to annotate genomic sequences with predicted 3D structures based on SCOP and CATH domains. | S-EPMC3531217 | biostudies-literature

CATH: an expanded resource to predict protein function through structure and sequence. | S-EPMC5210570 | biostudies-literature

Genome-based Salmonella serotyping as the new gold standard. | S-EPMC7062728 | biostudies-literature

New enumeration algorithm for protein structure comparison and classification. | S-EPMC3582452 | biostudies-literature

Addressing brain tumors with targeted gold nanoparticles: a new gold standard for hydrophobic drug delivery? | S-EPMC3837553 | biostudies-literature

Author Correction: Genome-based Salmonella serotyping as the new gold standard. | S-EPMC7316794 | biostudies-literature

Adventures in data citation: sorghum genome data exemplifies the new gold standard. | S-EPMC3392744 | biostudies-literature