Dataset Information

Delineation and analysis of the conceptual data model implied by the "IUPAC Recommendations for Biochemical Nomenclature".


ABSTRACT: Computational analysis of the bonding, geometric, and topological relationships within proteins typically takes on the order of hours, mainly devoted to the writing of scripts and code to correctly parse the data. The Structured Query Language (SQL) built into modern database management systems eliminates the need for data parsing, effectively reducing the analysis time to seconds. To this end, we have formulated a conceptual data model (CDM) for proteins based on the IUPAC recommendations for biochemical nomenclature. This conceptual data model makes explicit the inherent bonding relationships between the atoms of a protein, as well as the geometric (bond angle and torsion angle) and topological (chirality) relationships between the bonds. The validity of the CDM has been tested with a reduced implementation using commercial database software. The ease in both populating the database with data from the Protein Data Bank and formulating/executing queries supports the correctness of the model. The ability to conduct truly interactive analyses of protein structure is essential to fully capitalize on the explosion in postgenomic protein structure data.
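As a rough illustration of the idea in the abstract, the sketch below stores atoms and bonds as relational tables and derives bond-angle triples with a plain SQL join, using Python's built-in sqlite3 module. The table names (atom, bond), the view name (bond_sym), the column names, and the single glycine backbone used as sample data are assumptions made for this example; they are not the schema of the published CDM, and the rows are hand-entered rather than loaded from the Protein Data Bank.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical atom/bond schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE atom (
            atom_id      INTEGER PRIMARY KEY,
            residue_seq  INTEGER NOT NULL,
            residue_name TEXT    NOT NULL,
            atom_name    TEXT    NOT NULL
        );
        CREATE TABLE bond (
            atom1_id INTEGER NOT NULL REFERENCES atom(atom_id),
            atom2_id INTEGER NOT NULL REFERENCES atom(atom_id),
            PRIMARY KEY (atom1_id, atom2_id)
        );
        -- Bonds are undirected, so expose each one in both directions.
        CREATE VIEW bond_sym AS
            SELECT atom1_id AS a, atom2_id AS b FROM bond
            UNION ALL
            SELECT atom2_id AS a, atom1_id AS b FROM bond;
        """
    )
    # Illustrative data: heavy backbone atoms of one glycine residue.
    atoms = [
        (1, 1, "GLY", "N"),
        (2, 1, "GLY", "CA"),
        (3, 1, "GLY", "C"),
        (4, 1, "GLY", "O"),
    ]
    bonds = [(1, 2), (2, 3), (3, 4)]  # N-CA, CA-C, C-O
    conn.executemany("INSERT INTO atom VALUES (?, ?, ?, ?)", atoms)
    conn.executemany("INSERT INTO bond VALUES (?, ?)", bonds)
    return conn


def bonded_pairs(conn):
    """Return (atom_name, atom_name) for every stored bond."""
    return conn.execute(
        """
        SELECT a1.atom_name, a2.atom_name
        FROM bond AS b
        JOIN atom AS a1 ON a1.atom_id = b.atom1_id
        JOIN atom AS a2 ON a2.atom_id = b.atom2_id
        ORDER BY b.atom1_id, b.atom2_id
        """
    ).fetchall()


def bond_angle_triples(conn):
    """Return (end, center, end) names for each pair of bonds sharing an atom."""
    return conn.execute(
        """
        SELECT e1.atom_name, c.atom_name, e2.atom_name
        FROM bond_sym AS l
        JOIN bond_sym AS r ON r.a = l.b   -- l.b is the shared central atom
        JOIN atom AS e1 ON e1.atom_id = l.a
        JOIN atom AS c  ON c.atom_id  = l.b
        JOIN atom AS e2 ON e2.atom_id = r.b
        WHERE l.a < r.b                   -- report each angle once
        ORDER BY c.atom_id, l.a, r.b
        """
    ).fetchall()


if __name__ == "__main__":
    db = build_demo_db()
    print(bonded_pairs(db))
    print(bond_angle_triples(db))
```

Because the bonding relationships are stored explicitly, the angle query needs no file parsing: it is a self-join on the bond table. Torsion angles could be enumerated the same way with one more join, under the same assumed schema.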

SUBMITTER: Fox-Erlich S 

PROVIDER: S-EPMC2280010 | biostudies-literature | 2004 Sep

REPOSITORIES: biostudies-literature

Publications

Delineation and analysis of the conceptual data model implied by the "IUPAC Recommendations for Biochemical Nomenclature".

Fox-Erlich S, Martyn TO, Ellis HJC, Gryk MR

Protein Science: a publication of the Protein Society, 2004-08-04, issue 9


Similar Datasets

| S-EPMC2865858 | biostudies-literature
| S-EPMC6349552 | biostudies-literature
| S-EPMC6484368 | biostudies-literature
| S-EPMC9378753 | biostudies-literature
| S-EPMC4783007 | biostudies-literature
| S-EPMC7528558 | biostudies-literature
| S-EPMC10895818 | biostudies-literature
| S-EPMC4724253 | biostudies-literature
| S-EPMC2830119 | biostudies-literature
| S-EPMC6710392 | biostudies-literature