Unknown

Dataset Information

0

Validation of the Crystallography Open Database using the Crystallographic Information Framework.


ABSTRACT: Data curation practices of the Crystallography Open Database (COD) are described with additional focus being placed on the formal validation using the Crystallographic Information Framework (CIF). The cif_validate program, capable of validating CIF files against both the DDL1 and the DDLm dictionaries, is presented and used to process the entirety of the COD. Validation results collected from over 450 000 CIF files are demonstrated to be a useful resource in the data maintenance process as well as the development of the underlying ontologies. A set of programs intended to aid in the dictionary migration from DDL1 to DDLm is also presented.

SUBMITTER: Vaitkus A 

PROVIDER: S-EPMC8056762 | biostudies-literature | 2021 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Validation of the Crystallography Open Database using the Crystallographic Information Framework.

Vaitkus Antanas A   Merkys Andrius A   Gražulis Saulius S  

Journal of applied crystallography 20210214 Pt 2


Data curation practices of the Crystallography Open Database (COD) are described with additional focus being placed on the formal validation using the Crystallographic Information Framework (CIF). The <i>cif_validate</i> program, capable of validating CIF files against both the DDL1 and the DDLm dictionaries, is presented and used to process the entirety of the COD. Validation results collected from over 450 000 CIF files are demonstrated to be a useful resource in the data maintenance process a  ...[more]

Similar Datasets

| S-EPMC10730636 | biostudies-literature
| S-EPMC8130828 | biostudies-literature
| S-EPMC5959826 | biostudies-literature
| S-EPMC7109073 | biostudies-literature
| S-EPMC2504710 | biostudies-literature
| S-EPMC11744758 | biostudies-literature
| S-EPMC11291090 | biostudies-literature
| S-EPMC3752282 | biostudies-literature
| S-EPMC10269341 | biostudies-literature
2007-12-08 | GSE9734 | GEO