Unknown

Dataset Information

0

PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank.


ABSTRACT: While the Protein Data Bank (PDB) contains a wealth of structural information on ligands bound to macromolecules, their analysis can be challenging due to the large amount and diversity of data. Here, we present PDBe CCDUtils, a versatile toolkit for processing and analysing small molecules from the PDB in PDBx/mmCIF format. PDBe CCDUtils provides streamlined access to all the metadata for small molecules in the PDB and offers a set of convenient methods to compute various properties using RDKit, such as 2D depictions, 3D conformers, physicochemical properties, scaffolds, common fragments, and cross-references to small molecule databases using UniChem. The toolkit also provides methods for identifying all the covalently attached chemical components in a macromolecular structure and calculating similarity among small molecules. By providing a broad range of functionality, PDBe CCDUtils caters to the needs of researchers in cheminformatics, structural biology, bioinformatics and computational chemistry.

SUBMITTER: Kunnakkattu IR 

PROVIDER: S-EPMC10693035 | biostudies-literature | 2023 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank.

Kunnakkattu Ibrahim Roshan IR   Choudhary Preeti P   Pravda Lukas L   Nadzirin Nurul N   Smart Oliver S OS   Yuan Qi Q   Anyango Stephen S   Nair Sreenath S   Varadi Mihaly M   Velankar Sameer S  

Journal of cheminformatics 20231202 1


While the Protein Data Bank (PDB) contains a wealth of structural information on ligands bound to macromolecules, their analysis can be challenging due to the large amount and diversity of data. Here, we present PDBe CCDUtils, a versatile toolkit for processing and analysing small molecules from the PDB in PDBx/mmCIF format. PDBe CCDUtils provides streamlined access to all the metadata for small molecules in the PDB and offers a set of convenient methods to compute various properties using RDKit  ...[more]

Similar Datasets

| S-EPMC2808887 | biostudies-literature
| S-EPMC3013808 | biostudies-literature
| S-EPMC3965016 | biostudies-literature
| S-EPMC3245096 | biostudies-literature
| S-EPMC5753225 | biostudies-literature
| S-EPMC3069747 | biostudies-literature
| S-EPMC3843158 | biostudies-literature
| S-EPMC4243272 | biostudies-literature
| S-EPMC8457362 | biostudies-literature
| S-EPMC8849442 | biostudies-literature