Dataset Information

A chemogenomics view on protein-ligand spaces.

ABSTRACT: BACKGROUND: Chemogenomics is an emerging inter-disciplinary approach to drug discovery that combines traditional ligand-based approaches with biological information on drug targets and lies at the interface of chemistry, biology and informatics. The ultimate goal in chemogenomics is to understand molecular recognition between all possible ligands and all possible drug targets. Protein and ligand space have previously been studied as separate entities, but chemogenomics studies deal with large datasets that cover parts of the joint protein-ligand space. Since drug discovery has traditionally focused on ligand optimization, the chemical space has been studied extensively. The protein space has been studied to some extent, typically for the purpose of classification of proteins into functional and structural classes. Since chemogenomics deals not only with ligands but also with the macromolecules the ligands interact with, it is of interest to find means to explore, compare and visualize protein-ligand subspaces. RESULTS: Two chemogenomics protein-ligand interaction datasets were prepared for this study. The first dataset covers the known structural protein-ligand space, and includes all non-redundant protein-ligand interactions found in the worldwide Protein Data Bank (PDB). The second dataset contains all approved drugs and drug targets stored in the DrugBank database, and represents the approved drug-drug target space. To capture biological and physicochemical features of the chemogenomics datasets, sequence-based descriptors were computed for the proteins, and 0, 1 and 2 dimensional descriptors for the ligands. Principal component analysis (PCA) was used to analyze the multidimensional data and to create global models of protein-ligand space. The nearest neighbour method, computed using the principal components, was used to obtain a measure of overlap between the datasets. CONCLUSION: In this study, we present an approach to visualize protein-ligand spaces from a chemogenomics perspective, where both ligand and protein features are taken into account. The method can be applied to any protein-ligand interaction dataset. Here, the approach is applied to analyze the structural protein-ligand space and the protein-ligand space of all approved drugs and their targets. We show that this approach can be used to visualize and compare chemogenomics datasets, and possibly to identify cross-interaction complexes in protein-ligand space.

SUBMITTER: Strombergsson H

PROVIDER: S-EPMC2697636 | biostudies-literature | 2009

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A chemogenomics view on protein-ligand spaces.

Strömbergsson Helena H Kleywegt Gerard J GJ

BMC bioinformatics 20090616

<h4>Background</h4>Chemogenomics is an emerging inter-disciplinary approach to drug discovery that combines traditional ligand-based approaches with biological information on drug targets and lies at the interface of chemistry, biology and informatics. The ultimate goal in chemogenomics is to understand molecular recognition between all possible ligands and all possible drug targets. Protein and ligand space have previously been studied as separate entities, but chemogenomics studies deal with l ...[more]

PMID: 19534738

Similar Datasets

Project description:Background and purposeChemogenomics focuses on the discovery of new connections between chemical and biological space leading to the discovery of new protein targets and biologically active molecules. G-protein coupled receptors (GPCRs) are a particularly interesting protein family for chemogenomics studies because there is an overwhelming amount of ligand binding affinity data available. The increasing number of aminergic GPCR crystal structures now for the first time allows the integration of chemogenomics studies with high-resolution structural analyses of GPCR-ligand complexes.Experimental approachIn this study, we have combined ligand affinity data, receptor mutagenesis studies, and amino acid sequence analyses to high-resolution structural analyses of (hist)aminergic GPCR-ligand interactions. This integrated structural chemogenomics analysis is used to more accurately describe the molecular and structural determinants of ligand affinity and selectivity in different key binding regions of the crystallized aminergic GPCRs, and histamine receptors in particular.Key resultsOur investigations highlight interesting correlations and differences between ligand similarity and ligand binding site similarity of different aminergic receptors. Apparent discrepancies can be explained by combining detailed analysis of crystallized or predicted protein-ligand binding modes, receptor mutation studies, and ligand structure-selectivity relationships that identify local differences in essential pharmacophore features in the ligand binding sites of different receptors.Conclusions and implicationsWe have performed structural chemogenomics studies that identify links between (hist)aminergic receptor ligands and their binding sites and binding modes. This knowledge can be used to identify structure-selectivity relationships that increase our understanding of ligand binding to (hist)aminergic receptors and hence can be used in future GPCR ligand discovery and design.

Dataset Information

A chemogenomics view on protein-ligand spaces.

Publications

A chemogenomics view on protein-ligand spaces.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets