Dataset Information

Systematic domain-based aggregation of protein structures highlights DNA-, RNA- and other ligand-binding positions.

ABSTRACT: Domains are fundamental subunits of proteins, and while they play major roles in facilitating protein-DNA, protein-RNA and other protein-ligand interactions, a systematic assessment of their various interaction modes is still lacking. A comprehensive resource identifying positions within domains that tend to interact with nucleic acids, small molecules and other ligands would expand our knowledge of domain functionality as well as aid in detecting ligand-binding sites within structurally uncharacterized proteins. Here, we introduce an approach to identify per-domain-position interaction 'frequencies' by aggregating protein co-complex structures by domain and ascertaining how often residues mapping to each domain position interact with ligands. We perform this domain-based analysis on ~91000 co-complex structures, and infer positions involved in binding DNA, RNA, peptides, ions or small molecules across 4128 domains, which we refer to collectively as the InteracDome. Cross-validation testing reveals that ligand-binding positions for 2152 domains are highly consistent and can be used to identify residues facilitating interactions in ~63-69% of human genes. Our resource of domain-inferred ligand-binding sites should be a great aid in understanding disease etiology: whereas these sites are enriched in Mendelian-associated and cancer somatic mutations, they are depleted in polymorphisms observed across healthy populations. The InteracDome is available at http://interacdome.princeton.edu.
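The per-domain-position 'frequency' idea in the abstract can be illustrated with a minimal sketch. It assumes a plain unweighted fraction (instances in which a position contacts a ligand, divided by instances in which that position is observed); the published method may weight or filter instances differently. The `DomainInstance` type, the `binding_frequencies` function and the toy data are hypothetical and are not taken from the InteracDome resource.

```python
from collections import defaultdict
from typing import Dict, Iterable, NamedTuple, Set, Tuple


class DomainInstance(NamedTuple):
    """One occurrence of a domain in a co-complex structure (hypothetical type)."""
    domain: str         # domain identifier
    ligand_type: str    # e.g. "DNA", "RNA", "peptide", "ion", "small molecule"
    covered: Set[int]   # domain positions that residues of this instance map to
    contacts: Set[int]  # subset of covered positions in contact with the ligand


Key = Tuple[str, str, int]


def binding_frequencies(instances: Iterable[DomainInstance]) -> Dict[Key, float]:
    """Return, per (domain, ligand type, position), the fraction of instances
    covering that position in which the position contacts the ligand.

    Assumption: every instance counts equally (no redundancy weighting).
    """
    seen: Dict[Key, int] = defaultdict(int)
    bound: Dict[Key, int] = defaultdict(int)
    for inst in instances:
        for pos in inst.covered:
            key = (inst.domain, inst.ligand_type, pos)
            seen[key] += 1
            if pos in inst.contacts:
                bound[key] += 1
    return {key: bound[key] / count for key, count in seen.items()}


# Toy data, invented purely for illustration.
toy = [
    DomainInstance("toy_domain", "DNA", {1, 2, 3}, {2, 3}),
    DomainInstance("toy_domain", "DNA", {1, 2, 3}, {3}),
    DomainInstance("toy_domain", "DNA", {2, 3}, set()),
]
freqs = binding_frequencies(toy)
for key in sorted(freqs):
    print(key, round(freqs[key], 3))
```

In the toy data, position 3 contacts DNA in two of the three instances that cover it, so it receives the highest frequency; position 1 is never in contact and scores zero.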

SUBMITTER: Kobren SN

PROVIDER: S-EPMC6344845 | biostudies-literature | 2019 Jan

REPOSITORIES: biostudies-literature

Publications

Systematic domain-based aggregation of protein structures highlights DNA-, RNA- and other ligand-binding positions.

Shilpa Nadimpalli Kobren, Mona Singh

Nucleic Acids Research, 2019-01-01, issue 2

PMID: 30535108

Similar Datasets

The glucocorticoid receptor DNA-binding domain recognizes RNA hairpin structures with high affinity. | S-EPMC6735959 | biostudies-literature

Evidence for DNA-binding domain--ligand-binding domain communications in the androgen receptor. | S-EPMC3434514 | biostudies-literature

PPARα Ligand-Binding Domain Structures with Endogenous Fatty Acids and Fibrates. | S-EPMC7653058 | biostudies-literature

Structures of the ligand-binding domain of Helicobacter pylori chemoreceptor TlpA. | S-EPMC6201720 | biostudies-literature

Ligand binding and crystal structures of the substrate-binding domain of the ABC transporter OpuA. | S-EPMC2861598 | biostudies-literature

Structures of the HIN domain:DNA complexes reveal ligand binding and activation mechanisms of the AIM2 inflammasome and IFI16 receptor. | S-EPMC3334467 | biostudies-literature

Crystal structures of PI3K-C2alpha PX domain indicate conformational change associated with ligand binding. | S-EPMC2292188 | biostudies-literature

Ligand binding and aggregation of pathogenic SOD1. | S-EPMC3644087 | biostudies-literature

Structures of the Escherichia coli transcription activator and regulator of diauxie, XylR: an AraC DNA-binding family member with a LacI/GalR ligand-binding domain. | S-EPMC3561964 | biostudies-literature

Crystal structures of complexes of vitamin D receptor ligand-binding domain with lithocholic acid derivatives. | S-EPMC3708370 | biostudies-literature