Unknown

Dataset Information

0

Covering complete proteomes with X-ray structures: a current snapshot.


ABSTRACT: Structural genomics programs have developed and applied structure-determination pipelines to a wide range of protein targets, facilitating the visualization of macromolecular interactions and the understanding of their molecular and biochemical functions. The fundamental question of whether three-dimensional structures of all proteins and all functional annotations can be determined using X-ray crystallography is investigated. A first-of-its-kind large-scale analysis of crystallization propensity for all proteins encoded in 1953 fully sequenced genomes was performed. It is shown that current X-ray crystallographic knowhow combined with homology modeling can provide structures for 25% of modeling families (protein clusters for which structural models can be obtained through homology modeling), with at least one structural model produced for each Gene Ontology functional annotation. The coverage varies between superkingdoms, with 19% for eukaryotes, 35% for bacteria and 49% for archaea, and with those of viruses following the coverage values of their hosts. It is shown that the crystallization propensities of proteomes from the taxonomic superkingdoms are distinct. The use of knowledge-based target selection is shown to substantially increase the ability to produce X-ray structures. It is demonstrated that the human proteome has one of the highest attainable coverage values among eukaryotes, and GPCR membrane proteins suitable for X-ray structure determination were determined.

SUBMITTER: Mizianty MJ 

PROVIDER: S-EPMC4220968 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC3141956 | biostudies-literature
| S-EPMC7111068 | biostudies-literature
| S-EPMC2685734 | biostudies-literature
| S-EPMC4408594 | biostudies-literature
| S-EPMC5047515 | biostudies-literature
| S-EPMC8381864 | biostudies-literature
| S-EPMC556006 | biostudies-literature
| S-EPMC4640134 | biostudies-literature
| S-EPMC8320365 | biostudies-literature
| S-EPMC2879073 | biostudies-literature