Unknown

Dataset Information

0

Genome pool strategy for structural coverage of protein families.


ABSTRACT: Even closely homologous proteins often have different crystallization properties and propensities. This observation can be used to introduce an additional dimension into crystallization trials by simultaneous targeting multiple homologs in what we call a "genome pool" strategy. We show that this strategy works because protein physicochemical properties correlated with crystallization success have a surprisingly broad distribution within most protein families. There are also "easy" and "difficult" families where this distribution is tilted in one direction. This leads to uneven structural coverage of protein families, with more "easy" ones solved. Increasing the size of the "genome pool" can improve chances of solving the "difficult" ones. In contrast, our analysis does not indicate that any specific genomes are "easy" or "difficult". Finally, we show that the group of proteins with known 3D structures is systematically different from the general pool of known proteins and we assess the structural consequences of these differences.

SUBMITTER: Jaroszewski L 

PROVIDER: S-EPMC2902364 | biostudies-literature | 2008 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genome pool strategy for structural coverage of protein families.

Jaroszewski Lukasz L   Slabinski Lukasz L   Wooley John J   Deacon Ashley M AM   Lesley Scott A SA   Wilson Ian A IA   Godzik Adam A  

Structure (London, England : 1993) 20081101 11


Even closely homologous proteins often have different crystallization properties and propensities. This observation can be used to introduce an additional dimension into crystallization trials by simultaneous targeting multiple homologs in what we call a "genome pool" strategy. We show that this strategy works because protein physicochemical properties correlated with crystallization success have a surprisingly broad distribution within most protein families. There are also "easy" and "difficult  ...[more]

Similar Datasets

| S-EPMC2919736 | biostudies-literature
| S-EPMC4250425 | biostudies-literature
| S-EPMC102407 | biostudies-literature
| S-EPMC9252788 | biostudies-literature
| S-EPMC1560931 | biostudies-literature
| S-EPMC2144314 | biostudies-other
| S-EPMC169885 | biostudies-literature
| S-EPMC3982801 | biostudies-literature
| S-EPMC2692051 | biostudies-literature
| S-EPMC4610308 | biostudies-literature