
Dataset Information


PSI-2: structural genomics to cover protein domain family space.


ABSTRACT: One major objective of structural genomics efforts, including the NIH-funded Protein Structure Initiative (PSI), has been to increase the structural coverage of protein sequence space. Here, we present the target selection strategy used during the second phase of PSI (PSI-2). This strategy, jointly devised by the bioinformatics groups associated with the PSI-2 large-scale production centers, targets representatives from large, structurally uncharacterized protein domain families, and from structurally uncharacterized subfamilies in very large and diverse families with incomplete structural coverage. These very large families are extremely diverse both structurally and functionally, and are highly overrepresented in known proteomes. On the basis of several metrics, we then discuss to what extent PSI-2, during its first 3 years, has increased the structural coverage of genomes, and contributed structural and functional novelty. Together, the results presented here suggest that PSI-2 is successfully meeting its objectives and provides useful insights into structural and functional space.

SUBMITTER: Dessailly BH 

PROVIDER: S-EPMC2920419 | biostudies-literature | 2009 Jun

REPOSITORIES: biostudies-literature


Publications

PSI-2: structural genomics to cover protein domain family space.

Dessailly, Benoît H; Nair, Rajesh; Jaroszewski, Lukasz; Fajardo, J Eduardo; Kouranov, Andrei; Lee, David; Fiser, Andras; Godzik, Adam; Rost, Burkhard; Orengo, Christine

Structure (London, England : 1993), 2009-06-01, issue 6



Similar Datasets

| S-EPMC1373602 | biostudies-literature
| S-EPMC543483 | biostudies-literature
| S-EPMC9776390 | biostudies-literature
| S-EPMC2373547 | biostudies-literature
| S-EPMC4163028 | biostudies-literature
| S-EPMC2064049 | biostudies-literature
| S-EPMC2359764 | biostudies-literature
| S-EPMC8166929 | biostudies-literature
| S-EPMC2778303 | biostudies-literature
| S-EPMC4080787 | biostudies-literature