Unknown

Dataset Information

0

A limited universe of membrane protein families and folds.


ABSTRACT: One of the goals of structural genomics is to obtain a structural representative of almost every fold in nature. A recent estimate suggests that 70%-80% of soluble protein domains identified in the first 1000 genome sequences should be covered by about 25,000 structures-a reasonably achievable goal. As no current estimates exist for the number of membrane protein families, however, it is not possible to know whether family coverage is a realistic goal for membrane proteins. Here we find that virtually all polytopic helical membrane protein families are present in the already known sequences so we can make an estimate of the total number of families. We find that only approximately 700 polytopic membrane protein families account for 80% of structured residues and approximately 1700 cover 90% of structured residues. While apparently a finite and reachable goal, we estimate that it will likely take more than three decades to obtain the structures needed for 90% residue coverage, if current trends continue.

SUBMITTER: Oberai A 

PROVIDER: S-EPMC2242558 | biostudies-literature | 2006 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

A limited universe of membrane protein families and folds.

Oberai Amit A   Ihm Yungok Y   Kim Sanguk S   Bowie James U JU  

Protein science : a publication of the Protein Society 20060701 7


One of the goals of structural genomics is to obtain a structural representative of almost every fold in nature. A recent estimate suggests that 70%-80% of soluble protein domains identified in the first 1000 genome sequences should be covered by about 25,000 structures-a reasonably achievable goal. As no current estimates exist for the number of membrane protein families, however, it is not possible to know whether family coverage is a realistic goal for membrane proteins. Here we find that vir  ...[more]

Similar Datasets

| S-EPMC5529312 | biostudies-literature
| S-EPMC148146 | biostudies-other
| S-EPMC4171485 | biostudies-literature
| S-EPMC1821046 | biostudies-literature
| S-EPMC2323961 | biostudies-literature
| S-EPMC3118404 | biostudies-literature
| S-EPMC2698892 | biostudies-literature
| S-EPMC1914367 | biostudies-literature
| S-EPMC7367713 | biostudies-literature
| S-EPMC3003448 | biostudies-literature