Unknown

Dataset Information

0

Unexpected features of the dark proteome.


ABSTRACT: We surveyed the "dark" proteome-that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44-54% of the proteome in eukaryotes and viruses was dark, compared with only ?14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome comprised dark proteins, in which the entire sequence lacked similarity to any known structure. Dark proteins fulfill a wide variety of functions, but a subset showed distinct and largely unexpected features, such as association with secretion, specific tissues, the endoplasmic reticulum, disulfide bonding, and proteolytic cleavage. Dark proteins also had short sequence length, low evolutionary reuse, and few known interactions with other proteins. These results suggest new research directions in structural and computational biology.

SUBMITTER: Perdigao N 

PROVIDER: S-EPMC4702990 | biostudies-literature | 2015 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications


We surveyed the "dark" proteome-that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44-54% of the proteome in eukaryotes and viruses was dark, compared with only ∼14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome compris  ...[more]

Similar Datasets

| S-EPMC6630768 | biostudies-literature
| S-EPMC7066081 | biostudies-literature
| S-EPMC5895634 | biostudies-literature
| S-EPMC6602455 | biostudies-literature
| S-EPMC3303825 | biostudies-literature
| S-EPMC5278394 | biostudies-literature
| S-EPMC2937035 | biostudies-literature
| S-EPMC3430701 | biostudies-literature
2012-01-31 | GSE35418 | GEO
| S-SCDT-MSB-20-9500 | biostudies-other