
Dataset Information


Aromatic claw: A new fold with high aromatic content that evades structural prediction.


ABSTRACT: We determined the NMR structure of a highly aromatic (13%) protein of unknown function, Aq1974 from Aquifex aeolicus (PDB ID: 5SYQ). The unusual sequence of this protein has a tryptophan content five times the normal (six tryptophan residues of 114 or 5.2% while the average tryptophan content is 1.0%) with the tryptophans occurring in a WXW motif. It has no detectable sequence homology with known protein structures. Although its NMR spectrum suggested that the protein was rich in β-sheet, upon resonance assignment and solution structure determination, the protein was found to be primarily α-helical with a small two-stranded β-sheet with a novel fold that we have termed an Aromatic Claw. As this fold was previously unknown and the sequence unique, we submitted the sequence to CASP10 as a target for blind structural prediction. At the end of the competition, the sequence was classified a hard template based model; the structural relationship between the template and the experimental structure was small and the predictions all failed to predict the structure. CSRosetta was found to predict the secondary structure and its packing; however, it was found that there was little correlation between CSRosetta score and the RMSD between the CSRosetta structure and the NMR determined one. This work demonstrates that even in relatively small proteins, we do not yet have the capacity to accurately predict the fold for all primary sequences. The experimental discovery of new folds helps guide the improvement of structural prediction methods.

SUBMITTER: Sachleben JR 

PROVIDER: S-EPMC5275723 | biostudies-literature | 2017 Feb

REPOSITORIES: biostudies-literature


Publications

Aromatic claw: A new fold with high aromatic content that evades structural prediction.

Sachleben Joseph R, Adhikari Aashish N, Gawlak Grzegorz, Hoey Robert J, Liu Gaohua, Joachimiak Andrzej, Montelione Gaetano T, Sosnick Tobin R, Koide Shohei

Protein Science: a publication of the Protein Society, 2016-11-10, issue 2



Similar Datasets

| S-EPMC5131300 | biostudies-literature
| S-EPMC6043474 | biostudies-literature
| S-EPMC1751310 | biostudies-literature
| S-EPMC3360076 | biostudies-literature
| S-EPMC2144056 | biostudies-other
| S-EPMC4570669 | biostudies-literature
| S-EPMC3674391 | biostudies-literature
| S-EPMC305551 | biostudies-literature
| S-EPMC5684096 | biostudies-literature
| S-EPMC3536170 | biostudies-other