
Dataset Information

Ab Initio structure prediction for Escherichia coli: towards genome-wide protein structure modeling and fold assignment.


ABSTRACT: Genome-wide protein structure prediction and structure-based function annotation have been long-term goals in molecular biology but have not yet become possible because of difficulties in modeling distant-homology targets. We developed a hybrid pipeline combining ab initio folding and template-based modeling for genome-wide structure prediction and applied it to the Escherichia coli genome. The pipeline was tested on 43 known sequences, where QUARK-based ab initio folding simulation generated models with a TM-score 17% higher than that of traditional comparative modeling methods. Of 495 unknown hard sequences, 72 are predicted to have a correct fold (TM-score > 0.5) and 321 have a substantial portion of the structure correctly modeled (TM-score > 0.35). 317 sequences can be reliably assigned to a SCOP fold family based on structural analogy to existing proteins in the PDB. The presented results, as a case study of E. coli, represent promising progress towards genome-wide structure modeling and fold family assignment using state-of-the-art ab initio folding algorithms.
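
The TM-score cutoffs quoted in the abstract can be illustrated with a small sketch. This is not the authors' pipeline: the function names and category labels below are hypothetical, and only the two thresholds (0.5 and 0.35) and the counts (72 and 321 of 495) are taken from the abstract.

```python
# Illustrative sketch only; names and labels are assumptions, not part of
# the published pipeline. Thresholds come from the abstract.
CORRECT_FOLD_CUTOFF = 0.5      # "correct fold" (TM-score > 0.5)
SUBSTANTIAL_CUTOFF = 0.35      # "substantial portion" (TM-score > 0.35)


def classify_tm(tm_score: float) -> str:
    """Bin a predicted TM-score using the abstract's strict '>' cutoffs."""
    if tm_score > CORRECT_FOLD_CUTOFF:
        return "correct_fold"
    if tm_score > SUBSTANTIAL_CUTOFF:
        return "substantial_portion"
    return "below_threshold"


def percent(count: int, total: int) -> float:
    """Share of `total` represented by `count`, as a percentage (1 decimal)."""
    return round(100.0 * count / total, 1)


if __name__ == "__main__":
    total_hard = 495  # unknown hard sequences reported in the abstract
    print(classify_tm(0.62))        # correct_fold
    print(classify_tm(0.40))        # substantial_portion
    print(percent(72, total_hard))  # 14.5
    print(percent(321, total_hard)) # 64.8
```

Under these assumptions, roughly 14.5% of the 495 hard sequences fall in the first category and about 64.8% exceed the lower cutoff. The abstract does not say whether the 321 count includes the 72, so the sketch does not combine them.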

SUBMITTER: Xu D 

PROVIDER: S-EPMC3667494 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

Ab Initio structure prediction for Escherichia coli: towards genome-wide protein structure modeling and fold assignment.

Dong Xu, Yang Zhang

Scientific Reports, 2013-01-01



Similar Datasets

| S-EPMC1303233 | biostudies-literature
| S-EPMC6031167 | biostudies-literature
| S-EPMC7523679 | biostudies-literature
| S-EPMC9936577 | biostudies-literature
| S-EPMC3976798 | biostudies-other
| S-EPMC9568664 | biostudies-literature
| S-EPMC10448985 | biostudies-literature
| S-EPMC4547140 | biostudies-literature
| S-EPMC2390798 | biostudies-literature
| S-EPMC9160776 | biostudies-literature