Unknown

Dataset Information

0

Ab Initio structure prediction for Escherichia coli: towards genome-wide protein structure modeling and fold assignment.


ABSTRACT: Genome-wide protein structure prediction and structure-based function annotation have been a long-term goal in molecular biology but not yet become possible due to difficulties in modeling distant-homology targets. We developed a hybrid pipeline combining ab initio folding and template-based modeling for genome-wide structure prediction applied to the Escherichia coli genome. The pipeline was tested on 43 known sequences, where QUARK-based ab initio folding simulation generated models with TM-score 17% higher than that by traditional comparative modeling methods. For 495 unknown hard sequences, 72 are predicted to have a correct fold (TM-score > 0.5) and 321 have a substantial portion of structure correctly modeled (TM-score > 0.35). 317 sequences can be reliably assigned to a SCOP fold family based on structural analogy to existing proteins in PDB. The presented results, as a case study of E. coli, represent promising progress towards genome-wide structure modeling and fold family assignment using state-of-the-art ab initio folding algorithms.

SUBMITTER: Xu D 

PROVIDER: S-EPMC3667494 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Ab Initio structure prediction for Escherichia coli: towards genome-wide protein structure modeling and fold assignment.

Xu Dong D   Zhang Yang Y  

Scientific reports 20130101


Genome-wide protein structure prediction and structure-based function annotation have been a long-term goal in molecular biology but not yet become possible due to difficulties in modeling distant-homology targets. We developed a hybrid pipeline combining ab initio folding and template-based modeling for genome-wide structure prediction applied to the Escherichia coli genome. The pipeline was tested on 43 known sequences, where QUARK-based ab initio folding simulation generated models with TM-sc  ...[more]

Similar Datasets

| S-EPMC7055371 | biostudies-literature
| S-EPMC9568664 | biostudies-literature
| S-EPMC4547140 | biostudies-literature
| S-EPMC9248749 | biostudies-literature
| S-EPMC3616975 | biostudies-literature
| S-EPMC1878469 | biostudies-literature
| S-EPMC5656601 | biostudies-literature
| S-EPMC7217823 | biostudies-literature
| S-EPMC9814694 | biostudies-literature
| S-EPMC4285884 | biostudies-literature