Unknown

Dataset Information

0

Designing succinct structural alphabets.


ABSTRACT: MOTIVATION: The 3D structure of a protein sequence can be assembled from the substructures corresponding to small segments of this sequence. For each small sequence segment, there are only a few more likely substructures. We call them the 'structural alphabet' for this segment. Classical approaches such as ROSETTA used sequence profile and secondary structure information, to predict structural fragments. In contrast, we utilize more structural information, such as solvent accessibility and contact capacity, for finding structural fragments. RESULTS: Integer linear programming technique is applied to derive the best combination of these sequence and structural information items. This approach generates significantly more accurate and succinct structural alphabets with more than 50% improvement over the previous accuracies. With these novel structural alphabets, we are able to construct more accurate protein structures than the state-of-art ab initio protein structure prediction programs such as ROSETTA. We are also able to reduce the Kolodny's library size by a factor of 8, at the same accuracy. AVAILABILITY: The online FRazor server is under construction.

SUBMITTER: Li SC 

PROVIDER: S-EPMC2718643 | biostudies-literature | 2008 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Designing succinct structural alphabets.

Li Shuai Cheng SC   Bu Dongbo D   Gao Xin X   Xu Jinbo J   Li Ming M  

Bioinformatics (Oxford, England) 20080701 13


<h4>Motivation</h4>The 3D structure of a protein sequence can be assembled from the substructures corresponding to small segments of this sequence. For each small sequence segment, there are only a few more likely substructures. We call them the 'structural alphabet' for this segment. Classical approaches such as ROSETTA used sequence profile and secondary structure information, to predict structural fragments. In contrast, we utilize more structural information, such as solvent accessibility an  ...[more]

Similar Datasets

| S-EPMC8000707 | biostudies-literature
| S-EPMC4445325 | biostudies-literature
| S-EPMC2838871 | biostudies-literature
| S-EPMC7466747 | biostudies-literature
| S-EPMC4257954 | biostudies-literature
| S-EPMC5872255 | biostudies-literature
| S-EPMC6661066 | biostudies-literature
| S-EPMC7467413 | biostudies-literature
| S-EPMC3962180 | biostudies-literature
| S-EPMC3155763 | biostudies-literature