Unknown

Dataset Information

0

Blurring contact maps of thousands of proteins: what we can learn by reconstructing 3D structure.


ABSTRACT: BACKGROUND:The present knowledge of protein structures at atomic level derives from some 60,000 molecules. Yet the exponential ever growing set of hypothetical protein sequences comprises some 10 million chains and this makes the problem of protein structure prediction one of the challenging goals of bioinformatics. In this context, the protein representation with contact maps is an intermediate step of fold recognition and constitutes the input of contact map predictors. However contact map representations require fast and reliable methods to reconstruct the specific folding of the protein backbone. METHODS:In this paper, by adopting a GRID technology, our algorithm for 3D reconstruction FT-COMAR is benchmarked on a huge set of non redundant proteins (1716) taking random noise into consideration and this makes our computation the largest ever performed for the task at hand. RESULTS:We can observe the effects of introducing random noise on 3D reconstruction and derive some considerations useful for future implementations. The dimension of the protein set allows also statistical considerations after grouping per SCOP structural classes. CONCLUSIONS:All together our data indicate that the quality of 3D reconstruction is unaffected by deleting up to an average 75% of the real contacts while only few percentage of randomly generated contacts in place of non-contacts are sufficient to hamper 3D reconstruction.

SUBMITTER: Vassura M 

PROVIDER: S-EPMC3033854 | biostudies-literature | 2011 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Blurring contact maps of thousands of proteins: what we can learn by reconstructing 3D structure.

Vassura Marco M   Di Lena Pietro P   Margara Luciano L   Mirto Maria M   Aloisio Giovanni G   Fariselli Piero P   Casadio Rita R  

BioData mining 20110113 1


<h4>Background</h4>The present knowledge of protein structures at atomic level derives from some 60,000 molecules. Yet the exponential ever growing set of hypothetical protein sequences comprises some 10 million chains and this makes the problem of protein structure prediction one of the challenging goals of bioinformatics. In this context, the protein representation with contact maps is an intermediate step of fold recognition and constitutes the input of contact map predictors. However contact  ...[more]

Similar Datasets

| S-EPMC1800355 | biostudies-literature
| S-EPMC7203009 | biostudies-literature
| S-EPMC8475980 | biostudies-literature
| S-EPMC5824572 | biostudies-literature
| S-EPMC7911092 | biostudies-literature
| S-EPMC4876523 | biostudies-literature
| S-EPMC6144818 | biostudies-literature
| S-EPMC6141191 | biostudies-literature
| S-EPMC5094293 | biostudies-literature
| S-EPMC8336924 | biostudies-literature