Dataset Information

Expanded explorations into the optimization of an energy function for protein design.


ABSTRACT: Nature possesses a secret formula for the energy as a function of the structure of a protein. In protein design, approximations are made to both the structural representation of the molecule and to the form of the energy equation, such that the existence of a general energy function for proteins is by no means guaranteed. Here, we present new insights toward the application of machine learning to the problem of finding a general energy function for protein design. Machine learning requires the definition of an objective function, which carries with it the implied definition of success in protein design. We explored four functions, consisting of two functional forms, each with two criteria for success. Optimization was carried out by a Monte Carlo search through the space of all variable parameters. Cross-validation of the optimized energy function against a test set gave significantly different results depending on the choice of objective function, pointing to relative correctness of the built-in assumptions. Novel energy cross terms correct for the observed nonadditivity of energy terms and an imbalance in the distribution of predicted amino acids. This paper expands on the work presented at the 2012 ACM-BCB.
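The abstract's optimization step (a Monte Carlo search over the variable parameters of an energy function, scored by an objective function that encodes a definition of success) can be illustrated with a minimal sketch. This is not the authors' implementation: the linear energy form, the "native scores below every decoy" success criterion, the Metropolis acceptance rule, and all names (`energy`, `objective`, `monte_carlo_search`, `step_size`, `temperature`) are assumptions chosen for illustration only.

```python
import math
import random


def energy(weights, terms):
    """Assumed linear energy: weighted sum of per-term values."""
    return sum(w * t for w, t in zip(weights, terms))


def objective(weights, cases):
    """Assumed success criterion: fraction of cases in which the native
    candidate has strictly lower energy than every decoy (higher is better)."""
    hits = 0
    for native, decoys in cases:
        e_native = energy(weights, native)
        if all(e_native < energy(weights, d) for d in decoys):
            hits += 1
    return hits / len(cases)


def monte_carlo_search(cases, n_terms, steps=2000, step_size=0.2,
                       temperature=0.05, seed=0):
    """Metropolis-style random walk through weight space (hypothetical setup).

    Returns the best weight vector seen and its objective value.
    """
    rng = random.Random(seed)
    current = [1.0] * n_terms
    current_score = objective(current, cases)
    best, best_score = list(current), current_score
    for _ in range(steps):
        # Perturb one randomly chosen weight with Gaussian noise.
        trial = list(current)
        i = rng.randrange(n_terms)
        trial[i] += rng.gauss(0.0, step_size)
        trial_score = objective(trial, cases)
        delta = trial_score - current_score
        # Always accept improvements; accept worse moves with
        # probability exp(delta / temperature).
        if delta >= 0 or rng.random() < math.exp(delta / temperature):
            current, current_score = trial, trial_score
            if current_score > best_score:
                best, best_score = list(current), current_score
    return best, best_score
```

Here each case is a pair `(native_terms, [decoy_terms, ...])` of made-up per-term energy values. Swapping in a different `objective` changes what counts as success, which is the kind of choice the abstract reports as affecting cross-validation results.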

SUBMITTER: Huang YM 

PROVIDER: S-EPMC3919130 | biostudies-literature | 2013 Sep-Oct

REPOSITORIES: biostudies-literature

Publications

Expanded explorations into the optimization of an energy function for protein design.

Huang Yao-Ming, Bystroff Christopher

IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2013-09-01, issue 5


Similar Datasets

| S-EPMC21885 | biostudies-literature
| S-EPMC3183803 | biostudies-literature
| S-EPMC7144094 | biostudies-literature
2018-10-01 | GSE118147 | GEO
| S-EPMC6736313 | biostudies-literature
| S-EPMC30138 | biostudies-literature
| S-EPMC3170394 | biostudies-other
| S-EPMC2373672 | biostudies-literature
| S-EPMC3531388 | biostudies-literature
| S-EPMC7980421 | biostudies-literature