Dataset Information

ProDCoNN: Protein design using a convolutional neural network.


ABSTRACT: Designing protein sequences that fold to a given three-dimensional (3D) structure has long been a challenging problem in computational structural biology with significant theoretical and practical implications. In this study, we first formulated this problem as predicting the residue type given the 3D structural environment around the Cα atom of a residue, which is repeated for each residue of a protein. We designed a nine-layer 3D deep convolutional neural network (CNN) that takes as input a gridded box with the atomic coordinates and types around a residue. Several CNN layers were designed to capture structure information at different scales, such as bond lengths, bond angles, torsion angles, and secondary structures. Trained on a very large number of protein structures, the method, called ProDCoNN (protein design with CNN), achieved state-of-the-art performance when tested on large numbers of test proteins and benchmark datasets.
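
The "gridded box with the atomic coordinates and types around a residue" mentioned in the abstract can be illustrated with a minimal voxelization sketch. This is not the ProDCoNN implementation: the function name `voxelize_environment`, the channel list `ATOM_TYPES`, the 18 Å box, and the 1 Å bin width are all assumptions chosen for illustration, and the box here is axis-aligned rather than oriented in any residue-local frame.

```python
import numpy as np

# Hypothetical channel order; the abstract does not list the atom types used.
ATOM_TYPES = ("C", "N", "O", "S")


def voxelize_environment(coords, elements, center, box_size=18.0, n_bins=18):
    """Bin atoms near `center` into a (channels, n_bins, n_bins, n_bins) occupancy grid.

    box_size (in angstroms) and n_bins are illustrative defaults, not values
    taken from the source.
    """
    coords = np.asarray(coords, dtype=float)
    grid = np.zeros((len(ATOM_TYPES), n_bins, n_bins, n_bins), dtype=np.float32)
    # Shift coordinates so the box spans [0, box_size) on each axis,
    # with the C-alpha position at the middle of the box.
    shifted = coords - np.asarray(center, dtype=float) + box_size / 2.0
    idx = np.floor(shifted / (box_size / n_bins)).astype(int)
    for (i, j, k), element in zip(idx, elements):
        inside = all(0 <= v < n_bins for v in (i, j, k))
        if inside and element in ATOM_TYPES:
            # One channel per atom type; mark the voxel as occupied.
            grid[ATOM_TYPES.index(element), i, j, k] = 1.0
    return grid
```

A grid built this way for each residue would be the kind of tensor a 3D CNN could classify into one of the 20 residue types. Atoms outside the box or of an unlisted type are simply ignored in this sketch.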

SUBMITTER: Zhang Y 

PROVIDER: S-EPMC8204568 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC9709917 | biostudies-literature
| 2443187 | ecrin-mdr-crc
| S-EPMC6343664 | biostudies-literature
| S-EPMC7013409 | biostudies-literature
| S-EPMC6197001 | biostudies-literature
| S-EPMC5940226 | biostudies-literature
| S-EPMC7180882 | biostudies-literature
| S-EPMC8576712 | biostudies-literature
| S-EPMC5571811 | biostudies-literature
| S-EPMC4908339 | biostudies-literature