Unknown

Dataset Information

0

Specific and intrinsic sequence patterns extracted by deep learning from intra-protein binding and non-binding peptide fragments.


ABSTRACT: The key finding in the DNA double helix model is the specific pairing or binding between nucleotides A-T and C-G, and the pairing rules are the molecule basis of genetic code. Unfortunately, no such rules have been discovered for proteins. Here we show that intrinsic sequence patterns between intra-protein binding peptide fragments exist, they can be extracted using a deep learning algorithm, and they bear an interesting semblance to the DNA double helix model. The intra-protein binding peptide fragments have specific and intrinsic sequence patterns, distinct from non-binding peptide fragments, and multi-millions of binding and non-binding peptide fragments from currently available protein X-ray structures are classified with an accuracy of up to 93%. The specific binding between short peptide fragments may provide an important driving force for protein folding and protein-protein interaction, two open and fundamental problems in molecular biology, and it may have significant potential in design, discovery, and development of peptide, protein, and antibody drugs.

SUBMITTER: Wang Y 

PROVIDER: S-EPMC5668431 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Specific and intrinsic sequence patterns extracted by deep learning from intra-protein binding and non-binding peptide fragments.

Wang Yuhong Y   Huang Junzhou J   Li Wei W   Wang Sheng S   Ding Chuanfan C  

Scientific reports 20171102 1


The key finding in the DNA double helix model is the specific pairing or binding between nucleotides A-T and C-G, and the pairing rules are the molecule basis of genetic code. Unfortunately, no such rules have been discovered for proteins. Here we show that intrinsic sequence patterns between intra-protein binding peptide fragments exist, they can be extracted using a deep learning algorithm, and they bear an interesting semblance to the DNA double helix model. The intra-protein binding peptide  ...[more]

Similar Datasets

| S-EPMC7570975 | biostudies-literature
| S-EPMC5698827 | biostudies-literature
| S-EPMC9907221 | biostudies-literature
| S-EPMC9120437 | biostudies-literature
| S-EPMC6084614 | biostudies-literature
| S-EPMC10786417 | biostudies-literature
2023-07-10 | GSE221870 | GEO
| S-EPMC6612801 | biostudies-other
2024-02-03 | GSE254493 | GEO
| S-EPMC10045089 | biostudies-literature