Dataset Information

Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors.


ABSTRACT: Discriminating protein secondary structures is crucial for understanding the biological function of proteins. It is not generally possible to invert spectroscopic data to yield the structure. We present a machine learning protocol that uses two-dimensional UV (2DUV) spectra as pattern recognition descriptors, aiming at automated protein secondary structure determination from spectroscopic features. Accurate secondary structure recognition is obtained for homologous (97%) and nonhomologous (91%) protein segments, randomly selected from simulated model datasets. The advantage of 2DUV descriptors over one-dimensional linear absorption and circular dichroism spectra lies in the cross-peak information, which reflects interactions between local regions of the protein. Because 2DUV measurements are ultrafast (∼200 fs), they could be used in the future to probe conformational variations in the course of protein dynamics.

SUBMITTER: Ren H 

PROVIDER: S-EPMC9171355 | biostudies-literature | 2022 May

REPOSITORIES: biostudies-literature

Publications

Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors.

Ren Hao, Zhang Qian, Wang Zhengjie, Zhang Guozhen, Liu Hongzhang, Guo Wenyue, Mukamel Shaul, Jiang Jun

Proceedings of the National Academy of Sciences of the United States of America, 2022-04-27, issue 18


Similar Datasets

| S-EPMC11228460 | biostudies-literature
| S-EPMC11641695 | biostudies-literature
| S-EPMC10316327 | biostudies-literature
| S-EPMC7043407 | biostudies-literature
2021-06-02 | GSE175942 | GEO
| S-EPMC8242018 | biostudies-literature
| S-EPMC8363013 | biostudies-literature
| S-EPMC8715543 | biostudies-literature
| S-EPMC10401178 | biostudies-literature
| S-EPMC11275739 | biostudies-literature