Unknown

Dataset Information

0

Determination of protein fold class from Raman or Raman optical activity spectra using random forests.


ABSTRACT: Knowledge of the fold class of a protein is valuable because fold class gives an indication of protein function and evolution. Fold class can be accurately determined from a crystal structure or NMR structure, though these methods are expensive, time-consuming, and inapplicable to all proteins. In contrast, vibrational spectra [infra-red, Raman, or Raman optical activity (ROA)] are rapidly obtained for proteins under wide range of biological molecules under diverse experimental and physiological conditions. Here, we show that the fold class of a protein can be determined from Raman or ROA spectra by converting a spectrum into data of 10 cm(-1) bin widths and applying the random forest machine learning algorithm. Spectral data from 605 and 1785 cm(-1) were analyzed, as well as the amide I, II, and III regions in isolation and in combination. ROA amide II and III data gave the best performance, with 33 of 44 proteins assigned to one of the correct four top-level structural classification of proteins (SCOP) fold class (all ?, all ?, ? and ?, and disordered). The method also shows which spectral regions are most valuable in assigning fold class.

SUBMITTER: Kinalwa M 

PROVIDER: S-EPMC3218359 | biostudies-literature | 2011 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Determination of protein fold class from Raman or Raman optical activity spectra using random forests.

Kinalwa Myra M   Blanch Ewan W EW   Doig Andrew J AJ  

Protein science : a publication of the Protein Society 20110818 10


Knowledge of the fold class of a protein is valuable because fold class gives an indication of protein function and evolution. Fold class can be accurately determined from a crystal structure or NMR structure, though these methods are expensive, time-consuming, and inapplicable to all proteins. In contrast, vibrational spectra [infra-red, Raman, or Raman optical activity (ROA)] are rapidly obtained for proteins under wide range of biological molecules under diverse experimental and physiological  ...[more]

Similar Datasets

| S-EPMC2335306 | biostudies-literature
| S-EPMC6686255 | biostudies-literature
| S-EPMC6370055 | biostudies-literature
| S-EPMC5834538 | biostudies-literature
| S-EPMC4148397 | biostudies-literature
| S-EPMC7387700 | biostudies-literature
| S-EPMC2651179 | biostudies-literature
| S-EPMC3018816 | biostudies-other
| S-EPMC4232575 | biostudies-literature
| S-EPMC3750505 | biostudies-literature