Unknown

Dataset Information

0

Predicting plant Rubisco kinetics from RbcL sequence data using machine learning.


ABSTRACT: Ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) is responsible for the conversion of atmospheric CO2 to organic carbon during photosynthesis, and often acts as a rate limiting step in the later process. Screening the natural diversity of Rubisco kinetics is the main strategy used to find better Rubisco enzymes for crop engineering efforts. Here, we demonstrate the use of Gaussian processes (GPs), a family of Bayesian models, coupled with protein encoding schemes, for predicting Rubisco kinetics from Rubisco large subunit (RbcL) sequence data. GPs trained on published experimentally obtained Rubisco kinetic datasets were applied to over 9000 sequences encoding RbcL to predict Rubisco kinetic parameters. Notably, our predicted kinetic values were in agreement with known trends, e.g. higher carboxylation turnover rates (Kcat) for Rubisco enzymes from C4 or crassulacean acid metabolism (CAM) species, compared with those found in C3 species. This is the first study demonstrating machine learning approaches as a tool for screening and predicting Rubisco kinetics, which could be applied to other enzymes.

SUBMITTER: Iqbal WA 

PROVIDER: S-EPMC9833099 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting plant Rubisco kinetics from RbcL sequence data using machine learning.

Iqbal Wasim A WA   Lisitsa Alexei A   Kapralov Maxim V MV  

Journal of experimental botany 20230101 2


Ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) is responsible for the conversion of atmospheric CO2 to organic carbon during photosynthesis, and often acts as a rate limiting step in the later process. Screening the natural diversity of Rubisco kinetics is the main strategy used to find better Rubisco enzymes for crop engineering efforts. Here, we demonstrate the use of Gaussian processes (GPs), a family of Bayesian models, coupled with protein encoding schemes, for predicting Rubisco  ...[more]

Similar Datasets

| S-EPMC6150517 | biostudies-other
2021-06-02 | GSE175942 | GEO
| S-EPMC8160335 | biostudies-literature
| S-EPMC11899459 | biostudies-literature
| S-EPMC2949053 | biostudies-literature
| S-EPMC10896787 | biostudies-literature
| S-EPMC8743549 | biostudies-literature
| S-EPMC9525268 | biostudies-literature
| S-EPMC10711018 | biostudies-literature
| S-EPMC10954144 | biostudies-literature