Unknown

Dataset Information

0

A machine learning strategy for the identification of key in silico descriptors and prediction models for IgG monoclonal antibody developability properties.


ABSTRACT: Identification of favorable biophysical properties for protein therapeutics as part of developability assessment is a crucial part of the preclinical development process. Successful prediction of such properties and bioassay results from calculated in silico features has potential to reduce the time and cost of delivering clinical-grade material to patients, but nevertheless has remained an ongoing challenge to the field. Here, we demonstrate an automated and flexible machine learning workflow designed to compare and identify the most powerful features from computationally derived physiochemical feature sets, generated from popular commercial software packages. We implement this workflow with medium-sized datasets of human and humanized IgG molecules to generate predictive regression models for two key developability endpoints, hydrophobicity and poly-specificity. The most important features discovered through the automated workflow corroborate several previous literature reports, and newly discovered features suggest directions for further research and potential model improvement.

SUBMITTER: Waight AB 

PROVIDER: S-EPMC10448975 | biostudies-literature | 2023 Jan-Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

A machine learning strategy for the identification of key <i>in silico</i> descriptors and prediction models for IgG monoclonal antibody developability properties.

Waight Andrew B AB   Prihoda David D   Shrestha Rojan R   Metcalf Kevin K   Bailly Marc M   Ancona Marco M   Widatalla Talal T   Rollins Zachary Z   Cheng Alan C AC   Bitton Danny A DA   Fayadat-Dilman Laurence L  

mAbs 20230101 1


Identification of favorable biophysical properties for protein therapeutics as part of developability assessment is a crucial part of the preclinical development process. Successful prediction of such properties and bioassay results from calculated <i>in silico</i> features has potential to reduce the time and cost of delivering clinical-grade material to patients, but nevertheless has remained an ongoing challenge to the field. Here, we demonstrate an automated and flexible machine learning wor  ...[more]

Similar Datasets

| S-EPMC10038059 | biostudies-literature
| S-EPMC6380425 | biostudies-literature
| S-EPMC11901398 | biostudies-literature
| S-EPMC5384706 | biostudies-literature
| S-EPMC3437440 | biostudies-literature
| S-EPMC5540074 | biostudies-literature
| S-EPMC5647622 | biostudies-literature
| S-EPMC552959 | biostudies-literature
| S-EPMC1069632 | biostudies-literature
| S-EPMC5290958 | biostudies-literature