Unknown

Dataset Information

0

SIMON, an Automated Machine Learning System, Reveals Immune Signatures of Influenza Vaccine Responses.


ABSTRACT: Machine learning holds considerable promise for understanding complex biological processes such as vaccine responses. Capturing interindividual variability is essential to increase the statistical power necessary for building more accurate predictive models. However, available approaches have difficulty coping with incomplete datasets which is often the case when combining studies. Additionally, there are hundreds of algorithms available and no simple way to find the optimal one. In this study, we developed Sequential Iterative Modeling "OverNight" (SIMON), an automated machine learning system that compares results from 128 different algorithms and is particularly suitable for datasets containing many missing values. We applied SIMON to data from five clinical studies of seasonal influenza vaccination. The results reveal previously unrecognized CD4+ and CD8+ T cell subsets strongly associated with a robust Ab response to influenza Ags. These results demonstrate that SIMON can greatly speed up the choice of analysis modalities. Hence, it is a highly useful approach for data-driven hypothesis generation from disparate clinical datasets. Our strategy could be used to gain biological insight from ever-expanding heterogeneous datasets that are publicly available.

SUBMITTER: Tomic A 

PROVIDER: S-EPMC6643048 | biostudies-literature | 2019 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

SIMON, an Automated Machine Learning System, Reveals Immune Signatures of Influenza Vaccine Responses.

Tomic Adriana A   Tomic Ivan I   Rosenberg-Hasson Yael Y   Dekker Cornelia L CL   Maecker Holden T HT   Davis Mark M MM  

Journal of immunology (Baltimore, Md. : 1950) 20190614 3


Machine learning holds considerable promise for understanding complex biological processes such as vaccine responses. Capturing interindividual variability is essential to increase the statistical power necessary for building more accurate predictive models. However, available approaches have difficulty coping with incomplete datasets which is often the case when combining studies. Additionally, there are hundreds of algorithms available and no simple way to find the optimal one. In this study,  ...[more]

Similar Datasets

| S-EPMC10883951 | biostudies-literature
| S-EPMC6269591 | biostudies-literature
| S-EPMC6215005 | biostudies-literature
| S-EPMC4521188 | biostudies-other
| S-EPMC6774277 | biostudies-literature
2022-05-20 | GSE203423 | GEO
| S-EPMC7934923 | biostudies-literature
2023-10-19 | GSE230494 | GEO
| S-EPMC5736694 | biostudies-other
2019-11-07 | GSE124203 | GEO