Dataset Information

Expert-augmented machine learning.

ABSTRACT: Machine learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption is limited by the level of trust afforded by given models. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may involve combining the complementary strengths of humans and machines. Here, we present expert-augmented machine learning (EAML), an automated method that guides the extraction of expert knowledge and its integration into machine-learned models. We used a large dataset of intensive-care patient data to derive 126 decision rules that predict hospital mortality. Using an online platform, we asked 15 clinicians to assess the relative risk of the subpopulation defined by each rule compared to the total sample. We compared the clinician-assessed risk to the empirical risk and found that, while clinicians agreed with the data in most cases, there were notable exceptions where they overestimated or underestimated the true risk. Studying the rules with greatest disagreement, we identified problems with the training data, including one miscoded variable and one hidden confounder. Filtering the rules based on the extent of disagreement between clinician-assessed risk and empirical risk, we improved performance on out-of-sample data and were able to train with less data. EAML provides a platform for automated creation of problem-specific priors, which help build robust and dependable machine-learning models in critical applications.
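The rule-filtering step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes each rule carries an empirical risk and a mean clinician rating, and it measures disagreement as the gap between the two rank orderings. The names `Rule`, `expert_score`, `filter_rules`, and `max_rank_gap`, and all of the numbers, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Rule:
    name: str
    empirical_risk: float  # mortality rate observed in the rule's subpopulation
    expert_score: float    # mean clinician rating (hypothetical scale, higher = riskier)


def rank(values):
    """Return 0-based ranks of the values; ties are broken by input order."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0] * len(values)
    for position, index in enumerate(order):
        ranks[index] = position
    return ranks


def filter_rules(rules, max_rank_gap):
    """Keep rules whose empirical and expert ranks differ by at most max_rank_gap."""
    empirical_ranks = rank([r.empirical_risk for r in rules])
    expert_ranks = rank([r.expert_score for r in rules])
    return [
        rule
        for rule, e, x in zip(rules, empirical_ranks, expert_ranks)
        if abs(e - x) <= max_rank_gap
    ]


# Made-up example: rule "C" looks high-risk in the data but low-risk to clinicians.
rules = [
    Rule("A", empirical_risk=0.05, expert_score=1.0),
    Rule("B", empirical_risk=0.10, expert_score=2.0),
    Rule("C", empirical_risk=0.30, expert_score=0.0),
    Rule("D", empirical_risk=0.20, expert_score=3.0),
]
kept = filter_rules(rules, max_rank_gap=1)
print([r.name for r in kept])
```

In this toy data, rule "C" is dropped because its empirical rank and expert rank are far apart. The abstract notes that inspecting such high-disagreement rules revealed a miscoded variable and a hidden confounder in the training data.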

SUBMITTER: Gennatas ED

PROVIDER: S-EPMC7060733 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

Publications

Expert-augmented machine learning.

Efstathios D. Gennatas, Jerome H. Friedman, Lyle H. Ungar, Romain Pirracchio, Eric Eaton, Lara G. Reichmann, Yannet Interian, José Marcio Luna, Charles B. Simone, Andrew Auerbach, Elier Delgado, Mark J. van der Laan, Timothy D. Solberg, Gilmer Valdes

Proceedings of the National Academy of Sciences of the United States of America, 2020 Feb 18, issue 9

PMID: 32071251

Similar Datasets

Expert-augmented automated machine learning optimizes hemodynamic predictors of spinal cord injury outcome.
| S-EPMC8989303 | biostudies-literature

Expert-enhanced machine learning for cardiac arrhythmia classification.
| S-EPMC8699667 | biostudies-literature

Descriptor-augmented machine learning for enzyme-chemical interaction predictions
| S-EPMC10915406 | biostudies-literature

Perceptual metrics for odorants: Learning from non-expert similarity feedback using machine learning.
| S-EPMC10631653 | biostudies-literature

Machine learning-augmented surface-enhanced spectroscopy toward next-generation molecular diagnostics.
| S-EPMC9890940 | biostudies-literature

Using Expert Driven Machine Learning to Enhance Dynamic Metabolomics Data Analysis.
| S-EPMC6468718 | biostudies-literature

Estimating Visibility of Annotations for View Management in Spatial Augmented Reality Based on Machine-Learning Techniques.
| S-EPMC6412218 | biostudies-other

An Expert Diagnosis System for Parkinson Disease Based on Genetic Algorithm-Wavelet Kernel-Extreme Learning Machine.
| S-EPMC4871978 | biostudies-other

Prediction of Breast Cancer Estrogen Receptor Status using Machine Learning
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress

Machine Learning augmented docking studies of aminothioureas at the SARS-CoV-2-ACE2 interface.
| S-EPMC8428716 | biostudies-literature