Unknown

Dataset Information

0

Uncovering the important acoustic features for detecting vocal fold paralysis with explainable machine learning.


ABSTRACT:

Objective

To detect unilateral vocal fold paralysis (UVFP) from voice recordings using an explainable model of machine learning.

Study design

Case series - retrospective with a control group.

Setting

Tertiary care laryngology practice between 2009 to 2019.

Methods

Patients with confirmed UVFP through endoscopic examination (N=77) and controls with normal voices matched for age and sex (N=77) were included. Two tasks were used to elicit voice samples: reading the Rainbow Passage and sustaining phonation of the vowel "a". The 88 extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS) features were extracted as inputs for four machine learning models of differing complexity. SHAP was used to identify important features.

Results

The median bootstrapped Area Under the Receiver Operating Characteristic Curve (ROC AUC) score ranged from 0.79 to 0.87 depending on model and task. After removing redundant features for explainability, the highest median ROC AUC score was 0.84 using only 13 features for the vowel task and 0.87 using 39 features for the reading task. The most important features included intensity measures, mean MFCC1, mean F1 amplitude and frequency, and shimmer variability depending on model and task.

Conclusion

Using the largest dataset studying UVFP to date, we achieve high performance from just a few seconds of voice recordings. Notably, we demonstrate that while similar categories of features related to vocal fold physiology were conserved across models, the models used different combinations of features and still achieved similar effect sizes. Machine learning thus provides a mechanism to detect UVFP and contextualize the accuracy relative to both model architecture and pathophysiology.

SUBMITTER: Low DM 

PROVIDER: S-EPMC7836138 | biostudies-literature | 2021 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identifying bias in models that detect vocal fold paralysis from audio recordings using explainable machine learning and clinician ratings.

Low Daniel M DM   Rao Vishwanatha V   Randolph Gregory G   Song Phillip C PC   Ghosh Satrajit S SS  

medRxiv : the preprint server for health sciences 20240320


<h4>Introduction</h4>Detecting voice disorders from voice recordings could allow for frequent, remote, and low-cost screening before costly clinical visits and a more invasive laryngoscopy examination. Our goals were to detect unilateral vocal fold paralysis (UVFP) from voice recordings using machine learning, to identify which acoustic variables were important for prediction to increase trust, and to determine model performance relative to clinician performance.<h4>Methods</h4>Patients with con  ...[more]

Similar Datasets

| S-EPMC11851322 | biostudies-literature
| S-EPMC8688980 | biostudies-literature
| S-EPMC5546233 | biostudies-other
| S-EPMC8525866 | biostudies-literature
| S-EPMC10446277 | biostudies-literature
| S-EPMC8171271 | biostudies-literature
| S-EPMC10151348 | biostudies-literature
| S-EPMC9764743 | biostudies-literature
| S-EPMC9354550 | biostudies-literature
| S-EPMC9410197 | biostudies-literature