Dataset Information

Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments.

ABSTRACT: Natural speech is processed in the brain as a mixture of auditory and visual features. An example of the importance of visual speech is the McGurk effect and related perceptual illusions that result from mismatching auditory and visual syllables. Although the McGurk effect has widely been applied to the exploration of audio-visual speech processing, it relies on isolated syllables, which severely limits the conclusions that can be drawn from the paradigm. In addition, the extreme variability and the quality of the stimuli usually employed prevents comparability across studies. To overcome these limitations, we present an innovative methodology using 3D virtual characters with realistic lip movements synchronized on computer-synthesized speech. We used commercially accessible and affordable tools to facilitate reproducibility and comparability, and the set-up was validated on 24 participants performing a perception task. Within complete and meaningful French sentences, we paired a labiodental fricative viseme (i.e. /v/) with a bilabial occlusive phoneme (i.e. /b/). This audiovisual mismatch is known to induce the illusion of hearing /v/ in a proportion of trials. We tested the rate of the illusion while varying the magnitude of background noise and audiovisual lag. Overall, the effect was observed in 40% of trials. The proportion rose to about 50% with added background noise and up to 66% when controlling for phonetic features. Our results conclusively demonstrate that computer-generated speech stimuli are judicious, and that they can supplement natural speech with higher control over stimulus timing and content.

SUBMITTER: Theze R

PROVIDER: S-EPMC7511320 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments.

Thézé Raphaël R Gadiri Mehdi Ali MA Albert Louis L Provost Antoine A Giraud Anne-Lise AL Mégevand Pierre P

Scientific reports 20200923 1

Natural speech is processed in the brain as a mixture of auditory and visual features. An example of the importance of visual speech is the McGurk effect and related perceptual illusions that result from mismatching auditory and visual syllables. Although the McGurk effect has widely been applied to the exploration of audio-visual speech processing, it relies on isolated syllables, which severely limits the conclusions that can be drawn from the paradigm. In addition, the extreme variability and ...[more]

PMID: 32968127

Dataset Information

Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments.

Publications

Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Audio-visual speech cue combination.
| S-EPMC2855706 | biostudies-literature

Studying naturalistic human communication using dual-EEG and audio-visual recordings.
| S-EPMC10511849 | biostudies-literature

Contributions of local speech encoding and functional connectivity to audio-visual speech perception.
| S-EPMC5462535 | biostudies-literature

Audio-visual speech timing sensitivity is enhanced in cluttered conditions.
| S-EPMC3071827 | biostudies-literature

Cue integration in categorical tasks: insights from audio-visual speech perception.
| S-EPMC3102664 | biostudies-literature

Memory and visual search in naturalistic 2D and 3D environments.
| S-EPMC4913723 | biostudies-literature

Functional Connectivity of Attention, Visual, and Language Networks During Audio, Illustrated, and Animated Stories in Preschool-Age Children.
| S-EPMC6775495 | biostudies-literature

A Case for Studying Naturalistic Eye and Head Movements in Virtual Environments.
| S-EPMC8759101 | biostudies-literature

Audio-Visual Perception of Gender by Infants Emerges Earlier for Adult-Directed Speech.
| S-EPMC5218491 | biostudies-literature

Semantic Cues Modulate Children's and Adults' Processing of Audio-Visual Face Mask Speech.
| S-EPMC9343587 | biostudies-literature