Dataset Information

Speech Discrimination in Real-World Group Communication Using Audio-Motion Multimodal Sensing.

ABSTRACT: Speech discrimination that determines whether a participant is speaking at a given moment is essential in investigating human verbal communication. Specifically, in dynamic real-world situations where multiple people participate in, and form, groups in the same space, simultaneous speakers render speech discrimination that is solely based on audio sensing difficult. In this study, we focused on physical activity during speech, and hypothesized that combining audio and physical motion data acquired by wearable sensors can improve speech discrimination. Thus, utterance and physical activity data of students in a university participatory class were recorded, using smartphones worn around their neck. First, we tested the temporal relationship between manually identified utterances and physical motions and confirmed that physical activities in wide-frequency ranges co-occurred with utterances. Second, we trained and tested classifiers for each participant and found a higher performance with the audio-motion classifier (average accuracy 92.2%) than both the audio-only (80.4%) and motion-only (87.8%) classifiers. Finally, we tested inter-individual classification and obtained a higher performance with the audio-motion combined classifier (83.2%) than the audio-only (67.7%) and motion-only (71.9%) classifiers. These results show that audio-motion multimodal sensing using widely available smartphones can provide effective utterance discrimination in dynamic group communications.

SUBMITTER: Nozawa T

PROVIDER: S-EPMC7287755 | biostudies-literature | 2020 May

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Speech Discrimination in Real-World Group Communication Using Audio-Motion Multimodal Sensing.

Nozawa Takayuki T Uchiyama Mizuki M Honda Keigo K Nakano Tamio T Miyake Yoshihiro Y

Sensors (Basel, Switzerland) 20200522 10

Speech discrimination that determines whether a participant is speaking at a given moment is essential in investigating human verbal communication. Specifically, in dynamic real-world situations where multiple people participate in, and form, groups in the same space, simultaneous speakers render speech discrimination that is solely based on audio sensing difficult. In this study, we focused on physical activity during speech, and hypothesized that combining audio and physical motion data acquir ...[more]

PMID: 32456031

Dataset Information

Speech Discrimination in Real-World Group Communication Using Audio-Motion Multimodal Sensing.

Publications

Speech Discrimination in Real-World Group Communication Using Audio-Motion Multimodal Sensing.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Audio-visual speech cue combination.
| S-EPMC2855706 | biostudies-literature

Intermittently tagged real-time MRI reveals internal tongue motion during speech production.
| S-EPMC6510652 | biostudies-literature

Speed change discrimination for motion in depth using constant world and retinal speeds.
| S-EPMC6447190 | biostudies-literature

A multimodal dataset of real world mobility activities in Parkinson's disease.
| S-EPMC10733419 | biostudies-literature

Correlated lip motion and voice audio data.
| S-EPMC6218630 | biostudies-literature

Multimodal Communication in Aphasia: Perception and Production of Co-speech Gestures During Face-to-Face Conversation.
| S-EPMC6010555 | biostudies-literature

Contributions of local speech encoding and functional connectivity to audio-visual speech perception.
| S-EPMC5462535 | biostudies-literature

Students "Tackle" Quantitative Literacy in their Science Communication with Real-World Football Activity.
| S-EPMC5969394 | biostudies-other

SUST Bangla Emotional Speech Corpus (SUBESCO): An audio-only emotional speech corpus for Bangla.
| S-EPMC8087046 | biostudies-literature

A Noninvasive Brain-Computer Interface for Real-Time Speech Synthesis: The Importance of Multimodal Feedback.
| S-EPMC5906041 | biostudies-literature