Unknown

Dataset Information

0

The CANDOR corpus: Insights from a large multimodal dataset of naturalistic conversation.


ABSTRACT: People spend a substantial portion of their lives engaged in conversation, and yet, our scientific understanding of conversation is still in its infancy. Here, we introduce a large, novel, and multimodal corpus of 1656 conversations recorded in spoken English. This 7+ million word, 850-hour corpus totals more than 1 terabyte of audio, video, and transcripts, with moment-to-moment measures of vocal, facial, and semantic expression, together with an extensive survey of speakers' postconversation reflections. By taking advantage of the considerable scope of the corpus, we explore many examples of how this large-scale public dataset may catalyze future research, particularly across disciplinary boundaries, as scholars from a variety of fields appear increasingly interested in the study of conversation.

SUBMITTER: Reece A 

PROVIDER: S-EPMC10065445 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

The CANDOR corpus: Insights from a large multimodal dataset of naturalistic conversation.

Reece Andrew A   Cooney Gus G   Bull Peter P   Chung Christine C   Dawson Bryn B   Fitzpatrick Casey C   Glazer Tamara T   Knox Dean D   Liebscher Alex A   Marin Sebastian S  

Science advances 20230331 13


People spend a substantial portion of their lives engaged in conversation, and yet, our scientific understanding of conversation is still in its infancy. Here, we introduce a large, novel, and multimodal corpus of 1656 conversations recorded in spoken English. This 7+ million word, 850-hour corpus totals more than 1 terabyte of audio, video, and transcripts, with moment-to-moment measures of vocal, facial, and semantic expression, together with an extensive survey of speakers' postconversation r  ...[more]

Similar Datasets

| S-EPMC7479607 | biostudies-literature
| S-EPMC8938409 | biostudies-literature
| S-EPMC9424229 | biostudies-literature
| S-EPMC8222356 | biostudies-literature
| S-EPMC5906713 | biostudies-literature
| S-EPMC8156799 | biostudies-literature
| S-EPMC9376869 | biostudies-literature
| S-EPMC10659362 | biostudies-literature
| S-EPMC8479122 | biostudies-literature
| S-EPMC9062263 | biostudies-literature