Unknown

Dataset Information

0

A procedure for estimating gestural scores from speech acoustics.


ABSTRACT: Speech can be represented as a constellation of constricting vocal tract actions called gestures, whose temporal patterning with respect to one another is expressed in a gestural score. Current speech datasets do not come with gestural annotation and no formal gestural annotation procedure exists at present. This paper describes an iterative analysis-by-synthesis landmark-based time-warping architecture to perform gestural annotation of natural speech. For a given utterance, the Haskins Laboratories Task Dynamics and Application (TADA) model is employed to generate a corresponding prototype gestural score. The gestural score is temporally optimized through an iterative timing-warping process such that the acoustic distance between the original and TADA-synthesized speech is minimized. This paper demonstrates that the proposed iterative approach is superior to conventional acoustically-referenced dynamic timing-warping procedures and provides reliable gestural annotation for speech datasets.

SUBMITTER: Nam H 

PROVIDER: S-EPMC3528686 | biostudies-other | 2012 Dec

REPOSITORIES: biostudies-other

Similar Datasets

| S-EPMC5564313 | biostudies-other
| S-EPMC7229780 | biostudies-literature
| S-EPMC11293000 | biostudies-literature
| S-EPMC10852227 | biostudies-literature
| S-EPMC2835806 | biostudies-literature
| S-EPMC5686280 | biostudies-literature
| S-EPMC8575951 | biostudies-literature
| S-EPMC8360444 | biostudies-literature
| S-EPMC8960330 | biostudies-literature
| S-EPMC3508756 | biostudies-literature