Ontology highlight
ABSTRACT:
SUBMITTER: Tsai YH
PROVIDER: S-EPMC7195022 | biostudies-literature | 2019 Jul
REPOSITORIES: biostudies-literature
Tsai Yao-Hung Hubert YH Bai Shaojie S Pu Liang Paul P Kolter J Zico JZ Morency Louis-Philippe LP Salakhutdinov Ruslan R
Proceedings of the conference. Association for Computational Linguistics. Meeting 20190701
Human language is often multimodal, which comprehends a mixture of natural language, facial gestures, and acoustic behaviors. However, two major challenges in modeling such multimodal human language time-series data exist: 1) inherent data non-alignment due to variable sampling rates for the sequences from each modality; and 2) long-range dependencies between elements across modalities. In this paper, we introduce the Multimodal Transformer (MulT) to generically address the above issues in an en ...[more]