Dataset Information

Intention-Related Natural Language Grounding via Object Affordance Detection and Intention Semantic Extraction.

ABSTRACT: Similar to specific natural language instructions, intention-related natural language queries also play an essential role in our daily life communication. Inspired by the psychology term "affordance" and its applications in Human-Robot interaction, we propose an object affordance-based natural language visual grounding architecture to ground intention-related natural language queries. Formally, we first present an attention-based multi-visual features fusion network to detect object affordances from RGB images. While fusing deep visual features extracted from a pre-trained CNN model with deep texture features encoded by a deep texture encoding network, the presented object affordance detection network takes into account the interaction of the multi-visual features, and reserves the complementary nature of the different features by integrating attention weights learned from sparse representations of the multi-visual features. We train and validate the attention-based object affordance recognition network on a self-built dataset in which a large number of images originate from MSCOCO and ImageNet. Moreover, we introduce an intention semantic extraction module to extract intention semantics from intention-related natural language queries. Finally, we ground intention-related natural language queries by integrating the detected object affordances with the extracted intention semantics. We conduct extensive experiments to validate the performance of the object affordance detection network and the intention-related natural language queries grounding architecture.

SUBMITTER: Mi J

PROVIDER: S-EPMC7238763 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Intention-Related Natural Language Grounding via Object Affordance Detection and Intention Semantic Extraction.

Mi Jinpeng J Liang Hongzhuo H Katsakis Nikolaos N Tang Song S Li Qingdu Q Zhang Changshui C Zhang Jianwei J

Frontiers in neurorobotics 20200513

Similar to specific natural language instructions, intention-related natural language queries also play an essential role in our daily life communication. Inspired by the psychology term "affordance" and its applications in Human-Robot interaction, we propose an object affordance-based natural language visual grounding architecture to ground intention-related natural language queries. Formally, we first present an attention-based multi-visual features fusion network to detect object affordances ...[more]

PMID: 32477091

Dataset Information

Intention-Related Natural Language Grounding via Object Affordance Detection and Intention Semantic Extraction.

Publications

Intention-Related Natural Language Grounding via Object Affordance Detection and Intention Semantic Extraction.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Grounding human-object interaction to affordance behavior in multimodal datasets.
| S-EPMC9923013 | biostudies-literature

Scalable incident detection via natural language processing and probabilistic language models.
| S-EPMC11461638 | biostudies-literature

Semantic control of feature extraction from natural scenes.
| S-EPMC3913878 | biostudies-literature

Semantic biomedical resource discovery: a Natural Language Processing framework.
| S-EPMC4591066 | biostudies-literature

Left-handers know what's left is right: Handedness and object affordance.
| S-EPMC6655602 | biostudies-literature

Ants combine object affordance with latent learning to make efficient foraging decisions.
| S-EPMC10468611 | biostudies-literature

Semantic Grounding of Novel Spoken Words in the Primary Visual Cortex.
| S-EPMC7959837 | biostudies-literature

Semantic abnormalities in schizophrenia and bipolar disorder: A natural language processing approach.
| S-EPMC11758559 | biostudies-literature

Automated knowledge extraction from polymer literature using natural language processing.
| S-EPMC7797509 | biostudies-literature

Design considerations for a hierarchical semantic compositional framework for medical natural language understanding.
| S-EPMC10019629 | biostudies-literature