Dataset Information

Interactive Natural Language Grounding via Referring Expression Comprehension and Scene Graph Parsing.

ABSTRACT: Natural language provides an intuitive and effective interaction interface between human beings and robots. Multiple approaches have been proposed to address natural language visual grounding for human-robot interaction. However, most existing approaches resolve the ambiguity of natural language queries and ground target objects via dialogue systems, which makes the interactions cumbersome and time-consuming. In contrast, we address interactive natural language grounding without auxiliary information. Specifically, we first propose a referring expression comprehension network to ground natural referring expressions. This network extracts visual semantics via a visual semantic-aware network and exploits the rich linguistic context in expressions via a language attention network. Furthermore, we combine the referring expression comprehension network with scene graph parsing to ground unrestricted and complicated natural language queries. Finally, we validate the performance of the referring expression comprehension network on three public datasets, and we evaluate the effectiveness of the interactive natural language grounding architecture by grounding extensive natural language queries in different household scenarios.

SUBMITTER: Mi J

PROVIDER: S-EPMC7331387 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

Publications

Interactive Natural Language Grounding via Referring Expression Comprehension and Scene Graph Parsing.

Jinpeng Mi, Jianzhi Lyu, Song Tang, Qingdu Li, Jianwei Zhang

Frontiers in Neurorobotics, 2020-06-25

PMID: 32670046

Similar Datasets

Intention-Related Natural Language Grounding via Object Affordance Detection and Intention Semantic Extraction. | S-EPMC7238763 | biostudies-literature

Graph theoretical analysis of functional network for comprehension of sign language. | S-EPMC7061525 | biostudies-literature

A psycholinguistic model of natural language parsing implemented in simulated neurons. | S-EPMC2777190 | biostudies-other

A hierarchy of linguistic predictions during natural language comprehension. | S-EPMC9371745 | biostudies-literature

Natural language processing systems for pathology parsing in limited data environments with uncertainty estimation. | S-EPMC7751177 | biostudies-literature

Dynamic EEG analysis during language comprehension reveals interactive cascades between perceptual processing and sentential expectations. | S-EPMC7682806 | biostudies-literature

Scene Graph Prediction with Limited Labels. | S-EPMC7098690 | biostudies-literature

Cortical Surround Interactions and Perceptual Salience via Natural Scene Statistics. | S-EPMC3291533 | biostudies-other

INDRA-IPM: interactive pathway modeling using natural language with automated assembly. | S-EPMC6821420 | biostudies-literature

NLPReViz: an interactive tool for natural language processing on clinical text. | S-EPMC6381768 | biostudies-literature