Dataset Information

Hierarchical Spatial Concept Formation Based on Multimodal Information for Human Support Robots.

ABSTRACT: In this paper, we propose a hierarchical spatial concept formation method based on a Bayesian generative model with multimodal information, e.g., vision, position, and word information. Since humans have the ability to select an appropriate level of abstraction according to the situation and describe their position linguistically, e.g., "I am in my home" and "I am in front of the table," a hierarchical structure of spatial concepts is necessary in order for human support robots to communicate smoothly with users. The proposed method enables a robot to form hierarchical spatial concepts by categorizing multimodal information using hierarchical multimodal latent Dirichlet allocation (hMLDA). Object recognition results from a convolutional neural network (CNN), hierarchical k-means clustering results of self-positions estimated by Monte Carlo localization (MCL), and a set of location names are used, respectively, as features for vision, position, and word information. Experiments in forming hierarchical spatial concepts and evaluating how well the proposed method can predict unobserved location names and position categories are performed using a robot in the real world. Results verify that, relative to comparable baseline methods, the proposed method enables a robot to predict location names and position categories closer to predictions made by humans. As an application example of the proposed method in a home environment, a human support robot is shown moving to an instructed place in response to human speech instructions, using the formed hierarchical spatial concepts.
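
The abstract names hierarchical k-means clustering of estimated self-positions as the source of the position features. The following is a minimal sketch of that general idea only, not the authors' implementation: the function names, the 2-D point representation, the branching factor k, the depth, and the farthest-first initialization (chosen here so the result is deterministic) are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def _sq_dist(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two 2-D points."""
    return (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2


def _init_centers(points: Sequence[Point], k: int) -> List[Point]:
    """Farthest-first initialization (an assumption, used for determinism)."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(_sq_dist(p, c) for c in centers)))
    return centers


def kmeans(points: Sequence[Point], k: int, iters: int = 20) -> List[int]:
    """Plain Lloyd's k-means; returns one cluster index per point."""
    centers = _init_centers(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point goes to its nearest center.
        labels = [min(range(k), key=lambda c: _sq_dist(p, centers[c])) for p in points]
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = (sum(m[0] for m in members) / len(members),
                              sum(m[1] for m in members) / len(members))
    return labels


def hierarchical_kmeans(points: Sequence[Point], k: int = 2, depth: int = 2) -> List[Tuple[int, ...]]:
    """Return, for each point, its path of cluster indices from coarse to fine."""
    paths: List[List[int]] = [[] for _ in points]

    def split(indices: List[int], level: int) -> None:
        # Stop at the requested depth or when too few points remain to split.
        if level == depth or len(indices) < k:
            return
        labels = kmeans([points[i] for i in indices], k)
        for c in range(k):
            child = [i for i, lab in zip(indices, labels) if lab == c]
            for i in child:
                paths[i].append(c)
            split(child, level + 1)

    split(list(range(len(points))), 0)
    return [tuple(p) for p in paths]


if __name__ == "__main__":
    # Hypothetical robot positions: two well-separated areas of a floor plan.
    positions = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0),
                 (10.0, 10.0), (10.0, 11.0), (11.0, 10.0), (11.0, 11.0)]
    for pos, path in zip(positions, hierarchical_kmeans(positions)):
        print(pos, path)
```

Each returned path reads from coarse to fine, so a shared first index places two positions in the same top-level region. Paths can be shorter than the requested depth when a branch holds fewer than k points.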

SUBMITTER: Hagiwara Y

PROVIDER: S-EPMC5859180 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

Publications

Hierarchical Spatial Concept Formation Based on Multimodal Information for Human Support Robots.

Hagiwara Y, Inoue M, Kobayashi H, Taniguchi T

Frontiers in Neurorobotics, 2018-03-13

PMID: 29593521

Similar Datasets

Project description:Assisting individuals in their daily activities through autonomous mobile robots is a significant concern, especially for users without specialized knowledge. Specifically, the capability of a robot to navigate to destinations based on human speech instructions is crucial. Although robots can take different paths toward the same objective, the shortest path is not always the most suitable. A preferred approach would be to accommodate waypoint specifications flexibly for planning an improved alternative path even with detours. Furthermore, robots require real-time inference capabilities. In this sense, spatial representations include semantic, topological, and metric-level representations, each capturing different aspects of the environment. This study aimed to realize a hierarchical spatial representation using a topometric semantic map and path planning with speech instructions by including waypoints. Thus, we present a hierarchical path planning method called spatial concept-based topometric semantic mapping for hierarchical path planning (SpCoTMHP), which integrates place connectivity. This approach provides a novel integrated probabilistic generative model and fast approximate inferences with interactions among the hierarchy levels. A formulation based on "control as probabilistic inference" theoretically supports the proposed path planning algorithm. We conducted experiments in a home environment using the Toyota human support robot on the SIGVerse simulator and in a lab-office environment with the real robot Albert. Here, the user issues speech commands that specify the waypoint and goal, such as "Go to the bedroom via the corridor." 
Navigation experiments were performed using speech instructions with a waypoint to demonstrate the performance improvement of SpCoTMHP over the baseline hierarchical path planning method with heuristic path costs (HPP-I) in terms of the weighted success rate at which the robot reaches the closest target (0.590) and passes the correct waypoints. In advanced tasks, the computation time with SpCoTMHP was 7.14 s shorter than with the baseline HPP-I. Hierarchical spatial representations thus provide mutually understandable instruction forms for both humans and robots, enabling language-based navigation.

Project description:One of the core issues of ecology is to understand the effects of landscape patterns on ecological processes. For this, we need to accurately capture changes in the fine landscape structures to avoid losing information about spatial heterogeneity. Landscape pattern indicators (LPIs) can characterize spatial structures and give some information about landscape patterns. However, research on LPIs has mainly focused on the horizontal structure of landscape patterns, while few studies have addressed vertical relationships between the levels of hierarchical landscape structures. Ignoring the vertical hierarchical relationships may cause serious biases and reduce LPIs' representational ability and accuracy. The hierarchy theory of landscape pattern structures could notably reduce the loss of hierarchical information, and information entropy could quantitatively describe the vertical status of landscape units. Therefore, we established a new multidimensional fusion method for LPIs based on hierarchy theory and information entropy. Here, we created a general fusion formula for commonly used simple LPIs based on two-grade land use data (whose land use classification system contains two grades/levels) and derived 3 fusion landscape pattern indicators (FLIs) with a case study. The results show that the information about fine spatial structure is captured by the fusion method. The regions with the largest differences between the FLIs and the traditional LPIs are those with the largest vertical structure, such as ecological ecotones, where vertical structure was previously ignored. The FLIs have a finer spatial representational ability and accuracy, not only retaining the main trend information of first-grade land use data, but also containing the internal detail information of second-grade land use data.
Capturing finer spatial information of landscape patterns should encourage the application of the fusion method, which should be suitable for more LPIs or higher-dimensional data. The increased accuracy of FLIs will improve ecological models that rely on finer spatial information.

Project description:Adaptive information-sampling approaches enable efficient selection of mobile robots' waypoints through which the accurate sensing and mapping of a physical process, such as the radiation or field intensity, can be obtained. A key parameter in the informative sampling objective function can be optimized to balance the need to explore new information where the uncertainty is very high against the need to exploit the data sampled so far, with which a great deal of the underlying spatial fields can be obtained, such as the source locations or modalities of the physical process. However, works in the literature have either assumed the robot's energy is unconstrained or used a homogeneous availability of energy capacity among different robots. Therefore, this paper analyzes the impact of the adaptive information-sampling algorithm's information function used in exploration and exploitation to achieve a tradeoff among the mapping, localization, and energy efficiency objectives. We use Gaussian process regression (GPR) to predict and estimate confidence bounds, thereby determining each point's informativeness. Through extensive experimental data, we provide a deeper and holistic perspective on the effect of information function parameters on the prediction map's accuracy (RMSE), confidence bound (variance), energy consumption (distance), and time spent (sample count) in both single- and multi-robot scenarios. The results provide meaningful insights into choosing the appropriate energy-aware information function parameters based on sensing objectives (e.g., source localization or mapping). Based on our analysis, we can conclude that it would be detrimental to give importance only to the uncertainty of the information function (which would explode the energy needs) or to the predictive mean of the information (which would jeopardize the mapping accuracy).
By assigning more importance to the information uncertainty with some non-zero importance to the information value (e.g., 75:25 ratio), it is possible to achieve an optimal tradeoff between exploration and exploitation objectives while keeping the energy requirements manageable.

Project description:Glioblastoma, IDH wild type (GBM) is a primary brain cancer with a poor prognosis and few effective therapies. The ability to investigate the tumor and its microenvironment during treatment would greatly enhance understanding of disease response, progression and impact of therapeutics. Stereotactic needle core biopsies are routine surgical procedures performed primarily for tumor diagnosis. Use of these biopsies for investigations into tumor and microenvironmental responses to treatment is rarely performed but holds great potential to support therapeutic monitoring and understanding of tumor evolution. However, it is unclear whether needle core biopsies are sufficient for multi-omics analysis and tumor models. Here we test the depth of data generation possible from stereotactic needle core biopsy tissue in two separate patients. In the first patient with recurrent GBM we performed highly resolved multi-omics analyses including single cell RNA sequencing, spatial-transcriptomics, metabolomics, proteomics, phosphoproteomics, T-cell clonotype analysis, and MHC Class I immunopeptidomics from biopsy tissue obtained from a single procedure. In a second patient we analyzed multi-regional core biopsies to decipher spatial and genomic variance and quantify intra and inter-sample heterogeneity. We also investigated the utility of stereotactic biopsies as a method for generating patient derived xenograft (PDX) models in a separate patient cohort. Dataset integration across modalities highlighted spatially mapped immune cell associated metabolic pathways, revealed poor correlation between RNA expression and the tumor MHC Class I immunopeptidome, and validated inferred cell-cell ligand receptor interactions. 
In conclusion, stereotactic needle biopsy cores are of sufficient quality to generate multi-omics data and PDX models, provide data rich insight into a patient’s disease process and tumor immune microenvironment and can be of value in evaluating treatment responses.

Project description:Scientific research is shedding light on the interaction of the gut microbiome with the human host and on its role in human health. Existing machine learning methods have shown great potential in discriminating healthy from diseased microbiome states. Most of them leverage shotgun metagenomic sequencing to extract gut microbial species-relative abundances or strain-level markers. Each of these gut microbial profiling modalities showed diagnostic potential when tested separately; however, no existing approach combines them in a single predictive framework. Here, we propose the Multimodal Variational Information Bottleneck (MVIB), a novel deep learning model capable of learning a joint representation of multiple heterogeneous data modalities. MVIB achieves competitive classification performance while being faster than existing methods. Additionally, MVIB offers interpretable results. Our model adopts an information theoretic interpretation of deep neural networks and computes a joint stochastic encoding of different input data modalities. We use MVIB to predict whether human hosts are affected by a certain disease by jointly analysing gut microbial species-relative abundances and strain-level markers. MVIB is evaluated on human gut metagenomic samples from 11 publicly available disease cohorts covering 6 different diseases. We achieve high performance (0.80 < ROC AUC < 0.95) on 5 cohorts and at least medium performance on the remaining ones. We adopt a saliency technique to interpret the output of MVIB and identify the most relevant microbial species and strain-level markers to the model's predictions. We also perform cross-study generalisation experiments, where we train and test MVIB on different cohorts of the same disease, and overall we achieve comparable results to the baseline approach, i.e. the Random Forest. Further, we evaluate our model by adding metabolomic data derived from mass spectrometry as a third input modality. 
Our method is scalable with respect to input data modalities and has an average training time of < 1.4 seconds. The source code and the datasets used in this work are publicly available.
