Dataset Information

Joint embedding VQA model based on dynamic word vector.

ABSTRACT: Existing joint embedding Visual Question Answering (VQA) models use different combinations of image characterization, text characterization, and feature fusion methods, but all of them use static word vectors for text characterization. In real language use, however, the same word may carry different meanings in different contexts and may also serve as different grammatical components. Static word vectors cannot express these differences effectively, so semantic and grammatical deviations may arise. To solve this problem, the article constructs a joint embedding model based on dynamic word vectors, the none KB-Specific network (N-KBSN) model, which differs from the commonly used VQA models based on static word vectors. The N-KBSN model consists of three main parts: a question text and image feature extraction module, a self-attention and guided-attention module, and a feature fusion and classifier module. Its key components are image characterization based on Faster R-CNN, text characterization based on ELMo, and feature enhancement based on a multi-head attention mechanism. The experimental results show that the N-KBSN model outperforms the 2017-winner (GloVe) model and the 2019-winner (GloVe) model. The introduction of dynamic word vectors improves the accuracy of the overall results.
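The contrast between static and dynamic word vectors described in the abstract can be illustrated with a minimal sketch. This is not the N-KBSN model and not ELMo (which derives contextual vectors from a bidirectional language model); it swaps in a toy dot-product self-attention step purely to show context dependence. The embedding table `STATIC`, its hand-picked 2-d values, and the helper names `static_vectors` and `contextual_vectors` are all hypothetical.

```python
import math

# Hypothetical 2-d static embeddings, chosen by hand for illustration only.
STATIC = {
    "river": [1.0, 0.0],
    "bank": [0.5, 0.5],
    "money": [0.0, 1.0],
}

def static_vectors(tokens):
    """Look up one fixed vector per word, regardless of context."""
    return [STATIC[t] for t in tokens]

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def contextual_vectors(tokens):
    """Toy stand-in for a dynamic embedding: each word's vector becomes an
    attention-weighted average of all word vectors in the same sentence."""
    vecs = static_vectors(tokens)
    dim = len(vecs[0])
    out = []
    for q in vecs:
        # Scaled dot-product score between this word and every word.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in vecs]
        weights = softmax(scores)
        out.append([sum(w * v[i] for w, v in zip(weights, vecs)) for i in range(dim)])
    return out

s1 = ["river", "bank"]
s2 = ["money", "bank"]

# The static lookup gives "bank" the same vector in both sentences.
static_same = static_vectors(s1)[1] == static_vectors(s2)[1]

# The context-dependent vectors for "bank" differ between the sentences.
ctx_bank_1 = contextual_vectors(s1)[1]
ctx_bank_2 = contextual_vectors(s2)[1]
```

In this sketch the static lookup cannot distinguish the two uses of "bank", while the context-mixed vector shifts toward "river" in one sentence and toward "money" in the other.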

SUBMITTER: Ma Z

PROVIDER: S-EPMC7959642 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

Publications

Joint embedding VQA model based on dynamic word vector.

Ma Zhiyang, Zheng Wenfeng, Chen Xiaobing, Yin Lirong

PeerJ Computer Science, 2021-03-03

PMID: 33817003

Similar Datasets

Fold2Seq: A Joint Sequence(1D)-Fold(3D) Embedding-based Generative Model for Protein Design. | S-EPMC8375603 | biostudies-literature

pLMSNOSite: an ensemble-based approach for predicting protein S-nitrosylation sites by integrating supervised word embedding and embedding from pre-trained protein language model. | S-EPMC9909867 | biostudies-literature

Measuring novelty in science with word embedding. | S-EPMC8253414 | biostudies-literature

Identify novel elements of knowledge with word embedding. | S-EPMC10281565 | biostudies-literature

TALE: Transformer-based protein function Annotation with joint sequence-Label Embedding. | S-EPMC8479653 | biostudies-literature

Predicting the host of influenza viruses based on the word vector. | S-EPMC5518728 | biostudies-literature

FrameAxis: characterizing microframe bias and intensity with word embedding. | S-EPMC8323720 | biostudies-literature

Impact analysis of keyword extraction using contextual word embedding. | S-EPMC9202614 | biostudies-literature

Scoring alignments by embedding vector similarity. | S-EPMC11063651 | biostudies-literature