A Lightweight Visual Question Answering Model based on Semantic Similarity

10.1145/3490725.3490736; 2021

Author(s): Zhiming He, Jingping Zeng

Keyword(s): Semantic Similarity, Question Answering, Visual Question Answering

Visual question answering model based on graph neural network and contextual attention

Image and Vision Computing; 10.1016/j.imavis.2021.104165; 2021; pp. 104165

Author(s): Himanshu Sharma, Anand Singh Jalal

Keyword(s): Neural Network, Question Answering, Visual Question Answering

Visual question answering model based on visual relationship detection

Signal Processing: Image Communication; 10.1016/j.image.2019.115648; 2020; Vol 80; pp. 115648

Author(s): Yuling Xi, Yanning Zhang, Songtao Ding, Shaohua Wan

Keyword(s): Question Answering, Visual Question Answering

Joint embedding VQA model based on dynamic word vector

PeerJ Computer Science; 10.7717/peerj-cs.353; 2021; Vol 7; pp. e353

Author(s): Zhiyang Ma, Wenfeng Zheng, Xiaobing Chen, Lirong Yin

Keyword(s): Question Answering, Feature Fusion, Image Feature, Text And Image, Visual Question Answering, Model Based, Joint Embedding, Real Language, Language Environment

Existing joint embedding Visual Question Answering (VQA) models use different combinations of image representation, text representation, and feature fusion methods, but all of them use static word vectors for text representation. In real language use, however, the same word may carry different meanings in different contexts and may serve as different grammatical components. Static word vectors cannot express these differences effectively, which can introduce semantic and grammatical deviations. To address this problem, this article constructs a joint embedding model based on dynamic word vectors, the none KB-Specific network (N-KBSN) model, which differs from commonly used VQA models based on static word vectors. The N-KBSN model consists of three main parts: a question text and image feature extraction module, a self-attention and guided-attention module, and a feature fusion and classifier module. Its key components are image representation based on Faster R-CNN, text representation based on ELMo, and feature enhancement based on a multi-head attention mechanism. The experimental results show that N-KBSN outperforms the 2017 winner (GloVe) model and the 2019 winner (GloVe) model. Introducing dynamic word vectors improves the overall accuracy.
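The self-attention and guided-attention stage described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the random arrays stand in for ELMo token vectors and Faster R-CNN region features, the projection weights are untrained, and the choice of image regions as queries over question tokens, along with the mean-pool-and-concatenate fusion, is an assumption made only for illustration. The helper names (`multi_head_attention`, `random_weights`) are hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attention(queries, keys_values, weights, num_heads):
    """Scaled dot-product attention with several heads.

    queries: (n_q, d), keys_values: (n_kv, d),
    weights: dict with (d, d) matrices "Wq", "Wk", "Wv", "Wo".
    Returns the attended features (n_q, d) and the attention maps
    (num_heads, n_q, n_kv).
    """
    n_q, d = queries.shape
    d_head = d // num_heads

    def split(x):
        # (n, d) -> (num_heads, n, d_head)
        return x.reshape(x.shape[0], num_heads, d_head).transpose(1, 0, 2)

    q = split(queries @ weights["Wq"])
    k = split(keys_values @ weights["Wk"])
    v = split(keys_values @ weights["Wv"])
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)
    attn = softmax(scores, axis=-1)
    heads = attn @ v
    merged = heads.transpose(1, 0, 2).reshape(n_q, d)
    return merged @ weights["Wo"], attn

def random_weights(rng, d):
    # Hypothetical untrained projections, used only to make the sketch run.
    names = ("Wq", "Wk", "Wv", "Wo")
    return {name: rng.standard_normal((d, d)) / np.sqrt(d) for name in names}

rng = np.random.default_rng(0)
d_model, num_heads = 16, 4
question_tokens = rng.standard_normal((6, d_model))   # placeholder for ELMo vectors
image_regions = rng.standard_normal((10, d_model))    # placeholder for Faster R-CNN features

# Self-attention: each question token attends to all question tokens.
q_enh, q_attn = multi_head_attention(
    question_tokens, question_tokens, random_weights(rng, d_model), num_heads)

# Guided attention (assumed direction): image regions query the question.
img_enh, g_attn = multi_head_attention(
    image_regions, q_enh, random_weights(rng, d_model), num_heads)

# Assumed fusion: mean-pool each modality and concatenate for a classifier.
fused = np.concatenate([q_enh.mean(axis=0), img_enh.mean(axis=0)])
print(fused.shape)
```

In a trained model the fused vector would feed a classifier over candidate answers; the sketch stops before that step because the abstract does not describe the classifier's form.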

Vision And Text Transformer For Predicting Answerability On Visual Question Answering

10.1109/icip42928.2021.9506796; 2021

Author(s): Tung Le, Huy Tien Nguyen, Minh Le Nguyen

Keyword(s): Question Answering, Visual Question Answering

Visual Question Answering for Monas Tourism Object using Deep Learning

2020 International Conference on Advanced Computer Science and Information Systems (ICACSIS); 10.1109/icacsis51025.2020.9263149; 2020

Author(s): Ahmad Hasan Siregar, Dina Chahyati

Keyword(s): Deep Learning, Question Answering, Visual Question Answering

Cross-modality co-attention networks for visual question answering

Soft Computing; 10.1007/s00500-020-05539-7; 2021

Author(s): Dezhi Han, Shuli Zhou, Kuan Ching Li, Rodrigo Fernandes de Mello

Keyword(s): Question Answering, Attention Networks, Visual Question Answering

Comparative Study of Visual Question Answering Algorithms

2020 15th International Conference on Computer Engineering and Systems (ICCES); 10.1109/icces51560.2020.9334686; 2020

Author(s): Ahmed Mostafa, Hazem Abbas, Mahmoud I. Khalil

Keyword(s): Comparative Study, Question Answering, Visual Question Answering

Visual Question Answering: Methodologies and Challenges

2020 International Conference on Smart Technologies in Computing, Electrical and Electronics (ICSTCEE); 10.1109/icstcee49637.2020.9277374; 2020

Author(s): Liyana Sahir Kallooriyakath, Jithin M V, Bindu P V, Adith P P

Keyword(s): Question Answering, Visual Question Answering

Boosting Visual Question Answering with Context-aware Knowledge Aggregation

Proceedings of the 28th ACM International Conference on Multimedia; 10.1145/3394171.3413943; 2020

Author(s): Guohao Li, Xin Wang, Wenwu Zhu

Keyword(s): Question Answering, Context Aware, Visual Question Answering

Adaptive Re-Balancing Network with Gate Mechanism for Long-Tailed Visual Question Answering

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 10.1109/icassp39728.2021.9414074; 2021

Author(s): Hongyu Chen, Ruifang Liu, Han Fang, Ximing Zhang

Keyword(s): Question Answering, Visual Question Answering
