Visual Question Answering using Explicit Visual Attention

2018 IEEE International Symposium on Circuits and Systems (ISCAS) ◽

10.1109/iscas.2018.8351158 ◽

2018 ◽

Author(s):

Vasileios Lioutas ◽

Nikolaos Passalis ◽

Anastasios Tefas

Keyword(s):

Visual Attention ◽

Question Answering ◽

Visual Question Answering

Download Full-text

From Pixels to Objects: Cubic Visual Attention for Visual Question Answering

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/126 ◽

2018 ◽

Author(s):

Jingkuan Song ◽

Pengpeng Zeng ◽

Lianli Gao ◽

Heng Tao Shen

Keyword(s):

Visual Attention ◽

Spatial Attention ◽

Question Answering ◽

Public Image ◽

Great Success ◽

Feature Map ◽

Visual Question Answering ◽

The Arts ◽

Object Proposal ◽

Recently, attention-based Visual Question Answering (VQA) has achieved great success by utilizing question to selectively target different visual areas that are related to the answer. Existing visual attention models are generally planar, i.e., different channels of the last conv-layer feature map of an image share the same weight. This conflicts with the attention mechanism because CNN features are naturally spatial and channel-wise. Also, visual attention models are usually conducted on pixel-level, which may cause region discontinuous problem. In this paper we propose a Cubic Visual Attention (CVA) model by successfully applying a novel channel and spatial attention on object regions to improve VQA task. Specifically, instead of attending to pixels, we first take advantage of the object proposal networks to generate a set of object candidates and extract their associated conv features. Then, we utilize the question to guide channel attention and spatial attention calculation based on the con-layer feature map. Finally, the attended visual features and the question are combined to infer the answer. We assess the performance of our proposed CVA on three public image QA datasets, including COCO-QA, VQA and Visual7W. Experimental results show that our proposed method significantly outperforms the state-of-the-arts.

Download Full-text

Vision And Text Transformer For Predicting Answerability On Visual Question Answering

10.1109/icip42928.2021.9506796 ◽

2021 ◽

Author(s):

Tung Le ◽

Huy Tien Nguyen ◽

Minh Le Nguyen

Keyword(s):

Question Answering ◽

Visual Question Answering

Download Full-text

Visual Question Answering for Monas Tourism Object using Deep Learning

2020 International Conference on Advanced Computer Science and Information Systems (ICACSIS) ◽

10.1109/icacsis51025.2020.9263149 ◽

2020 ◽

Author(s):

Ahmad Hasan Siregar ◽

Dina Chahyati

Keyword(s):

Deep Learning ◽

Question Answering ◽

Visual Question Answering

Download Full-text

Cross-modality co-attention networks for visual question answering

Soft Computing ◽

10.1007/s00500-020-05539-7 ◽

2021 ◽

Author(s):

Dezhi Han ◽

Shuli Zhou ◽

Kuan Ching Li ◽

Rodrigo Fernandes de Mello

Keyword(s):

Question Answering ◽

Attention Networks ◽

Visual Question Answering

Download Full-text

Comparative Study of Visual Question Answering Algorithms

2020 15th International Conference on Computer Engineering and Systems (ICCES) ◽

10.1109/icces51560.2020.9334686 ◽

2020 ◽

Author(s):

Ahmed Mostafa ◽

Hazem Abbas ◽

Mahmoud I. Khalil

Keyword(s):

Comparative Study ◽

Question Answering ◽

Visual Question Answering

Download Full-text

Visual Question Answering: Methodologies and Challenges

2020 International Conference on Smart Technologies in Computing, Electrical and Electronics (ICSTCEE) ◽

10.1109/icstcee49637.2020.9277374 ◽

2020 ◽

Author(s):

Liyana Sahir Kallooriyakath ◽

Jithin M V ◽

Bindu P V ◽

Adith P P

Keyword(s):

Question Answering ◽

Visual Question Answering

Download Full-text

Boosting Visual Question Answering with Context-aware Knowledge Aggregation

Proceedings of the 28th ACM International Conference on Multimedia ◽

10.1145/3394171.3413943 ◽

2020 ◽

Author(s):

Guohao Li ◽

Xin Wang ◽

Wenwu Zhu

Keyword(s):

Question Answering ◽

Context Aware ◽

Visual Question Answering

Download Full-text

Adaptive Re-Balancing Network with Gate Mechanism for Long-Tailed Visual Question Answering

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414074 ◽

2021 ◽

Author(s):

Hongyu Chen ◽

Ruifang Liu ◽

Han Fang ◽

Ximing Zhang

Keyword(s):

Question Answering ◽

Visual Question Answering

Download Full-text

Visual question answering model based on graph neural network and contextual attention

Image and Vision Computing ◽

10.1016/j.imavis.2021.104165 ◽

2021 ◽

pp. 104165

Author(s):

Himanshu Sharma ◽

Anand Singh Jalal

Keyword(s):

Neural Network ◽

Question Answering ◽

Visual Question Answering ◽

Download Full-text

Co-attention Spatial Reasoning for Visual Question Answering

Journal of Physics Conference Series ◽

10.1088/1742-6596/1828/1/012145 ◽

2021 ◽

Vol 1828 (1) ◽

pp. 012145

Author(s):

Ye Qin ◽

Zhiping Zhou ◽

Chen Biao ◽

Li Wenjie

Keyword(s):

Question Answering ◽

Spatial Reasoning ◽

Visual Question Answering

Download Full-text