Visual question answering model based on graph neural network and contextual attention

Static Correlative Filter Based Convolutional Neural Network for Visual Question Answering

2018 IEEE International Conference on Big Data and Smart Computing (BigComp) ◽

10.1109/bigcomp.2018.00087 ◽

2018 ◽

Cited By ~ 1

Author(s):

Lijun Chen ◽

Qinyu Li ◽

Hanli Wang ◽

Yu Long

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Question Answering ◽

Visual Question Answering

Download Full-text

A Novel Approach on Visual Question Answering by Parameter Prediction using Faster Region Based Convolutional Neural Network

International Journal of Interactive Multimedia and Artificial Intelligence ◽

10.9781/ijimai.2018.08.004 ◽

2019 ◽

Vol 5 (5) ◽

pp. 30 ◽

Cited By ~ 3

Author(s):

Sudan Jha ◽

Anirban Dey ◽

Raghvendra Kumar ◽

Vijender Kumar-Solanki

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Question Answering ◽

Visual Question Answering ◽

Novel Approach ◽

Parameter Prediction

Download Full-text

Visual question answering model based on visual relationship detection

Signal Processing Image Communication ◽

10.1016/j.image.2019.115648 ◽

2020 ◽

Vol 80 ◽

pp. 115648 ◽

Cited By ~ 10

Author(s):

Yuling Xi ◽

Yanning Zhang ◽

Songtao Ding ◽

Shaohua Wan

Keyword(s):

Question Answering ◽

Visual Question Answering ◽

Model Based

Download Full-text

Multi-modal Contextual Graph Neural Network for Text Visual Question Answering

2020 25th International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr48806.2021.9412891 ◽

2021 ◽

Author(s):

Yaoyuan Liang ◽

Xin Wang ◽

Xuguang Duan ◽

Wenwu Zhu

Keyword(s):

Neural Network ◽

Question Answering ◽

Visual Question Answering

Download Full-text

Analysis of English Multitext Reading Comprehension Model Based on Deep Belief Neural Network

Computational Intelligence and Neuroscience ◽

10.1155/2021/5100809 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Qiaohui Tang

Keyword(s):

Neural Network ◽

Reading Comprehension ◽

Question Answering ◽

Test Analysis ◽

Model Based ◽

English Reading ◽

Comprehension Ability ◽

Low Efficiency ◽

Machine Reading ◽

Strong Reading

In order to solve the problems of low accuracy and low efficiency of answer prediction in machine reading comprehension, a multitext English reading comprehension model based on the deep belief neural network is proposed. Firstly, the paragraph selector in the multitext reading comprehension model is constructed. Secondly, the text reader is designed, and the deep belief neural network is introduced to predict the question answering probability. Finally, the popular English dataset of SQuAD is used for test analysis. The final results show that, after the comparative analysis of different learning methods, it is found that the English multitext reading comprehension model has a strong reading comprehension ability. In addition, two evaluation methods are used to score the overall performance of the model, which shows that the overall score of the English multitext reading comprehension model based on the deep confidence neural network is more than 90, and the efficiency will not be reduced because of the change of the number of documents in the dataset. The above results show that the use of the deep belief neural network to improve the probability generation performance of the model can well solve the task of English multitext reading comprehension, effectively reduce the difficulty of machine reading comprehension in multitask reading, and has a good guiding significance for promoting human convenient Internet knowledge acquisition.

Download Full-text

A Lightweight Visual Question Answering Model based on Semantic Similarity

10.1145/3490725.3490736 ◽

2021 ◽

Author(s):

Zhiming He ◽

Jingping Zeng

Keyword(s):

Semantic Similarity ◽

Question Answering ◽

Visual Question Answering ◽

Model Based

Download Full-text

Convolutional Neural Network based Review System for Automatic Past Event Search using Visual Question Answering

2020 International Conference on Inventive Computation Technologies (ICICT) ◽

10.1109/icict48043.2020.9112454 ◽

2020 ◽

Author(s):

Pranita P. Deshmukh ◽

Rani S. Lande

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Question Answering ◽

Review System ◽

Past Event ◽

Visual Question Answering

Download Full-text

Joint embedding VQA model based on dynamic word vector

PeerJ Computer Science ◽

10.7717/peerj-cs.353 ◽

2021 ◽

Vol 7 ◽

pp. e353

Author(s):

Zhiyang Ma ◽

Wenfeng Zheng ◽

Xiaobing Chen ◽

Lirong Yin

Keyword(s):

Question Answering ◽

Feature Fusion ◽

Image Feature ◽

Text And Image ◽

Visual Question Answering ◽

Model Based ◽

Joint Embedding ◽

Real Language ◽

Language Environment ◽

Better Than

The existing joint embedding Visual Question Answering models use different combinations of image characterization, text characterization and feature fusion method, but all the existing models use static word vectors for text characterization. However, in the real language environment, the same word may represent different meanings in different contexts, and may also be used as different grammatical components. These differences cannot be effectively expressed by static word vectors, so there may be semantic and grammatical deviations. In order to solve this problem, our article constructs a joint embedding model based on dynamic word vector—none KB-Specific network (N-KBSN) model which is different from commonly used Visual Question Answering models based on static word vectors. The N-KBSN model consists of three main parts: question text and image feature extraction module, self attention and guided attention module, feature fusion and classifier module. Among them, the key parts of N-KBSN model are: image characterization based on Faster R-CNN, text characterization based on ELMo and feature enhancement based on multi-head attention mechanism. The experimental results show that the N-KBSN constructed in our experiment is better than the other 2017—winner (glove) model and 2019—winner (glove) model. The introduction of dynamic word vector improves the accuracy of the overall results.

Download Full-text

A Convolutional Neural Network Based Approach For Visual Question Answering

10.31979/etd.g2w5-abud ◽

2018 ◽

Author(s):

Lavanya Abhinaya Koduri

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Question Answering ◽

Visual Question Answering

Download Full-text

Visual Question Answering using Convolutional Neural Networks

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i1s.1602 ◽

2021 ◽

Vol 12 (1S) ◽

pp. 170-175

Author(s):

K. P. Moholkar, Et. al.

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Neural Networks ◽

Computer Vision ◽

Recurrent Neural Network ◽

Question Answering ◽

Question Answering System ◽

Visual Question Answering ◽

Generalized System ◽

Multiple Scenarios

The ability of a computer system to be able to understand surroundings and elements and to think like a human being to process the information has always been the major point of focus in the field of Computer Science. One of the ways to achieve this artificial intelligence is Visual Question Answering. Visual Question Answering (VQA) is a trained system which can answer the questions associated to a given image in Natural Language. VQA is a generalized system which can be used in any image-based scenario with adequate training on the relevant data. This is achieved with the help of Neural Networks, particularly Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). In this study, we have compared different approaches of VQA, out of which we are exploring CNN based model. With the continued progress in the field of Computer Vision and Question answering system, Visual Question Answering is becoming the essential system which can handle multiple scenarios with their respective data.

Download Full-text