Improving Diversity of Image Captioning Through Variational Autoencoders and Adversarial Learning

2019 IEEE Winter Conference on Applications of Computer Vision (WACV) ◽

10.1109/wacv.2019.00034 ◽

2019 ◽

Author(s):

Li Ren ◽

Guo-Jun Qi ◽

Kien Hua

Keyword(s):

Image Captioning ◽

Adversarial Learning

Download Full-text

Better Understanding: Stylized Image Captioning with Style Attention and Adversarial Training

Symmetry ◽

10.3390/sym12121978 ◽

2020 ◽

Vol 12 (12) ◽

pp. 1978

Author(s):

Zhenyu Yang ◽

Qiao Liu ◽

Guojing Liu

Keyword(s):

Learning Ability ◽

Image Captioning ◽

Adversarial Learning ◽

Symmetric Structure ◽

Effective Performance ◽

Meta Information ◽

Adversarial Training ◽

Factor Form ◽

Style Factor ◽

Traditional Image

Compared with traditional image captioning technology, stylized image captioning has broader application scenarios, such as a better understanding of images. However, stylized image captioning faces many challenges, the most important of which is how to make the model take into account both the image meta information and the style factor of the generated captions. In this paper, we propose a novel end-to-end stylized image captioning framework (ST-BR). Specifically, we first use a style transformer to model the factual information of images, and the style attention module learns style factor form a multi-style corpus, it is a symmetric structure on the whole. At the same time, we use back-reinforcement to evaluate the degree of consistency between the generated stylized captions with the image knowledge and specified style, respectively. These two parts further enhance the learning ability of the model through adversarial learning. Our experiment has achieved effective performance on the benchmark dataset.

Download Full-text

Feedback Attention Model for Image Captioning

Journal of Computer-Aided Design & Computer Graphics ◽

10.3724/sp.j.1089.2019.17505 ◽

2019 ◽

Vol 31 (7) ◽

pp. 1122

Author(s):

Fan Lyu ◽

Fuyuan Hu ◽

Yanning Zhang ◽

Zhenping Xia ◽

S Sheng Victor

Keyword(s):

Image Captioning ◽

Attention Model

Download Full-text

Deep learning based character-oriented image captioning method for visually impaired

Journal of rehabilitation welfare engineering & assistive technology ◽

10.21288/resko.2019.13.2.143 ◽

2019 ◽

Vol 13 (2) ◽

pp. 143-149

Author(s):

H. W. Seol ◽

C. Poleak ◽

J. W. Kwon

Keyword(s):

Deep Learning ◽

Visually Impaired ◽

Image Captioning

Download Full-text

Reconstructing seen image from brain activity by visually-guided cognitive representation and adversarial learning

10.1016/j.neuroimage.2020.117602 ◽

2021 ◽

Vol 228 ◽

pp. 117602

Author(s):

Ziqi Ren ◽

Jie Li ◽

Xuetong Xue ◽

Xin Li ◽

Fan Yang ◽

...

Keyword(s):

Brain Activity ◽

Cognitive Representation ◽

Visually Guided ◽

Adversarial Learning

Download Full-text

An Image Captioning Model Based on Bidirectional Depth Residuals and its Application

IEEE Access ◽

10.1109/access.2021.3057091 ◽

2021 ◽

Vol 9 ◽

pp. 25360-25370

Author(s):

Ziwei Zhou ◽

Liang Xu ◽

Chaoyang Wang ◽

Wei Xie ◽

Shuo Wang ◽

...

Keyword(s):

Image Captioning ◽

Download Full-text

Graph Self-Attention Network for Image Captioning

2020 IEEE/ACS 17th International Conference on Computer Systems and Applications (AICCSA) ◽

10.1109/aiccsa50499.2020.9316518 ◽

2020 ◽

Author(s):

Qitong Zheng ◽

Yuping Wang

Keyword(s):

Image Captioning ◽

Attention Network

Download Full-text

Image Captioning Using Deep Convolutional Neural Networks (CNNs)

Journal of Physics Conference Series ◽

10.1088/1742-6596/1712/1/012015 ◽

2020 ◽

Vol 1712 ◽

pp. 012015

Author(s):

G. Geetha ◽

T. Kirthigadevi ◽

G.Godwin Ponsam ◽

T. Karthik ◽

M. Safa

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Image Captioning ◽

Deep Convolutional Neural Networks

Download Full-text

Image Captioning with Pretrained Language Generators

8th ACM IKDD CODS and 26th COMAD ◽

10.1145/3430984.3431059 ◽

2020 ◽

Author(s):

Saketh Vishnubhatla ◽

Nishant Sinha

Keyword(s):

Image Captioning

Download Full-text

Review on Deep Adversarial Learning of Entity Resolution for Cross-Modal Data

2020 2nd International Conference on Information Technology and Computer Application (ITCA) ◽

10.1109/itca52113.2020.00128 ◽

2020 ◽

Author(s):

Yizhuo Rao ◽

Chengyuan Duan ◽

Xiao Wei

Keyword(s):

Entity Resolution ◽

Adversarial Learning ◽

Download Full-text

SVGAN: Semi-supervised Generative Adversarial Network for Image Captioning

2020 IEEE Conference on Telecommunications, Optics and Computer Science (TOCS) ◽

10.1109/tocs50858.2020.9339713 ◽

2020 ◽

Author(s):

Yi Zhang ◽

Wei Zeng ◽

Gangqiang He ◽

Yueyuan Liu

Keyword(s):

Image Captioning ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text