Improving Diversity of Image Captioning Through Variational Autoencoders and Adversarial Learning

Author(s):  
Li Ren ◽  
Guo-Jun Qi ◽  
Kien Hua
Symmetry ◽  
2020 ◽  
Vol 12 (12) ◽  
pp. 1978
Author(s):  
Zhenyu Yang ◽  
Qiao Liu ◽  
Guojing Liu

Compared with traditional image captioning technology, stylized image captioning has broader application scenarios, such as a better understanding of images. However, stylized image captioning faces many challenges, the most important of which is how to make the model take into account both the image meta information and the style factor of the generated captions. In this paper, we propose a novel end-to-end stylized image captioning framework (ST-BR). Specifically, we first use a style transformer to model the factual information of images, and the style attention module learns style factor form a multi-style corpus, it is a symmetric structure on the whole. At the same time, we use back-reinforcement to evaluate the degree of consistency between the generated stylized captions with the image knowledge and specified style, respectively. These two parts further enhance the learning ability of the model through adversarial learning. Our experiment has achieved effective performance on the benchmark dataset.


2019 ◽  
Vol 31 (7) ◽  
pp. 1122
Author(s):  
Fan Lyu ◽  
Fuyuan Hu ◽  
Yanning Zhang ◽  
Zhenping Xia ◽  
S Sheng Victor

NeuroImage ◽  
2021 ◽  
Vol 228 ◽  
pp. 117602
Author(s):  
Ziqi Ren ◽  
Jie Li ◽  
Xuetong Xue ◽  
Xin Li ◽  
Fan Yang ◽  
...  

IEEE Access ◽  
2021 ◽  
Vol 9 ◽  
pp. 25360-25370
Author(s):  
Ziwei Zhou ◽  
Liang Xu ◽  
Chaoyang Wang ◽  
Wei Xie ◽  
Shuo Wang ◽  
...  
Keyword(s):  

2020 ◽  
Vol 1712 ◽  
pp. 012015
Author(s):  
G. Geetha ◽  
T. Kirthigadevi ◽  
G.Godwin Ponsam ◽  
T. Karthik ◽  
M. Safa

Sign in / Sign up

Export Citation Format

Share Document