scholarly journals Improved Image Captioning via Policy Gradient optimization of SPIDEr

Author(s):  
Siqi Liu ◽  
Zhenhai Zhu ◽  
Ning Ye ◽  
Sergio Guadarrama ◽  
Kevin Murphy
2020 ◽  
Vol 167 ◽  
pp. 107329 ◽  
Author(s):  
Shiyang Yan ◽  
Yuan Xie ◽  
Fangyu Wu ◽  
Jeremy S. Smith ◽  
Wenjin Lu ◽  
...  

2010 ◽  
Vol 2010 ◽  
pp. 1-20 ◽  
Author(s):  
Liang Tang ◽  
Hong-sheng Xi ◽  
Jin Zhu ◽  
Bao-qun Yin

A mathematical model forM/G/1-type queueing networks with multiple user applications and limited resources is established. The goal is to develop a dynamic distributed algorithm for this model, which supports all data traffic as efficiently as possible and makes optimally fair decisions about how to minimize the network performance cost. An online policy gradient optimization algorithm based on a single sample path is provided to avoid suffering from a “curse of dimensionality”. The asymptotic convergence properties of this algorithm are proved. Numerical examples provide valuable insights for bridging mathematical theory with engineering practice.


2020 ◽  
Vol 34 (03) ◽  
pp. 2693-2700
Author(s):  
Paul Hongsuck Seo ◽  
Piyush Sharma ◽  
Tomer Levinboim ◽  
Bohyung Han ◽  
Radu Soricut

Human ratings are currently the most accurate way to assess the quality of an image captioning model, yet most often the only used outcome of an expensive human rating evaluation is a few overall statistics over the evaluation dataset. In this paper, we show that the signal from instance-level human caption ratings can be leveraged to improve captioning models, even when the amount of caption ratings is several orders of magnitude less than the caption training data. We employ a policy gradient method to maximize the human ratings as rewards in an off-policy reinforcement learning setting, where policy gradients are estimated by samples from a distribution that focuses on the captions in a caption ratings dataset. Our empirical evidence indicates that the proposed method learns to generalize the human raters' judgments to a previously unseen set of images, as judged by a different set of human judges, and additionally on a different, multi-dimensional side-by-side human evaluation procedure.


2019 ◽  
Vol 31 (7) ◽  
pp. 1122
Author(s):  
Fan Lyu ◽  
Fuyuan Hu ◽  
Yanning Zhang ◽  
Zhenping Xia ◽  
S Sheng Victor

IEEE Access ◽  
2021 ◽  
Vol 9 ◽  
pp. 25360-25370
Author(s):  
Ziwei Zhou ◽  
Liang Xu ◽  
Chaoyang Wang ◽  
Wei Xie ◽  
Shuo Wang ◽  
...  
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document