Learning to Generate Grounded Visual Captions Without Localization Supervision

Computer Vision – ECCV 2020 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-58523-5_21 ◽

2020 ◽

pp. 353-370

Author(s):

Chih-Yao Ma ◽

Yannis Kalantidis ◽

Ghassan AlRegib ◽

Peter Vajda ◽

Marcus Rohrbach ◽

...

Download Full-text