Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning

Mapping Intimacies ◽

10.1145/3474085.3475439 ◽

2021 ◽

Author(s):

Xinzhi Dong ◽

Chengjiang Long ◽

Wenju Xu ◽

Chunxia Xiao

Keyword(s):

Image Captioning ◽

Convolutional Networks

Download Full-text

Noise Augmented Double-stream Graph Convolutional Networks for Image Captioning

IEEE Transactions on Circuits and Systems for Video Technology ◽

10.1109/tcsvt.2020.3036860 ◽

2020 ◽

pp. 1-1

Author(s):

Lingxiang Wu ◽

Min Xu ◽

Lei Sang ◽

Ting Yao ◽

Tao Mei

Keyword(s):

Image Captioning ◽

Convolutional Networks

Download Full-text

One-Shot Learning for Long-Tail Visual Relation Detection

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6904 ◽

2020 ◽

Vol 34 (07) ◽

pp. 12225-12232

Author(s):

Weitao Wang ◽

Meng Wang ◽

Sen Wang ◽

Guodong Long ◽

Lina Yao ◽

...

Keyword(s):

Question Answering ◽

Image Captioning ◽

Long Tail ◽

Training Scheme ◽

Training Samples ◽

Latent Features ◽

The One ◽

Novel Model ◽

Conventional Detection

The aim of visual relation detection is to provide a comprehensive understanding of an image by describing all the objects within the scene, and how they relate to each other, in < object-predicate-object > form; for example, < person-lean on-wall > . This ability is vital for image captioning, visual question answering, and many other applications. However, visual relationships have long-tailed distributions and, thus, the limited availability of training samples is hampering the practicability of conventional detection approaches. With this in mind, we designed a novel model for visual relation detection that works in one-shot settings. The embeddings of objects and predicates are extracted through a network that includes a feature-level attention mechanism. Attention alleviates some of the problems with feature sparsity, and the resulting representations capture more discriminative latent features. The core of our model is a dual graph neural network that passes and aggregates the context information of predicates and objects in an episodic training scheme to improve recognition of the one-shot predicates and then generate the triplets. To the best of our knowledge, we are the first to center on the viability of one-shot learning for visual relation detection. Extensive experiments on two newly-constructed datasets show that our model significantly improved the performance of two tasks PredCls and SGCls from 2.8% to 12.2% compared with state-of-the-art baselines.

Download Full-text

Dual Graph Convolutional Networks for Graph-Based Semi-Supervised Classification

Proceedings of the 2018 World Wide Web Conference on World Wide Web - WWW '18 ◽

10.1145/3178876.3186116 ◽

2018 ◽

Author(s):

Chenyi Zhuang ◽

Qiang Ma

Keyword(s):

Supervised Classification ◽

Convolutional Networks

Download Full-text

Dual Graph Convolutional Networks for Aspect-based Sentiment Analysis

10.18653/v1/2021.acl-long.494 ◽

2021 ◽

Author(s):

Ruifan Li ◽

Hao Chen ◽

Fangxiang Feng ◽

Zhanyu Ma ◽

Xiaojie Wang ◽

...

Keyword(s):

Sentiment Analysis ◽

Convolutional Networks

Download Full-text

SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) ◽

10.1109/cvpr.2017.667 ◽

2017 ◽

Author(s):

Long Chen ◽

Hanwang Zhang ◽

Jun Xiao ◽

Liqiang Nie ◽

Jian Shao ◽

...

Keyword(s):

Image Captioning ◽

Convolutional Networks

Download Full-text

A model for designing nature reserves with minimal fragmenta-tion using a primal-dual graph approach

Biodiversity Science ◽

10.3724/sp.j.1003.2011.10020 ◽

2011 ◽

Vol 19 (4) ◽

pp. 404-413

Author(s):

Wang Yicheng

Keyword(s):

Nature Reserves ◽

Download Full-text

Feedback Attention Model for Image Captioning

Journal of Computer-Aided Design & Computer Graphics ◽

10.3724/sp.j.1089.2019.17505 ◽

2019 ◽

Vol 31 (7) ◽

pp. 1122

Author(s):

Fan Lyu ◽

Fuyuan Hu ◽

Yanning Zhang ◽

Zhenping Xia ◽

S Sheng Victor

Keyword(s):

Image Captioning ◽

Attention Model

Download Full-text

Deep learning based character-oriented image captioning method for visually impaired

Journal of rehabilitation welfare engineering & assistive technology ◽

10.21288/resko.2019.13.2.143 ◽

2019 ◽

Vol 13 (2) ◽

pp. 143-149

Author(s):

H. W. Seol ◽

C. Poleak ◽

J. W. Kwon

Keyword(s):

Deep Learning ◽

Visually Impaired ◽

Image Captioning

Download Full-text

Deep Attention Gated Dilated Temporal Convolutional Networks with Intra-Parallel Convolutional Modules for End-to-End Monaural Speech Separation

10.21437/interspeech.2019-1373 ◽

2019 ◽

Author(s):

Ziqiang Shi ◽

Huibin Lin ◽

Liu Liu ◽

Rujie Liu ◽

Jiqing Han ◽

...

Keyword(s):

Speech Separation ◽

Convolutional Networks ◽

Download Full-text

Batched Sparse Matrix Multiplication for Accelerating Graph Convolutional Networks

2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) ◽

10.1109/ccgrid.2019.00037 ◽

2019 ◽

Author(s):

Yusuke Nagasaka ◽

Akira Nukada ◽

Ryosuke Kojima ◽

Satoshi Matsuoka

Keyword(s):

Sparse Matrix ◽

Matrix Multiplication ◽

Convolutional Networks

Download Full-text