Multivariate Attention Network for Image Captioning

Graph Self-Attention Network for Image Captioning

2020 IEEE/ACS 17th International Conference on Computer Systems and Applications (AICCSA) ◽

10.1109/aiccsa50499.2020.9316518 ◽

2020 ◽

Author(s):

Qitong Zheng ◽

Yuping Wang

Keyword(s):

Image Captioning ◽

Attention Network

Download Full-text

Normalized and Geometry-Aware Self-Attention Network for Image Captioning

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) ◽

10.1109/cvpr42600.2020.01034 ◽

2020 ◽

Cited By ~ 1

Author(s):

Longteng Guo ◽

Jing Liu ◽

Xinxin Zhu ◽

Peng Yao ◽

Shichen Lu ◽

...

Keyword(s):

Image Captioning ◽

Attention Network

Download Full-text

Multi-View Attention Network for Remote Sensing Image Captioning

10.1109/igarss47720.2021.9555083 ◽

2021 ◽

Author(s):

Yun Meng ◽

Yu Gu ◽

Xiutiao Ye ◽

Jingxian Tian ◽

Shuang Wang ◽

...

Keyword(s):

Remote Sensing ◽

Remote Sensing Image ◽

Image Captioning ◽

Attention Network

Download Full-text

Bi-Directional Co-Attention Network for Image Captioning

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3460474 ◽

2021 ◽

Vol 17 (4) ◽

pp. 1-20

Author(s):

Weitao Jiang ◽

Weixuan Wang ◽

Haifeng Hu

Keyword(s):

A Priori ◽

Attention Mechanism ◽

Superior Performance ◽

Significant Advance ◽

Visual Features ◽

Image Captioning ◽

Top Down ◽

Bottom Up ◽

Attention Network ◽

Benchmark Datasets

Image Captioning, which automatically describes an image with natural language, is regarded as a fundamental challenge in computer vision. In recent years, significant advance has been made in image captioning through improving attention mechanism. However, most existing methods construct attention mechanisms based on singular visual features, such as patch features or object features, which limits the accuracy of generated captions. In this article, we propose a Bidirectional Co-Attention Network (BCAN) that combines multiple visual features to provide information from different aspects. Different features are associated with predicting different words, and there are a priori relations between these multiple visual features. Based on this, we further propose a bottom-up and top-down bi-directional co-attention mechanism to extract discriminative attention information. Furthermore, most existing methods do not exploit an effective multimodal integration strategy, generally using addition or concatenation to combine features. To solve this problem, we adopt the Multivariate Residual Module (MRM) to integrate multimodal attention features. Meanwhile, we further propose a Vertical MRM to integrate features of the same category, and a Horizontal MRM to combine features of the different categories, which can balance the contribution of the bottom-up co-attention and the top-down co-attention. In contrast to the existing methods, the BCAN is able to obtain complementary information from multiple visual features via the bi-directional co-attention strategy, and integrate multimodal information via the improved multivariate residual strategy. We conduct a series of experiments on two benchmark datasets (MSCOCO and Flickr30k), and the results indicate that the proposed BCAN achieves the superior performance.

Download Full-text

Hierarchical Attention Network for Image Captioning

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33018957 ◽

2019 ◽

Vol 33 ◽

pp. 8957-8964 ◽

Cited By ~ 7

Author(s):

Weixuan Wang ◽

Zhihong Chen ◽

Haifeng Hu

Keyword(s):

State Of The Art ◽

The Other ◽

Multimodal Integration ◽

Image Captioning ◽

Attention Network ◽

Spatial Features ◽

Text Features ◽

Integration Strategies ◽

High Level ◽

Hierarchical Features

Recently, attention mechanism has been successfully applied in image captioning, but the existing attention methods are only established on low-level spatial features or high-level text features, which limits richness of captions. In this paper, we propose a Hierarchical Attention Network (HAN) that enables attention to be calculated on pyramidal hierarchy of features synchronously. The pyramidal hierarchy consists of features on diverse semantic levels, which allows predicting different words according to different features. On the other hand, due to the different modalities of features, a Multivariate Residual Module (MRM) is proposed to learn the joint representations from features. The MRM is able to model projections and extract relevant relations among different features. Furthermore, we introduce a context gate to balance the contribution of different features. Compared with the existing methods, our approach applies hierarchical features and exploits several multimodal integration strategies, which can significantly improve the performance. The HAN is verified on benchmark MSCOCO dataset, and the experimental results indicate that our model outperforms the state-of-the-art methods, achieving a BLEU1 score of 80.9 and a CIDEr score of 121.7 in the Karpathy’s test split.

Download Full-text

Multi-Gate Attention Network for Image Captioning

IEEE Access ◽

10.1109/access.2021.3067607 ◽

2021 ◽

pp. 1-1

Author(s):

Weitao Jiang ◽

Xiying Li ◽

Haifeng Hu ◽

Qiang Lu ◽

Bohong Liu

Keyword(s):

Image Captioning ◽

Attention Network

Download Full-text

Attend to Knowledge: Memory-Enhanced Attention Network for Image Captioning

Advances in Brain Inspired Cognitive Systems - Lecture Notes in Computer Science ◽

10.1007/978-3-030-00563-4_16 ◽

2018 ◽

pp. 161-171 ◽

Cited By ~ 2

Author(s):

Hui Chen ◽

Guiguang Ding ◽

Zijia Lin ◽

Yuchen Guo ◽

Jungong Han

Keyword(s):

Image Captioning ◽

Attention Network ◽

Knowledge Memory

Download Full-text

Neurofeedback-Training bei Kindern mit Aufmerksamkeitsdefizit-/ Hyperaktivitätsstörung (ADHS)

Zeitschrift für Kinder- und Jugendpsychiatrie und Psychotherapie ◽

10.1024/1422-4917/a000070 ◽

2010 ◽

Vol 38 (6) ◽

pp. 409-420 ◽

Cited By ~ 10

Author(s):

Holger Gevensleben ◽

Gunther H. Moll ◽

Hartmut Heinrich

Keyword(s):

Attention Network Test ◽

Sich Eine ◽

Neurofeedback Training ◽

Attention Network ◽

Klinische Wirksamkeit

Im Rahmen einer multizentrischen, randomisierten, kontrollierten Studie evaluierten wir die klinische Wirksamkeit eines Neurofeedback-Trainings (NF) bei Kindern mit einer Aufmerksamkeitsdefizit-/Hyperaktivitätsstörung (ADHS) und untersuchten die einem erfolgreichen Training zugrunde liegenden neurophysiologischen Wirkmechanismen. Als Vergleichstraining diente ein computergestütztes Aufmerksamkeitstraining, das dem Setting des Neurofeedback-Trainings in den wesentlichen Anforderungen und Rahmenbedingungen angeglichen war. Auf Verhaltensebene (Eltern- und Lehrerbeurteilung) zeigte sich das NF-Training nach Trainingsende dem Kontrolltraining sowohl hinsichtlich der ADHS-Kernsymptomatik als auch in assoziierten Bereichen überlegen. Für das Hauptzielkriterium (Verbesserung im FBB-HKS Gesamtwert) ergab sich eine mittlere Effektstärke (von 0.6). Sechs Monate nach Trainingsende (follow-up) konnte das gleiche Ergebnismuster gefunden werden. Die Ergebnisse legen somit den Schluss nahe, dass NF einen klinisch wirksamen Therapiebaustein zur Behandlung von Kindern mit ADHS darstellt. Auf neurophysiologischer Ebene (EEG; ereignisbezogene Potentiale, EPs) konnten für die beiden Neurofeedback-Protokolle Theta/Beta-Training und Training langsamer kortikaler Potentiale spezifische Effekte aufgezeigt werden. So war für das Theta/Beta-Training beispielsweise die Abnahme der Theta-Aktivität mit einer Reduzierung der ADHS-Symptomatik assoziiert. Für das SCP-Training wurde u. a. im Attention Network Test eine Erhöhung der kontingenten negativen Variation beobachtet, die die mobilisierten Ressourcen bei Vorbereitungsprozessen widerspiegelt. EEG- und EP-basierte Prädiktorvariablen konnten ermittelt werden. Der vorliegende Artikel bietet einen Gesamtüberblick über die in verschiedenen Publikationen unserer Arbeitsgruppe beschriebenen Ergebnisse der Studie und zeigt zukünftige Fragestellungen auf.

Download Full-text

External modulation of the sustained attention network in traumatic brain injury.

Neuropsychology ◽

10.1037/neu0000442 ◽

2018 ◽

Vol 32 (5) ◽

pp. 541-553 ◽

Cited By ~ 3

Author(s):

Nadine M. Richard ◽

Charlene O'Connor ◽

Ayan Dey ◽

Ian H. Robertson ◽

Brian Levine

Keyword(s):

Traumatic Brain Injury ◽

Brain Injury ◽

Sustained Attention ◽

Attention Network ◽

External Modulation

Download Full-text

El Attention Network Test en el estudio de los déficits cognitivos de pacientes con trastorno por déficit de atención

Revista de Neurología ◽

10.33588/rn.6910.2019202 ◽

2019 ◽

Vol 69 (10) ◽

pp. 423

Author(s):

Manuel Vázquez Marrufo ◽

Macarena García-Valdecasas Colell ◽

Alejandro Galvao Carmona ◽

Esteban Sarrias Arrabal ◽

Javier Tirapu Ustárroz

Keyword(s):

Attention Network Test ◽

Attention Network

Download Full-text