attention mechanism Latest Research Papers

Self-attention mechanisms have recently been embraced for a broad range of text-matching applications. Self-attention model takes only one sentence as an input with no extra information, i.e., one can utilize the final hidden state or pooling. However, text-matching problems can be interpreted either in symmetrical or asymmetrical scopes. For instance, paraphrase detection is an asymmetrical task, while textual entailment classification and question-answer matching are considered asymmetrical tasks. In this article, we leverage attractive properties of self-attention mechanism and proposes an attention-based network that incorporates three key components for inter-sequence attention: global pointwise features, preceding attentive features, and contextual features while updating the rest of the components. Our model follows evaluation on two benchmark datasets cover tasks of textual entailment and question-answer matching. The proposed efficient Self-attention-driven Network for Text Matching outperforms the state of the art on the Stanford Natural Language Inference and WikiQA datasets with much fewer parameters.

Download Full-text

Pin-missing defect recognition based on feature fusion and spatial attention mechanism

Energy Reports ◽

10.1016/j.egyr.2021.11.189 ◽

2022 ◽

Vol 8 ◽

pp. 656-663

Author(s):

Hui He ◽

Yuchen Li ◽

Jing Yang ◽

Zeli Wang ◽

Bo Chen ◽

...

Keyword(s):

Spatial Attention ◽

Feature Fusion ◽

Attention Mechanism ◽

Defect Recognition

Download Full-text

Fine-grained citation count prediction via a transformer-based model with among-attention mechanism

Information Processing & Management ◽

10.1016/j.ipm.2021.102799 ◽

2022 ◽

Vol 59 (2) ◽

pp. 102799

Author(s):

Shengzhi Huang ◽

Yong Huang ◽

Yi Bu ◽

Wei Lu ◽

Jiajia Qian ◽

...

Keyword(s):

Citation Count ◽

Attention Mechanism ◽

Fine Grained

Download Full-text

Efficient Channel Attention Based Encoder–Decoder Approach for Image Captioning in Hindi

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3483597 ◽

2022 ◽

Vol 21 (3) ◽

pp. 1-17

Author(s):

Santosh Kumar Mishra ◽

Gaurav Rai ◽

Sriparna Saha ◽

Pushpak Bhattacharyya

Keyword(s):

Computer Vision ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

English Language ◽

Image Understanding ◽

Attention Mechanism ◽

Image Captioning ◽

Textual Description ◽

Hindi Language

Image captioning refers to the process of generating a textual description that describes objects and activities present in a given image. It connects two fields of artificial intelligence, computer vision, and natural language processing. Computer vision and natural language processing deal with image understanding and language modeling, respectively. In the existing literature, most of the works have been carried out for image captioning in the English language. This article presents a novel method for image captioning in the Hindi language using encoder–decoder based deep learning architecture with efficient channel attention. The key contribution of this work is the deployment of an efficient channel attention mechanism with bahdanau attention and a gated recurrent unit for developing an image captioning model in the Hindi language. Color images usually consist of three channels, namely red, green, and blue. The channel attention mechanism focuses on an image’s important channel while performing the convolution, which is basically to assign higher importance to specific channels over others. The channel attention mechanism has been shown to have great potential for improving the efficiency of deep convolution neural networks (CNNs). The proposed encoder–decoder architecture utilizes the recently introduced ECA-NET CNN to integrate the channel attention mechanism. Hindi is the fourth most spoken language globally, widely spoken in India and South Asia; it is India’s official language. By translating the well-known MSCOCO dataset from English to Hindi, a dataset for image captioning in Hindi is manually created. The efficiency of the proposed method is compared with other baselines in terms of Bilingual Evaluation Understudy (BLEU) scores, and the results obtained illustrate that the method proposed outperforms other baselines. The proposed method has attained improvements of 0.59%, 2.51%, 4.38%, and 3.30% in terms of BLEU-1, BLEU-2, BLEU-3, and BLEU-4 scores, respectively, with respect to the state-of-the-art. Qualities of the generated captions are further assessed manually in terms of adequacy and fluency to illustrate the proposed method’s efficacy.

Download Full-text

A lightweight detector based on attention mechanism for aluminum strip surface defect detection

Computers in Industry ◽

10.1016/j.compind.2021.103585 ◽

2022 ◽

Vol 136 ◽

pp. 103585

Author(s):

Zhuxi MA ◽

Yibo Li ◽

Minghui Huang ◽

Qianbin Huang ◽

Jie Cheng ◽

...

Keyword(s):

Defect Detection ◽

Surface Defect ◽

Attention Mechanism ◽

Strip Surface ◽

Aluminum Strip ◽

Surface Defect Detection

Download Full-text

Short-term load forecasting based on LSTM networks considering attention mechanism

International Journal of Electrical Power & Energy Systems ◽

10.1016/j.ijepes.2021.107818 ◽

2022 ◽

Vol 137 ◽

pp. 107818

Author(s):

Jun Lin ◽

Jin Ma ◽

Jianguo Zhu ◽

Yu Cui

Keyword(s):

Load Forecasting ◽

Attention Mechanism ◽

Short Term ◽

Short Term Load Forecasting

Download Full-text

A novel time–frequency Transformer based on self–attention mechanism and its application in fault diagnosis of rolling bearings

Mechanical Systems and Signal Processing ◽

10.1016/j.ymssp.2021.108616 ◽

2022 ◽

Vol 168 ◽

pp. 108616

Author(s):

Yifei Ding ◽

Minping Jia ◽

Qiuhua Miao ◽

Yudong Cao

Keyword(s):

Fault Diagnosis ◽

Attention Mechanism ◽

Rolling Bearings ◽

Time Frequency

Download Full-text

Short-term wind power forecasting based on Attention Mechanism and Deep Learning

Electric Power Systems Research ◽

10.1016/j.epsr.2022.107776 ◽

2022 ◽

Vol 206 ◽

pp. 107776

Author(s):

Bangru Xiong ◽

Lu Lou ◽

Xinyu Meng ◽

Xin Wang ◽

Hui Ma ◽

...

Keyword(s):

Deep Learning ◽

Wind Power ◽

Attention Mechanism ◽

Short Term ◽

Wind Power Forecasting ◽

Power Forecasting

Download Full-text

KRAN: Knowledge Refining Attention Network for Recommendation

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3470783 ◽

2022 ◽

Vol 16 (2) ◽

pp. 1-20

Author(s):

Zhenyu Zhang ◽

Lei Zhang ◽

Dingqi Yang ◽

Liu Yang

Keyword(s):

State Of The Art ◽

Negative Impact ◽

Cold Start ◽

Attention Mechanism ◽

Knowledge Graph ◽

Convolutional Network ◽

Auxiliary Data ◽

Attention Network ◽

Additional Information ◽

Data Source

Recommender algorithms combining knowledge graph and graph convolutional network are becoming more and more popular recently. Specifically, attributes describing the items to be recommended are often used as additional information. These attributes along with items are highly interconnected, intrinsically forming a Knowledge Graph (KG). These algorithms use KGs as an auxiliary data source to alleviate the negative impact of data sparsity. However, these graph convolutional network based algorithms do not distinguish the importance of different neighbors of entities in the KG, and according to Pareto’s principle, the important neighbors only account for a small proportion. These traditional algorithms can not fully mine the useful information in the KG. To fully release the power of KGs for building recommender systems, we propose in this article KRAN, a Knowledge Refining Attention Network, which can subtly capture the characteristics of the KG and thus boost recommendation performance. We first introduce a traditional attention mechanism into the KG processing, making the knowledge extraction more targeted, and then propose a refining mechanism to improve the traditional attention mechanism to extract the knowledge in the KG more effectively. More precisely, KRAN is designed to use our proposed knowledge-refining attention mechanism to aggregate and obtain the representations of the entities (both attributes and items) in the KG. Our knowledge-refining attention mechanism first measures the relevance between an entity and it’s neighbors in the KG by attention coefficients, and then further refines the attention coefficients using a “richer-get-richer” principle, in order to focus on highly relevant neighbors while eliminating less relevant neighbors for noise reduction. In addition, for the item cold start problem, we propose KRAN-CD, a variant of KRAN, which further incorporates pre-trained KG embeddings to handle cold start items. Experiments show that KRAN and KRAN-CD consistently outperform state-of-the-art baselines across different settings.

Download Full-text

Event temporal relation extraction with attention mechanism and graph neural network

Tsinghua Science & Technology ◽

10.26599/tst.2020.9010063 ◽

2022 ◽

Vol 27 (1) ◽

pp. 79-90

Author(s):

Xiaoliang Xu ◽

Tong Gao ◽

Yuxiang Wang ◽

Xinle Xuan

Keyword(s):

Neural Network ◽

Relation Extraction ◽

Attention Mechanism ◽

Temporal Relation ◽

Temporal Relation Extraction

Download Full-text

attention mechanism
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

SANTM: Efficient Self-attention-driven Network for Text Matching

Pin-missing defect recognition based on feature fusion and spatial attention mechanism

Fine-grained citation count prediction via a transformer-based model with among-attention mechanism

Efficient Channel Attention Based Encoder–Decoder Approach for Image Captioning in Hindi

A lightweight detector based on attention mechanism for aluminum strip surface defect detection

Short-term load forecasting based on LSTM networks considering attention mechanism

A novel time–frequency Transformer based on self–attention mechanism and its application in fault diagnosis of rolling bearings

Short-term wind power forecasting based on Attention Mechanism and Deep Learning

KRAN: Knowledge Refining Attention Network for Recommendation

Event temporal relation extraction with attention mechanism and graph neural network

Export Citation Format

attention mechanismRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

SANTM: Efficient Self-attention-driven Network for Text Matching

Pin-missing defect recognition based on feature fusion and spatial attention mechanism

Fine-grained citation count prediction via a transformer-based model with among-attention mechanism

Efficient Channel Attention Based Encoder–Decoder Approach for Image Captioning in Hindi

A lightweight detector based on attention mechanism for aluminum strip surface defect detection

Short-term load forecasting based on LSTM networks considering attention mechanism

A novel time–frequency Transformer based on self–attention mechanism and its application in fault diagnosis of rolling bearings

Short-term wind power forecasting based on Attention Mechanism and Deep Learning

KRAN: Knowledge Refining Attention Network for Recommendation

Event temporal relation extraction with attention mechanism and graph neural network

attention mechanism
Recently Published Documents