Pin-missing defect recognition based on feature fusion and spatial attention mechanism

In recent years, with the rapid development of computer vision, increasing attention has been paid to remote sensing image scene classification. To improve the classification performance, many studies have increased the depth of convolutional neural networks (CNNs) and expanded the width of the network to extract more deep features, thereby increasing the complexity of the model. To solve this problem, in this paper, we propose a lightweight convolutional neural network based on attention-oriented multi-branch feature fusion (AMB-CNN) for remote sensing image scene classification. Firstly, we propose two convolution combination modules for feature extraction, through which the deep features of images can be fully extracted with multi convolution cooperation. Then, the weights of the feature are calculated, and the extracted deep features are sent to the attention mechanism for further feature extraction. Next, all of the extracted features are fused by multiple branches. Finally, depth separable convolution and asymmetric convolution are implemented to greatly reduce the number of parameters. The experimental results show that, compared with some state-of-the-art methods, the proposed method still has a great advantage in classification accuracy with very few parameters.

Download Full-text

A new Feature-Fusion method based on training dataset prototype for surface defect recognition

Advanced Engineering Informatics ◽

10.1016/j.aei.2021.101392 ◽

2021 ◽

Vol 50 ◽

pp. 101392

Author(s):

Yucheng Wang ◽

Xinyu Li ◽

Yiping Gao ◽

Lijian Wang ◽

Liang Gao

Keyword(s):

Surface Defect ◽

Feature Fusion ◽

Training Dataset ◽

Fusion Method ◽

New Feature ◽

Defect Recognition

Download Full-text

Automatic Extraction of Layover From InSAR Imagery Based on Multilayer Feature Fusion Attention Mechanism

IEEE Geoscience and Remote Sensing Letters ◽

10.1109/lgrs.2021.3105722 ◽

2021 ◽

pp. 1-5

Author(s):

Xingmin Cai ◽

Lifu Chen ◽

Jin Xing ◽

Xuemin Xing ◽

Ru Luo ◽

...

Keyword(s):

Feature Fusion ◽

Attention Mechanism ◽

Automatic Extraction

Download Full-text

Multi-feature fusion gaze estimation based on attention mechanism

10.1117/12.2602019 ◽

2021 ◽

Author(s):

Zhangfang Hu ◽

Yanling Xia ◽

Yuan Luo ◽

Lan Wang

Keyword(s):

Feature Fusion ◽

Attention Mechanism ◽

Gaze Estimation

Download Full-text

UAV-based cross-view geo-localization fusion spatial attention mechanism and Netvlad

10.1109/icsai53574.2021.9664015 ◽

2021 ◽

Author(s):

Zongbao Liang ◽

Xing Liu ◽

Bo Chen ◽

YunFei Yuan ◽

Yang Song ◽

...

Keyword(s):

Spatial Attention ◽

Attention Mechanism

Download Full-text

Feature fusion network based on attention mechanism for 3D semantic segmentation of point clouds

Pattern Recognition Letters ◽

10.1016/j.patrec.2020.03.021 ◽

2020 ◽

Vol 133 ◽

pp. 327-333 ◽

Cited By ~ 1

Author(s):

Heng Zhou ◽

Zhijun Fang ◽

Yongbin Gao ◽

Bo Huang ◽

Cengsi Zhong ◽

...

Keyword(s):

Feature Fusion ◽

Semantic Segmentation ◽

Point Clouds ◽

Attention Mechanism

Download Full-text

Video Description Model Based on Temporal-Spatial and Channel Multi-Attention Mechanisms

Applied Sciences ◽

10.3390/app10124312 ◽

2020 ◽

Vol 10 (12) ◽

pp. 4312 ◽

Cited By ~ 1

Author(s):

Jie Xu ◽

Haoliang Wei ◽

Linke Li ◽

Qiuru Fu ◽

Jinhong Guo

Keyword(s):

Neural Network ◽

Spatial Attention ◽

Semantic Information ◽

Attention Mechanism ◽

Visual Features ◽

Feature Maps ◽

Global Features ◽

Model Based ◽

Video Description ◽

Video Visualization

Video description plays an important role in the field of intelligent imaging technology. Attention perception mechanisms are extensively applied in video description models based on deep learning. Most existing models use a temporal-spatial attention mechanism to enhance the accuracy of models. Temporal attention mechanisms can obtain the global features of a video, whereas spatial attention mechanisms obtain local features. Nevertheless, because each channel of the convolutional neural network (CNN) feature maps has certain spatial semantic information, it is insufficient to merely divide the CNN features into regions and then apply a spatial attention mechanism. In this paper, we propose a temporal-spatial and channel attention mechanism that enables the model to take advantage of various video features and ensures the consistency of visual features between sentence descriptions to enhance the effect of the model. Meanwhile, in order to prove the effectiveness of the attention mechanism, this paper proposes a video visualization model based on the video description. Experimental results show that, our model has achieved good performance on the Microsoft Video Description (MSVD) dataset and a certain improvement on the Microsoft Research-Video to Text (MSR-VTT) dataset.

Download Full-text