Recognizing Food Places in Egocentric Photo-Streams Using Multi-Scale Atrous Convolutional Networks and Self-Attention Mechanism

Computer vision systems are insensitive to the scale of objects in natural scenes, so it is important to study the multi-scale representation of features. Res2Net implements hierarchical multi-scale convolution in residual blocks, but its random grouping method affects the robustness and intuitive interpretability of the network. We propose a new multi-scale convolution model based on multiple attention. It introduces the attention mechanism into the structure of a Res2-block to better guide feature expression. First, we adopt channel attention to score channels and sort them in descending order of the feature’s importance (Channels-Sort). The sorted residual blocks are grouped and intra-block hierarchically convolved to form a single attention and multi-scale block (AMS-block). Then, we implement channel attention on the residual small blocks to constitute a dual attention and multi-scale block (DAMS-block). Introducing spatial attention before sorting the channels to form multi-attention multi-scale blocks(MAMS-block). A MAMS-convolutional neural network (CNN) is a series of multiple MAMS-blocks. It enables significant information to be expressed at more levels, and can also be easily grafted into different convolutional structures. Limited by hardware conditions, we only prove the validity of the proposed ideas through convolutional networks of the same magnitude. The experimental results show that the convolution model with an attention mechanism and multi-scale features is superior in image classification.

Download Full-text

Deep learning-based tool wear prediction and its application for machining process using multi-scale feature fusion and channel attention mechanism

Measurement ◽

10.1016/j.measurement.2021.109254 ◽

2021 ◽

Vol 177 ◽

pp. 109254

Author(s):

Xingwei Xu ◽

Jianwen Wang ◽

Bingfu Zhong ◽

Weiwei Ming ◽

Ming Chen

Keyword(s):

Deep Learning ◽

Tool Wear ◽

Feature Fusion ◽

Attention Mechanism ◽

Machining Process ◽

Wear Prediction ◽

Scale Feature ◽

Multi Scale ◽

Tool Wear Prediction

Download Full-text

Multi-scale Strategy Based 3D Dual-Encoder Brain Tumor Segmentation Network with Attention Mechanism

2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) ◽

10.1109/bibm49941.2020.9313089 ◽

2020 ◽

Author(s):

Yazhou Zhu ◽

Xiang Pan ◽

Jing Zhu ◽

Lihua Li

Keyword(s):

Brain Tumor ◽

Attention Mechanism ◽

Tumor Segmentation ◽

Brain Tumor Segmentation ◽

Multi Scale

Download Full-text

Non-Intrusive Load Disaggregation Based on a Multi-Scale Attention Residual Network

Applied Sciences ◽

10.3390/app10249132 ◽

2020 ◽

Vol 10 (24) ◽

pp. 9132

Author(s):

Liguo Weng ◽

Xiaodong Zhang ◽

Junhao Qian ◽

Min Xia ◽

Yiqing Xu ◽

...

Keyword(s):

Smart Grids ◽

Recognition Rate ◽

Low Frequency ◽

Attention Mechanism ◽

Learning Ability ◽

Residual Network ◽

Multi Scale ◽

Energy Disaggregation ◽

Benchmark Datasets ◽

Load Disaggregation

Non-intrusive load disaggregation (NILD) is of great significance to the development of smart grids. Current energy disaggregation methods extract features from sequences, and this process easily leads to a loss of load features and difficulties in detecting, resulting in a low recognition rate of low-use electrical appliances. To solve this problem, a non-intrusive sequential energy disaggregation method based on a multi-scale attention residual network is proposed. Multi-scale convolutions are used to learn features, and the attention mechanism is used to enhance the learning ability of load features. The residual learning further improves the performance of the algorithm, avoids network degradation, and improves the precision of load decomposition. The experimental results on two benchmark datasets show that the proposed algorithm has more advantages than the existing algorithms in terms of load disaggregation accuracy and judgments of the on/off state, and the attention mechanism can further improve the disaggregation accuracy of low-frequency electrical appliances.

Download Full-text

Image Compressive Sensing via Multi-scale Feature Extraction and Attention Mechanism

2020 International Conference on Intelligent Computing, Automation and Systems (ICICAS) ◽

10.1109/icicas51530.2020.00061 ◽

2020 ◽

Author(s):

Chuning He

Keyword(s):

Feature Extraction ◽

Compressive Sensing ◽

Attention Mechanism ◽

Scale Feature ◽

Multi Scale

Download Full-text

U2-ONet: A Two-Level Nested Octave U-Structure Network with a Multi-Scale Attention Mechanism for Moving Object Segmentation

Remote Sensing ◽

10.3390/rs13010060 ◽

2020 ◽

Vol 13 (1) ◽

pp. 60

Author(s):

Chenjie Wang ◽

Chengyuan Li ◽

Jun Liu ◽

Bin Luo ◽

Xin Su ◽

...

Keyword(s):

Moving Objects ◽

Object Segmentation ◽

Contextual Information ◽

Attention Mechanism ◽

Moving Object ◽

Feature Maps ◽

Moving Object Segmentation ◽

Practical Applications ◽

Multi Scale ◽

Spatial Redundancy

Most scenes in practical applications are dynamic scenes containing moving objects, so accurately segmenting moving objects is crucial for many computer vision applications. In order to efficiently segment all the moving objects in the scene, regardless of whether the object has a predefined semantic label, we propose a two-level nested octave U-structure network with a multi-scale attention mechanism, called U2-ONet. U2-ONet takes two RGB frames, the optical flow between these frames, and the instance segmentation of the frames as inputs. Each stage of U2-ONet is filled with the newly designed octave residual U-block (ORSU block) to enhance the ability to obtain more contextual information at different scales while reducing the spatial redundancy of the feature maps. In order to efficiently train the multi-scale deep network, we introduce a hierarchical training supervision strategy that calculates the loss at each level while adding knowledge-matching loss to keep the optimization consistent. The experimental results show that the proposed U2-ONet method can achieve a state-of-the-art performance in several general moving object segmentation datasets.

Download Full-text

A Multi-Scale Fusion Convolutional Neural Network based on Attention Mechanism for the Visualization Analysis of EEG Signals Decoding

IEEE Transactions on Neural Systems and Rehabilitation Engineering ◽

10.1109/tnsre.2020.3037326 ◽

2020 ◽

pp. 1-1

Author(s):

Donglin Li ◽

Jiacan Xu ◽

Jianhui Wang ◽

Xiaoke Fang ◽

Ji Ying

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Attention Mechanism ◽

Eeg Signals ◽

Multi Scale ◽

Visualization Analysis

Download Full-text

Simultaneous Segmentation of Fetal Hearts and Lungs for Medical Ultrasound Images via an Efficient Multi-scale Model Integrated With Attention Mechanism

Ultrasonic Imaging ◽

10.1177/01617346211042526 ◽

2021 ◽

pp. 016173462110425

Author(s):

Jianing Xi ◽

Jiangang Chen ◽

Zhao Wang ◽

Dean Ta ◽

Bing Lu ◽

...

Keyword(s):

Congenital Anomaly ◽

Large Scale ◽

Automatic Segmentation ◽

Receptive Fields ◽

Semantic Segmentation ◽

Attention Mechanism ◽

Scale Model ◽

Ultrasound Images ◽

Multi Scale ◽

Task Irrelevant

Large scale early scanning of fetuses via ultrasound imaging is widely used to alleviate the morbidity or mortality caused by congenital anomalies in fetal hearts and lungs. To reduce the intensive cost during manual recognition of organ regions, many automatic segmentation methods have been proposed. However, the existing methods still encounter multi-scale problem at a larger range of receptive fields of organs in images, resolution problem of segmentation mask, and interference problem of task-irrelevant features, obscuring the attainment of accurate segmentations. To achieve semantic segmentation with functions of (1) extracting multi-scale features from images, (2) compensating information of high resolution, and (3) eliminating the task-irrelevant features, we propose a multi-scale model with skip connection framework and attention mechanism integrated. The multi-scale feature extraction modules are incorporated with additive attention gate units for irrelevant feature elimination, through a U-Net framework with skip connections for information compensation. The performance of fetal heart and lung segmentation indicates the superiority of our method over the existing deep learning based approaches. Our method also shows competitive performance stability during the task of semantic segmentations, showing a promising contribution on ultrasound based prognosis of congenital anomaly in the early intervention, and alleviating the negative effects caused by congenital anomaly.

Download Full-text