Multi-branch Channel-wise Enhancement Network for Fine-grained Visual Recognition

2021 ◽  
Author(s):  
Guangjun Li ◽  
Yongxiong Wang ◽  
Fengting Zhu
Author(s):  
Chaojian Yu ◽  
Xinyi Zhao ◽  
Qi Zheng ◽  
Peng Zhang ◽  
Xinge You

Author(s):  
Qiuxia Lai ◽  
Yu Li ◽  
Ailing Zeng ◽  
Minhao Liu ◽  
Hanqiu Sun ◽  
...  

The selective visual attention mechanism in the human visual system (HVS) restricts the amount of information to reach visual awareness for perceiving natural scenes, allowing near real-time information processing with limited computational capacity. This kind of selectivity acts as an ‘Information Bottleneck (IB)’, which seeks a trade-off between information compression and predictive accuracy. However, such information constraints are rarely explored in the attention mechanism for deep neural networks (DNNs). In this paper, we propose an IB-inspired spatial attention module for DNN structures built for visual recognition. The module takes as input an intermediate representation of the input image, and outputs a variational 2D attention map that minimizes the mutual information (MI) between the attention-modulated representation and the input, while maximizing the MI between the attention-modulated representation and the task label. To further restrict the information bypassed by the attention map, we quantize the continuous attention scores to a set of learnable anchor values during training. Extensive experiments show that the proposed IB-inspired spatial attention mechanism can yield attention maps that neatly highlight the regions of interest while suppressing backgrounds, and bootstrap standard DNN structures for visual recognition tasks (e.g., image classification, fine-grained recognition, cross-domain classification). The attention maps are interpretable for the decision making of the DNNs as verified in the experiments. Our code is available at this https URL.


-The recognition of Indian food can be considered as a fine-grained visual recognition due to the same class photos may provide considerable amount of variability. Thus, an effective segmentation and classification method is needed to provide refined analysis. While only consideration of CNN may cause limitation through the absence of constraints such as shape and edge that causes output of segmentation to be rough on their edges. In order overcome this difficulty, a post-processing step is required; in this paper we proposed an EA based DCNNs model for effective segmentation. The EA is directly formulated with the DCNNs approach, which allows training step to get beneficial from both the approaches for spatial data relationship. The EA will help to get better-refined output after receiving the features from powerful DCNNs. The EA-DCNN training model contains convolution, rectified linear unit and pooling that is much relevant and practical to get optimize segmentation of food image. In order to evaluate the performance of our proposed model we will compare with the ground-truth data at several validation parameters


Author(s):  
Yan Em ◽  
Feng Gag ◽  
Yihang Lou ◽  
Shiqi Wang ◽  
Tiejun Huang ◽  
...  

2021 ◽  
pp. 1-1
Author(s):  
Huafeng Liu ◽  
Chuanyi Zhang ◽  
Yazhou Yao ◽  
Xiushen Wei ◽  
Fumin Shen ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document