Channel Interaction Networks for Fine-Grained Image Categorization

Fine-grained image categorization is challenging due to the subtle inter-class differences. We posit that exploiting the rich relationships between channels can help capture such differences since different channels correspond to different semantics. In this paper, we propose a channel interaction network (CIN), which models the channel-wise interplay both within an image and across images. For a single image, a self-channel interaction (SCI) module is proposed to explore channel-wise correlation within the image. This allows the model to learn the complementary features from the correlated channels, yielding stronger fine-grained features. Furthermore, given an image pair, we introduce a contrastive channel interaction (CCI) module to model the cross-sample channel interaction with a metric learning framework, allowing the CIN to distinguish the subtle visual differences between images. Our model can be trained efficiently in an end-to-end fashion without the need of multi-stage training and testing. Finally, comprehensive experiments are conducted on three publicly available benchmarks, where the proposed method consistently outperforms the state-of-the-art approaches, such as DFL-CNN(Wang, Morariu, and Davis 2018) and NTS(Yang et al. 2018).

Download Full-text

Fine-grained visual categorization via multi-stage metric learning

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) ◽

10.1109/cvpr.2015.7298995 ◽

2015 ◽

Cited By ~ 50

Author(s):

Qi Qian ◽

Rong Jin ◽

Shenghuo Zhu ◽

Yuanqing Lin

Keyword(s):

Metric Learning ◽

Visual Categorization ◽

Fine Grained ◽

Multi Stage

Download Full-text

Metric Learning Based Fine-Grained Classification for PolSAR Imagery

IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss39084.2020.9323087 ◽

2020 ◽

Author(s):

Jun Ni ◽

Yunzhe Jia ◽

Qiang Yin ◽

Yongsheng Zhou ◽

Fan Zhang

Keyword(s):

Metric Learning ◽

Fine Grained

Download Full-text

TOAN: Target-Oriented Alignment Network for Fine-Grained Image Categorization with Few Labeled Samples

IEEE Transactions on Circuits and Systems for Video Technology ◽

10.1109/tcsvt.2021.3065693 ◽

2021 ◽

pp. 1-1

Author(s):

Huaxi Huang ◽

Junjie Zhang ◽

Litao Yu ◽

Jian Zhang ◽

Qiang Wu ◽

...

Keyword(s):

Image Categorization ◽

Fine Grained

Download Full-text

A Saliency-based Weakly-supervised Network for Fine-Grained Image Categorization

2020 13th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) ◽

10.1109/cisp-bmei51763.2020.9263683 ◽

2020 ◽

Author(s):

Yawen Han ◽

Fang Meng

Keyword(s):

Image Categorization ◽

Fine Grained ◽

Weakly Supervised

Download Full-text

A Public Dataset for Fine-Grained Ship Classification in Optical Remote Sensing Images

Remote Sensing ◽

10.3390/rs13040747 ◽

2021 ◽

Vol 13 (4) ◽

pp. 747

Author(s):

Yanghua Di ◽

Zhiguo Jiang ◽

Haopeng Zhang

Keyword(s):

Remote Sensing ◽

Image Data ◽

Remote Sensing Image ◽

Google Earth ◽

Optical Remote Sensing ◽

Remote Sensing Images ◽

Visual Categorization ◽

Class Differences ◽

Fine Grained ◽

Ship Classification

Fine-grained visual categorization (FGVC) is an important and challenging problem due to large intra-class differences and small inter-class differences caused by deformation, illumination, angles, etc. Although major advances have been achieved in natural images in the past few years due to the release of popular datasets such as the CUB-200-2011, Stanford Cars and Aircraft datasets, fine-grained ship classification in remote sensing images has been rarely studied because of relative scarcity of publicly available datasets. In this paper, we investigate a large amount of remote sensing image data of sea ships and determine most common 42 categories for fine-grained visual categorization. Based our previous DSCR dataset, a dataset for ship classification in remote sensing images, we collect more remote sensing images containing warships and civilian ships of various scales from Google Earth and other popular remote sensing image datasets including DOTA, HRSC2016, NWPU VHR-10, We call our dataset FGSCR-42, meaning a dataset for Fine-Grained Ship Classification in Remote sensing images with 42 categories. The whole dataset of FGSCR-42 contains 9320 images of most common types of ships. We evaluate popular object classification algorithms and fine-grained visual categorization algorithms to build a benchmark. Our FGSCR-42 dataset is publicly available at our webpages.

Download Full-text

An Enhanced Multi-Stage Deep Learning Framework for Detecting Malicious Activities From Autonomous Vehicles

IEEE Transactions on Intelligent Transportation Systems ◽

10.1109/tits.2021.3105834 ◽

2021 ◽

pp. 1-10

Author(s):

Izhar Ahmed Khan ◽

Nour Moustafa ◽

Dechang Pi ◽

Waqas Haider ◽

Bentian Li ◽

...

Keyword(s):

Deep Learning ◽

Autonomous Vehicles ◽

Learning Framework ◽

Multi Stage

Download Full-text

Semi-Supervised Aspect-Based Sentiment Analysis for Case-Related Microblog Reviews Using Case Knowledge Graph Embedding

International Journal of Asian Language Processing ◽

10.1142/s2717554520500125 ◽

2021 ◽

pp. 2050012

Author(s):

Peilian Zhao ◽

Cunli Mao ◽

Zhengtao Yu

Keyword(s):

Sentiment Analysis ◽

Domain Knowledge ◽

Opinion Mining ◽

Data Augmentation ◽

Training Data ◽

Knowledge Graph ◽

Fine Grained ◽

Learning Framework ◽

Proposed Model ◽

Real World Applications

Aspect-Based Sentiment Analysis (ABSA), a fine-grained task of opinion mining, which aims to extract sentiment of specific target from text, is an important task in many real-world applications, especially in the legal field. Therefore, in this paper, we study the problem of limitation of labeled training data required and ignorance of in-domain knowledge representation for End-to-End Aspect-Based Sentiment Analysis (E2E-ABSA) in legal field. We proposed a new method under deep learning framework, named Semi-ETEKGs, which applied E2E framework using knowledge graph (KG) embedding in legal field after data augmentation (DA). Specifically, we pre-trained the BERT embedding and in-domain KG embedding for unlabeled data and labeled data with case elements after DA, and then we put two embeddings into the E2E framework to classify the polarity of target-entity. Finally, we built a case-related dataset based on a popular benchmark for ABSA to prove the efficiency of Semi-ETEKGs, and experiments on case-related dataset from microblog comments show that our proposed model outperforms the other compared methods significantly.

Download Full-text

A Social Recommendation Based on Metric Learning and Users’ Co-Occurrence Pattern

Symmetry ◽

10.3390/sym13112158 ◽

2021 ◽

Vol 13 (11) ◽

pp. 2158

Author(s):

Xin Zhang ◽

Jiwei Qin ◽

Jiong Zheng

Keyword(s):

Matrix Factorization ◽

Triangle Inequality ◽

Metric Learning ◽

Social Recommendation ◽

Rating Data ◽

Fine Grained ◽

Information Metric ◽

Occurrence Pattern ◽

Public Datasets ◽

Occurrence Patterns

For personalized recommender systems, matrix factorization and its variants have become mainstream in collaborative filtering. However, the dot product in matrix factorization does not satisfy the triangle inequality and therefore fails to capture fine-grained information. Metric learning-based models have been shown to be better at capturing fine-grained information than matrix factorization. Nevertheless, most of these models only focus on rating data and social information, which are not sufficient for dealing with the challenges of data sparsity. In this paper, we propose a metric learning-based social recommendation model called SRMC. SRMC exploits users’ co-occurrence patterns to discover their potentially similar or dissimilar users with symmetric relationships and change their relative positions to achieve better recommendations. Experiments on three public datasets show that our model is more effective than the compared models.

Download Full-text

Deep Polarized Network for Supervised Learning of Accurate Binary Hashing Codes

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/115 ◽

2020 ◽

Cited By ~ 1

Author(s):

Lixin Fan ◽

Kam Woh Ng ◽

Ce Ju ◽

Tianyu Zhang ◽

Chee Seng Chan

Keyword(s):

Large Deviations ◽

Hamming Distance ◽

State Of The Art ◽

Metric Learning ◽

Binary Codes ◽

Distance Metric Learning ◽

Learning Framework ◽

Label Information ◽

Learning To Hash ◽

Random Codes

This paper proposes a novel deep polarized network (DPN) for learning to hash, in which each channel in the network outputs is pushed far away from zero by employing a differentiable bit-wise hinge-like loss which is dubbed as polarization loss. Reformulated within a generic Hamming Distance Metric Learning framework [Norouzi et al., 2012], the proposed polarization loss bypasses the requirement to prepare pairwise labels for (dis-)similar items and, yet, the proposed loss strictly bounds from above the pairwise Hamming Distance based losses. The intrinsic connection between pairwise and pointwise label information, as disclosed in this paper, brings about the following methodological improvements: (a) we may directly employ the proposed differentiable polarization loss with no large deviations incurred from the target Hamming distance based loss; and (b) the subtask of assigning binary codes becomes extremely simple --- even random codes assigned to each class suffice to result in state-of-the-art performances, as demonstrated in CIFAR10, NUS-WIDE and ImageNet100 datasets.

Download Full-text

Image matching based on a structured deep coupled metric learning framework

Signal Image and Video Processing ◽

10.1007/s11760-021-02120-z ◽

2022 ◽

Author(s):

Guixia Fu ◽

Guofeng Zou ◽

Mingliang Gao ◽

Zhenzhou Wang ◽

Zheng Liu

Keyword(s):

Image Matching ◽

Metric Learning ◽

Learning Framework

Download Full-text