iFAN: Image-Instance Full Alignment Networks for Adaptive Object Detection

Chenfan Zhuang; Xintong Han; Weilin Huang; Matthew Scott

doi:10.1609/aaai.v34i07.7015

Proceedings of the AAAI Conference on Artificial Intelligence ◽

2020 ◽

Vol 34 (07) ◽

pp. 13122-13129 ◽

Cited By ~ 3

Keyword(s):

Object Detection ◽

Best Practice ◽

Domain Adaptation ◽

Metric Learning ◽

Strong Relationship ◽

Adversarial Learning ◽

Multi Scale ◽

Rich Domain ◽

Multiple Alignments ◽

Coarse To Fine

Training an object detector on a data-rich domain and applying it to a data-poor one with limited performance drop is highly attractive in industry because it saves substantial annotation cost. Recent research on unsupervised domain adaptive object detection has verified that aligning data distributions between source and target images through adversarial learning is very useful. The key is when, where, and how to apply it to achieve best practice. We propose Image-Instance Full Alignment Networks (iFAN) to tackle this problem by precisely aligning feature distributions at both the image and instance levels: 1) Image-level alignment: multi-scale features are roughly aligned by training adversarial domain classifiers in a hierarchically nested fashion. 2) Full instance-level alignment: deep semantic information and elaborate instance representations are fully exploited to establish a strong relationship among categories and domains. Establishing these correlations is formulated as a metric learning problem by carefully constructing instance pairs. The above-mentioned adaptations can be integrated into an object detector (e.g., Faster R-CNN), resulting in an end-to-end trainable framework in which multiple alignments work collaboratively in a coarse-to-fine manner. On two domain adaptation tasks, synthetic-to-real (SIM10K → Cityscapes) and normal-to-foggy weather (Cityscapes → Foggy Cityscapes), iFAN outperforms the state-of-the-art methods with a boost of 10%+ AP over the source-only baseline.
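The abstract frames instance-level alignment as metric learning over constructed instance pairs but does not give the loss. As a rough illustration only, the sketch below uses a generic contrastive loss, which is an assumption and not necessarily the formulation used in iFAN: instances of the same category are pulled together and instances of different categories are pushed at least a margin apart. The function name, the `margin` value, and the `(feature, category, domain)` tuple layout are all hypothetical.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def contrastive_pair_loss(instances, margin=1.0, cross_domain_only=False):
    """Average contrastive loss over instance pairs (illustrative assumption).

    instances: list of (feature, category, domain) tuples, where feature is a
    list of floats and domain is e.g. "source" or "target".
    """
    total, count = 0.0, 0
    for (fa, ca, da), (fb, cb, db) in combinations(instances, 2):
        if cross_domain_only and da == db:
            continue  # optionally keep only source/target pairs
        d = euclidean(fa, fb)
        if ca == cb:
            total += d ** 2  # same category: pull features together
        else:
            total += max(0.0, margin - d) ** 2  # different category: push apart
        count += 1
    return total / count if count else 0.0


# Hypothetical toy instances: two cars from different domains and one person.
toy = [
    ([0.0, 0.0], "car", "source"),
    ([0.1, 0.0], "car", "target"),
    ([2.0, 2.0], "person", "target"),
]
loss = contrastive_pair_loss(toy, margin=1.0)
```

Setting `cross_domain_only=True` restricts the loss to source/target pairs, which is one plausible reading of relating categories across domains; the abstract does not say which pairs are actually used.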

C2FDA: Coarse-to-Fine Domain Adaptation for Traffic Object Detection

IEEE Transactions on Intelligent Transportation Systems ◽

10.1109/tits.2021.3115823 ◽

2021 ◽

pp. 1-15

Author(s):

Hui Zhang ◽

Guiyang Luo ◽

Jinglin Li ◽

Fei-Yue Wang

Keyword(s):

Object Detection ◽

Domain Adaptation ◽

Coarse To Fine ◽

Fine Domain

Adaptive Coarse-to-Fine Interactor for Multi-Scale Object Detection

10.1109/ijcnn52387.2021.9533339 ◽

2021 ◽

Author(s):

Zekun Li ◽

Yufan Liu ◽

Bing Li ◽

Weiming Hu ◽

Xue Zhou

Keyword(s):

Object Detection ◽

Multi Scale ◽

Coarse To Fine

Spatial hierarchy perception and hard samples metric learning for high-resolution remote sensing image object detection

Applied Intelligence ◽

10.1007/s10489-021-02335-0 ◽

2021 ◽

Author(s):

Dongjun Zhu ◽

Shixiong Xia ◽

Jiaqi Zhao ◽

Yong Zhou ◽

Qiang Niu ◽

...

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Object Detection ◽

Metric Learning ◽

Remote Sensing Image ◽

Spatial Hierarchy ◽

Image Object Detection ◽

Image Object ◽

Hard Samples

Generative and self-supervised domain adaptation for one-stage object detection

Array ◽

10.1016/j.array.2021.100071 ◽

2021 ◽

pp. 100071

Author(s):

Kazuma Fujii ◽

Kazuhiko Kawamoto

Keyword(s):

Object Detection ◽

Domain Adaptation ◽

One Stage

Deep multi-scale adversarial network with attention: A novel domain adaptation method for intelligent fault diagnosis

Journal of Manufacturing Systems ◽

10.1016/j.jmsy.2021.03.024 ◽

2021 ◽

Vol 59 ◽

pp. 565-576

Author(s):

Bo Zhao ◽

Xianmin Zhang ◽

Zhenhui Zhan ◽

Qiqiang Wu

Keyword(s):

Fault Diagnosis ◽

Domain Adaptation ◽

Intelligent Fault Diagnosis ◽

Multi Scale ◽

Adversarial Network ◽

Adaptation Method

A new multi-scale backbone network for object detection based on asymmetric convolutions

Science Progress ◽

10.1177/00368504211011343 ◽

2021 ◽

Vol 104 (2) ◽

pp. 003685042110113

Author(s):

Xianghua Ma ◽

Zhenkun Yang

Keyword(s):

Object Detection ◽

Image Features ◽

Detection Accuracy ◽

Mobile Platforms ◽

Multi Scale ◽

Backbone Network ◽

Aspect Ratios ◽

Pascal Voc ◽

Scale Characteristics ◽

Detection Speed

Real-time object detection on mobile platforms is a crucial but challenging computer vision task. It is widely recognized that although lightweight object detectors have a high detection speed, their detection accuracy is relatively low. To improve detection accuracy, it is beneficial to extract complete multi-scale image features in visual cognitive tasks. Asymmetric convolutions have a useful quality: their different aspect ratios can be used to extract image features of objects, especially objects with multi-scale characteristics. In this paper, we exploit three different asymmetric convolutions in parallel and propose a new multi-scale asymmetric convolution unit, the MAC block, to enhance the multi-scale representation ability of CNNs. In addition, the MAC block can adaptively merge features at different scales by allocating learnable weight parameters to the three asymmetric convolution branches. The proposed MAC blocks can be inserted into a state-of-the-art backbone such as ResNet-50 to form a new multi-scale backbone network for object detectors. To evaluate the performance of the MAC block, we conduct experiments on the CIFAR-100, PASCAL VOC 2007, PASCAL VOC 2012, and MS COCO 2014 datasets. Experimental results show that detection precision can be greatly improved while a fast detection speed is maintained.
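The idea of merging parallel asymmetric convolution branches with learnable weights can be sketched on a single-channel image in plain Python. This is only a toy illustration under stated assumptions: the abstract does not specify the kernel shapes or how the branch weights are normalized, so the 1x3, 3x1, and 1x5 kernels and the softmax normalization below are hypothetical choices, as are all function names.

```python
import math


def conv2d_same(image, kernel):
    """Cross-correlate a 2D list with an odd-sized kernel, zero-padded so the
    output has the same height and width as the input."""
    h, w = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for u in range(kh):
                for v in range(kw):
                    y, x = i + u - ph, j + v - pw
                    if 0 <= y < h and 0 <= x < w:
                        acc += image[y][x] * kernel[u][v]
            out[i][j] = acc
    return out


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]


def mac_block_sketch(image, kernels, branch_logits):
    """Weighted sum of parallel asymmetric-convolution branches.

    branch_logits stand in for the learnable per-branch parameters; the
    softmax normalization is an assumption, not taken from the paper.
    """
    weights = softmax(branch_logits)
    branches = [conv2d_same(image, k) for k in kernels]
    h, w = len(image), len(image[0])
    return [
        [sum(wt * b[i][j] for wt, b in zip(weights, branches)) for j in range(w)]
        for i in range(h)
    ]


# Hypothetical kernel shapes: horizontal 1x3, vertical 3x1, and wider 1x5.
kernels = [
    [[1.0, 1.0, 1.0]],
    [[1.0], [1.0], [1.0]],
    [[1.0, 1.0, 1.0, 1.0, 1.0]],
]
image = [[1.0] * 3 for _ in range(3)]
merged = mac_block_sketch(image, kernels, branch_logits=[0.0, 0.0, 0.0])
```

In a real network the kernel values and branch logits would be trained jointly with the backbone; here they are fixed so the merging step is easy to follow.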

Multi-scale object detection for high-speed railway clearance intrusion

Applied Intelligence ◽

10.1007/s10489-021-02534-9 ◽

2021 ◽

Author(s):

Runliang Tian ◽

Hongmei Shi ◽

Baoqing Guo ◽

Liqiang Zhu

Keyword(s):

Object Detection ◽

High Speed ◽

High Speed Railway ◽

Multi Scale

Augmenting Crop Detection for Precision Agriculture with Deep Visual Transfer Learning—A Case Study of Bale Detection

Remote Sensing ◽

10.3390/rs13010023 ◽

2020 ◽

Vol 13 (1) ◽

pp. 23

Author(s):

Wei Zhao ◽

William Yamada ◽

Tianxin Li ◽

Matthew Digman ◽

Troy Runge

Keyword(s):

Object Detection ◽

Transfer Learning ◽

Precision Agriculture ◽

Crop Production ◽

Domain Adaptation ◽

Training Data ◽

Detection Accuracy ◽

Detection Model ◽

Agriculture Products

In recent years, precision agriculture has been researched as a promising means of increasing crop production with fewer inputs to meet the growing demand for agricultural products. Computer vision-based crop detection with unmanned aerial vehicle (UAV)-acquired images is a critical tool for precision agriculture. However, object detection using deep learning algorithms relies on a significant amount of manually prelabeled training data as ground truth. Detecting field objects such as bales is especially difficult because of (1) long-period image acquisition under different illumination conditions and seasons; (2) limited existing prelabeled data; and (3) few pretrained models and little research to serve as references. This work increases bale detection accuracy with limited data collection and labeling by building an innovative algorithm pipeline. First, an object detection model is trained using 243 images captured under good illumination conditions in fall from the crop lands. In addition, domain adaptation (DA), a kind of transfer learning, is applied to synthesize training data under diverse environmental conditions with automatic labels. Finally, the object detection model is optimized with the synthesized datasets. The case study shows that the proposed method improves bale detection performance, including recall, mean average precision (mAP), and F measure (F1 score), from averages of 0.59, 0.7, and 0.7 (object detection only) to averages of 0.93, 0.94, and 0.89 (object detection + DA), respectively. This approach could be easily scaled to many other crop field objects and will significantly contribute to precision agriculture.
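For readers unfamiliar with the reported metrics, recall and F1 score follow from counts of true positives, false positives, and false negatives. The sketch below shows the standard definitions; the counts in the usage lines are hypothetical and are not data from the study, and mAP is omitted because it also requires ranked confidence scores.

```python
def detection_metrics(tp, fp, fn):
    """Return (precision, recall, f1) from detection counts.

    tp: true positives, fp: false positives, fn: false negatives.
    A metric is defined as 0.0 when its denominator is zero.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Hypothetical counts for illustration only.
precision, recall, f1 = detection_metrics(tp=8, fp=2, fn=2)
```

Because F1 is the harmonic mean of precision and recall, it is pulled toward the weaker of the two, which is why it can sit below both recall and mAP in a summary like the one above.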
