ADA: Adversarial Data Augmentation for Object Detection

Object detection in uncrewed aerial vehicle (UAV) images has been a longstanding challenge in the field of computer vision. Specifically, object detection in drone images is a complex task due to objects of various scales such as humans, buildings, water bodies, and hills. In this paper, we present an implementation of ensemble transfer learning to enhance the performance of the base models for multiscale object detection in drone imagery. Combined with a test-time augmentation pipeline, the algorithm combines different models and applies voting strategies to detect objects of various scales in UAV images. The data augmentation also presents a solution to the deficiency of drone image datasets. We experimented with two specific datasets in the open domain: the VisDrone dataset and the AU-AIR Dataset. Our approach is more practical and efficient due to the use of transfer learning and two-level voting strategy ensemble instead of training custom models on entire datasets. The experimentation shows significant improvement in the mAP for both VisDrone and AU-AIR datasets by employing the ensemble transfer learning method. Furthermore, the utilization of voting strategies further increases the 3reliability of the ensemble as the end-user can select and trace the effects of the mechanism for bounding box predictions.

Download Full-text

Data Augmentation Methods Applying Grayscale Images for Convolutional Neural Networks in Machine Vision

Applied Sciences ◽

10.3390/app11156721 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6721

Author(s):

Jinyeong Wang ◽

Sanghwan Lee

Keyword(s):

Neural Networks ◽

Machine Vision ◽

Object Detection ◽

Image Classification ◽

Convolutional Neural Networks ◽

Data Augmentation ◽

Image Data ◽

Manufacturing Productivity ◽

Smart Factories ◽

Grayscale Images

In increasing manufacturing productivity with automated surface inspection in smart factories, the demand for machine vision is rising. Recently, convolutional neural networks (CNNs) have demonstrated outstanding performance and solved many problems in the field of computer vision. With that, many machine vision systems adopt CNNs to surface defect inspection. In this study, we developed an effective data augmentation method for grayscale images in CNN-based machine vision with mono cameras. Our method can apply to grayscale industrial images, and we demonstrated outstanding performance in the image classification and the object detection tasks. The main contributions of this study are as follows: (1) We propose a data augmentation method that can be performed when training CNNs with industrial images taken with mono cameras. (2) We demonstrate that image classification or object detection performance is better when training with the industrial image data augmented by the proposed method. Through the proposed method, many machine-vision-related problems using mono cameras can be effectively solved by using CNNs.

Download Full-text

Deep Learning Based Active Monitoring for Anti-collision between Vessels and Bridges

IABSE Symposium, Guimarães 2019: Towards a Resilient Built Environment Risk and Asset Management ◽

10.2749/guimaraes.2019.0487 ◽

2019 ◽

Author(s):

Limu Chen ◽

Ye Xia ◽

Dexiong Pan ◽

Chengbin Wang

Keyword(s):

Decision Making ◽

Deep Learning ◽

Object Detection ◽

Large Scale ◽

Data Augmentation ◽

Information Support ◽

Single Shot ◽

Active Monitoring ◽

Detection Model ◽

Comparison Results

<p>Deep-learning based navigational object detection is discussed with respect to active monitoring system for anti-collision between vessel and bridge. Motion based object detection method widely used in existing anti-collision monitoring systems is incompetent in dealing with complicated and changeable waterway for its limitations in accuracy, robustness and efficiency. The video surveillance system proposed contains six modules, including image acquisition, detection, tracking, prediction, risk evaluation and decision-making, and the detection module is discussed in detail. A vessel-exclusive dataset with tons of image samples is established for neural network training and a SSD (Single Shot MultiBox Detector) based object detection model with both universality and pertinence is generated attributing to tactics of sample filtering, data augmentation and large-scale optimization, which make it capable of stable and intelligent vessel detection. Comparison results with conventional methods indicate that the proposed deep-learning method shows remarkable advantages in robustness, accuracy, efficiency and intelligence. In-situ test is carried out at Songpu Bridge in Shanghai, and the results illustrate that the method is qualified for long-term monitoring and providing information support for further analysis and decision making.</p>

Download Full-text

Light-Weight Mixed Stage Partial Network for Surveillance Object Detection with Background Data Augmentation

10.1109/icip42928.2021.9506212 ◽

2021 ◽

Author(s):

Chen Ping-Yang ◽

Jun-Wei Hsieh ◽

Munkhjargal Gochoo ◽

Yong-Sheng Chen

Keyword(s):

Object Detection ◽

Data Augmentation ◽

Light Weight ◽

Background Data ◽

Mixed Stage

Download Full-text

Multiscale Object Detection in Infrared Streetscape Images Based on Deep Learning and Instance Level Data Augmentation

Applied Sciences ◽

10.3390/app9030565 ◽

2019 ◽

Vol 9 (3) ◽

pp. 565 ◽

Cited By ~ 6

Author(s):

Hao Qu ◽

Lilian Zhang ◽

Xuesong Wu ◽

Xiaofeng He ◽

Xiaoping Hu ◽

...

Keyword(s):

Object Detection ◽

Data Augmentation ◽

Region Of Interest ◽

Complex Environments ◽

Feature Maps ◽

Multi Scale ◽

Level Data ◽

Training Stage ◽

Street Scene ◽

Layer Region

The development of object detection in infrared images has attracted more attention in recent years. However, there are few studies on multi-scale object detection in infrared street scene images. Additionally, the lack of high-quality infrared datasets hinders research into such algorithms. In order to solve these issues, we firstly make a series of modifications based on Faster Region-Convolutional Neural Network (R-CNN). In this paper, a double-layer region proposal network (RPN) is proposed to predict proposals of different scales on both fine and coarse feature maps. Secondly, a multi-scale pooling module is introduced into the backbone of the network to explore the response of objects on different scales. Furthermore, the inception4 module and the position sensitive region of interest (ROI) align (PSalign) pooling layer are utilized to explore richer features of the objects. Thirdly, this paper proposes instance level data augmentation, which takes into account the imbalance between categories while enlarging dataset. In the training stage, the online hard example mining method is utilized to further improve the robustness of the algorithm in complex environments. The experimental results show that, compared with baseline, our detection method has state-of-the-art performance.

Download Full-text

IDA: Improved Data Augmentation Applied to Salient Object Detection

2020 33rd SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) ◽

10.1109/sibgrapi51738.2020.00036 ◽

2020 ◽

Author(s):

Daniel V. Ruiz ◽

Bruno A. Krinski ◽

Eduardo Todt

Keyword(s):

Object Detection ◽

Data Augmentation ◽

Salient Object Detection ◽

Salient Object

Download Full-text

One For All: A Mutual Enhancement Method for Object Detection and Semantic Segmentation

Applied Sciences ◽

10.3390/app10010013 ◽

2019 ◽

Vol 10 (1) ◽

pp. 13 ◽

Cited By ~ 2

Author(s):

Shichao Zhang ◽

Zhe Zhang ◽

Libo Sun ◽

Wenhu Qin

Keyword(s):

Object Detection ◽

Data Augmentation ◽

Semantic Segmentation ◽

Detection Task ◽

Training Set ◽

Segmentation Accuracy ◽

Enhancement Method ◽

Road Segmentation ◽

Accuracy Performance

Generally, most approaches using methods such as cropping, rotating, and flipping achieve more data to train models for improving the accuracy of detection and segmentation. However, due to the difficulties of labeling such data especially semantic segmentation data, those traditional data augmentation methodologies cannot help a lot when the training set is really limited. In this paper, a model named OFA-Net (One For All Network) is proposed to combine object detection and semantic segmentation tasks. Meanwhile, using a strategy called “1-N Alternation” to train the OFA-Net model, which can make a fusion of features from detection and segmentation data. The results show that object detection data can be recruited to better the segmentation accuracy performance, and furthermore, segmentation data assist a lot to enhance the confidence of predictions for object detection. Finally, the OFA-Net model is trained without traditional data augmentation methodologies and tested on the KITTI test server. The model works well on the KITTI Road Segmentation challenge and can do a good job on the object detection task.

Download Full-text

Object Detection in X-ray Images Using Transfer Learning with Data Augmentation

International Journal on Advanced Science Engineering and Information Technology ◽

10.18517/ijaseit.9.6.9960 ◽

2019 ◽

Vol 9 (6) ◽

pp. 2147

Author(s):

Reagan L. Galvez ◽

Elmer P. Dadios ◽

Argel A. Bandala ◽

Ryan Rhay P. Vicerra

Keyword(s):

Object Detection ◽

Transfer Learning ◽

Data Augmentation ◽

X Ray

Download Full-text

Autonomous Incident Detection on Spectrometers Using Deep Convolutional Models

Sensors ◽

10.3390/s22010160 ◽

2021 ◽

Vol 22 (1) ◽

pp. 160

Author(s):

Xuelin Zhang ◽

Donghao Zhang ◽

Alexander Leye ◽

Adrian Scott ◽

Luke Visser ◽

...

Keyword(s):

Deep Learning ◽

Object Detection ◽

Real Time ◽

Data Augmentation ◽

Expert Knowledge ◽

Poor Quality ◽

Sample Introduction ◽

Spray Chamber ◽

Chemistry Industry ◽

Real Time System

This paper focuses on improving the performance of scientific instrumentation that uses glass spray chambers for sample introduction, such as spectrometers, which are widely used in analytical chemistry, by detecting incidents using deep convolutional models. The performance of these instruments can be affected by the quality of the introduction of the sample into the spray chamber. Among the indicators of poor quality sample introduction are two primary incidents: The formation of liquid beads on the surface of the spray chamber, and flooding at the bottom of the spray chamber. Detecting such events autonomously as they occur can assist with improving the overall operational accuracy and efficacy of the chemical analysis, and avoid severe incidents such as malfunction and instrument damage. In contrast to objects commonly seen in the real world, beading and flooding detection are more challenging since they are of significantly small size and transparent. Furthermore, the non-rigid property increases the difficulty of the detection of these incidents, as such that existing deep-learning-based object detection frameworks are prone to fail for this task. There is no former work that uses computer vision to detect these incidents in the chemistry industry. In this work, we propose two frameworks for the detection task of these two incidents, which not only leverage the modern deep learning architectures but also integrate with expert knowledge of the problems. Specifically, the proposed networks first localize the regions of interest where the incidents are most likely generated and then refine these incident outputs. The use of data augmentation and synthesis, and choice of negative sampling in training, allows for a large increase in accuracy while remaining a real-time system for inference. In the data collected from our laboratory, our method surpasses widely used object detection baselines and can correctly detect 95% of the beads and 98% of the flooding. At the same time, out method can process four frames per second and is able to be implemented in real time.

Download Full-text

Image Synthesisation and Data Augmentation for Safe Object Detection in Aircraft Auto-landing System

Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications ◽

10.5220/0010248801230135 ◽

2021 ◽

Author(s):

Najda Vidimlic ◽

Alexandra Levin ◽

Mohammad Loni ◽

Masoud Daneshtalab

Keyword(s):

Object Detection ◽

Data Augmentation

Download Full-text