Multi-scale Object Detection Algorithm Based on Faster R-CNN

As a precursor step for computer vision algorithms, object detection plays an important role in various practical application scenarios. With the objects to be detected becoming more complex, the problem of multi-scale object detection has attracted more and more attention, especially in the field of remote sensing detection. Early convolutional neural network detection algorithms are mostly based on artificially preset anchor-boxes to divide different regions in the image, and then obtain the prior position of the target. However, the anchor box is difficult to set reasonably and will cause a large amount of computational redundancy, which affects the generality of the detection model obtained under fixed parameters. In the past two years, anchor-free detection algorithm has achieved remarkable development in the field of detection on natural image. However, there is no sufficient research on how to deal with multi-scale detection more effectively in anchor-free framework and use these detectors on remote sensing images. In this paper, we propose a specific-attention Feature Pyramid Network (FPN) module, which is able to generate a feature pyramid, basing on the characteristics of objects with various sizes. In addition, this pyramid suits multi-scale object detection better. Besides, a scale-aware detection head is proposed which contains a multi-receptive feature fusion module and a size-based feature compensation module. The new anchor-free detector can obtain a more effective multi-scale feature expression. Experiments on challenging datasets show that our approach performs favorably against other methods in terms of the multi-scale object detection performance.

Download Full-text

DC-YOLOv3: A novel efficient object detection algorithm

Journal of Physics Conference Series ◽

10.1088/1742-6596/2082/1/012012 ◽

2021 ◽

Vol 2082 (1) ◽

pp. 012012

Author(s):

Xu Zhang ◽

Fang Han ◽

Ping Wang ◽

Wei Jiang ◽

Chen Wang

Keyword(s):

Object Detection ◽

Real Time ◽

Real World ◽

Network Architecture ◽

Detection Algorithm ◽

Training Time ◽

Feature Representations ◽

Multi Scale ◽

Time Performance ◽

Real World Applications

Abstract Feature pyramids have become an essential component in most modern object detectors, such as Mask RCNN, YOLOv3, RetinaNet. In these detectors, the pyramidal feature representations are commonly used which represent an image with multi-scale feature layers. However, the detectors can’t be used in many real world applications which require real time performance under a computationally limited circumstance. In the paper, we study network architecture in YOLOv3 and modify the classical backbone--darknet53 of YOLOv3 by using a group of convolutions and dilated convolutions (DC). Then, a novel one-stage object detection network framework called DC-YOLOv3 is proposed. A lot of experiments on the Pascal 2017 benchmark prove the effectiveness of our framework. The results illustrate that DC-YOLOv3 achieves comparable results with YOLOv3 while being about 1.32× faster in training time and 1.38× faster in inference time.

Download Full-text

Convolutional Neural Networks-Based Object Detection Algorithm by Jointing Semantic Segmentation for Images

Sensors ◽

10.3390/s20185080 ◽

2020 ◽

Vol 20 (18) ◽

pp. 5080

Author(s):

Baohua Qiang ◽

Ruidong Chen ◽

Mingliang Zhou ◽

Yuanchao Pang ◽

Yijie Zhai ◽

...

Keyword(s):

Object Detection ◽

Real Time ◽

Image Data ◽

Semantic Segmentation ◽

Image Understanding ◽

Detection Algorithm ◽

Vital Role ◽

Multi Scale ◽

Segmentation Task ◽

High Level

In recent years, increasing image data comes from various sensors, and object detection plays a vital role in image understanding. For object detection in complex scenes, more detailed information in the image should be obtained to improve the accuracy of detection task. In this paper, we propose an object detection algorithm by jointing semantic segmentation (SSOD) for images. First, we construct a feature extraction network that integrates the hourglass structure network with the attention mechanism layer to extract and fuse multi-scale features to generate high-level features with rich semantic information. Second, the semantic segmentation task is used as an auxiliary task to allow the algorithm to perform multi-task learning. Finally, multi-scale features are used to predict the location and category of the object. The experimental results show that our algorithm substantially enhances object detection performance and consistently outperforms other three comparison algorithms, and the detection speed can reach real-time, which can be used for real-time detection.

Download Full-text

Foreign Object Detection Algorithm Based on Multi-scale Convolutional Network

Journal of Physics Conference Series ◽

10.1088/1742-6596/1952/2/022017 ◽

2021 ◽

Vol 1952 (2) ◽

pp. 022017

Author(s):

Jiexin Zheng ◽

Ye Chen ◽

Hong Zhang ◽

Dan Liu

Keyword(s):

Object Detection ◽

Detection Algorithm ◽

Foreign Object ◽

Convolutional Network ◽

Multi Scale

Download Full-text

Ship Detection from Optical Remote Sensing Images Using Multi-Scale Analysis and Fourier HOG Descriptor

Remote Sensing ◽

10.3390/rs11131529 ◽

2019 ◽

Vol 11 (13) ◽

pp. 1529 ◽

Cited By ~ 6

Author(s):

Chao Dong ◽

Jinghong Liu ◽

Fang Xu ◽

Chenglong Liu

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Detection System ◽

Detection Algorithm ◽

Optical Remote Sensing ◽

Scale Analysis ◽

Ship Detection ◽

Multi Scale ◽

Saliency Maps ◽

Multi Scale Analysis

Automatic ship detection by Unmanned Airborne Vehicles (UAVs) and satellites is one of the fundamental challenges in maritime research due to the variable appearances of ships and complex sea backgrounds. To address this issue, in this paper, a novel multi-level ship detection algorithm is proposed to detect various types of offshore ships more precisely and quickly under all possible imaging variations. Our object detection system consists of two phases. First, in the category-independent region proposal phase, the steerable pyramid for multi-scale analysis is performed to generate a set of saliency maps in which the candidate region pixels are assigned to high salient values. Then, the set of saliency maps is used for constructing the graph-based segmentation, which can produce more accurate candidate regions compared with the threshold segmentation. More importantly, the proposed algorithm can produce a rather smaller set of candidates in comparison with the classical sliding window object detection paradigm or the other region proposal algorithms. Second, in the target identification phase, a rotation-invariant descriptor, which combines the histogram of oriented gradients (HOG) cells and the Fourier basis together, is investigated to distinguish between ships and non-ships. Meanwhile, the main direction of the ship can also be estimated in this phase. The overall algorithm can account for large variations in scale and rotation. Experiments on optical remote sensing (ORS) images demonstrate the effectiveness and robustness of our detection system.

Download Full-text

LOROD: Fully Convolutional Network for Realtime Multi-scale Object Detection Algorithm

10.1109/swc50871.2021.00086 ◽

2021 ◽

Author(s):

Shaoqi Hou ◽

Chao Li ◽

Xueting Liu ◽

Yuhao Zeng ◽

Wenyi Du ◽

...

Keyword(s):

Object Detection ◽

Detection Algorithm ◽

Convolutional Network ◽

Fully Convolutional Network ◽

Multi Scale

Download Full-text

Improved AdaBoost Based Infrared Object Detection Algorithm

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.490-491.1739 ◽

2014 ◽

Vol 490-491 ◽

pp. 1739-1744

Author(s):

De Qin Shi ◽

Wei Yang ◽

Shu Ping Wang ◽

Qin Ying Lin

Keyword(s):

Object Detection ◽

Detection Algorithm ◽

Experimental Results ◽

Infrared Images ◽

Sliding Windows ◽

Image Gradient ◽

Multi Scale ◽

Perfect Performance ◽

Haar Features ◽

Visible Images

The traditional AdaBoost based object detection algorithm can get perfect performance for visible images, but not for infrared images because of the worse description of Haar features. In this paper, a novel infrared object detection algorithm based on improved AdaBoost with powerful histogram features is proposed. Firstly, HOG histogram with fourth grid area of image gradient is used for the input of classifiers. And then, AdaBoost with weighted FLD as weak classifiers is employed to select the multi-dimension features. Finally, the infrared object detection is realized by multi-scale sliding windows and non-maxima suppression algorithm. The experimental results demonstrate the better performance of the improved AdaBoost detection algorithm which can detect multi-scale infrared targets accurately.

Download Full-text