Attentional single-shot network with multi-scale feature fusion for object detection in aerial images

With the rapid advances in remote-sensing technologies and the larger number of satellite images, fast and effective object detection plays an important role in understanding and analyzing image information, which could be further applied to civilian and military fields. Recently object detection methods with region-based convolutional neural network have shown excellent performance. However, these two-stage methods contain region proposal generation and object detection procedures, resulting in low computation speed. Because of the expensive manual costs, the quantity of well-annotated aerial images is scarce, which also limits the progress of geospatial object detection in remote sensing. In this paper, on the one hand, we construct and release a large-scale remote-sensing dataset for geospatial object detection (RSD-GOD) that consists of 5 different categories with 18,187 annotated images and 40,990 instances. On the other hand, we design a single shot detection framework with multi-scale feature fusion. The feature maps from different layers are fused together through the up-sampling and concatenation blocks to predict the detection results. High-level features with semantic information and low-level features with fine details are fully explored for detection tasks, especially for small objects. Meanwhile, a soft non-maximum suppression strategy is put into practice to select the final detection results. Extensive experiments have been conducted on two datasets to evaluate the designed network. Results show that the proposed approach achieves a good detection performance and obtains the mean average precision value of 89.0% on a newly constructed RSD-GOD dataset and 83.8% on the Northwestern Polytechnical University very high spatial resolution-10 (NWPU VHR-10) dataset at 18 frames per second (FPS) on a NVIDIA GTX-1080Ti GPU.

Download Full-text

MTI-YOLO: A Light-Weight and Real-Time Deep Neural Network for Insulator Detection in Complex Aerial Images

Energies ◽

10.3390/en14051426 ◽

2021 ◽

Vol 14 (5) ◽

pp. 1426

Author(s):

Chuanyang Liu ◽

Yiquan Wu ◽

Jingjing Liu ◽

Jiaming Han

Keyword(s):

Feature Detection ◽

Feature Fusion ◽

Memory Storage ◽

Aerial Images ◽

Detection Accuracy ◽

Composite Insulator ◽

Running Time ◽

Scale Feature ◽

Multi Scale ◽

Good Trade

Insulator detection is an essential task for the safety and reliable operation of intelligent grids. Owing to insulator images including various background interferences, most traditional image-processing methods cannot achieve good performance. Some You Only Look Once (YOLO) networks are employed to meet the requirements of actual applications for insulator detection. To achieve a good trade-off among accuracy, running time, and memory storage, this work proposes the modified YOLO-tiny for insulator (MTI-YOLO) network for insulator detection in complex aerial images. First of all, composite insulator images are collected in common scenes and the “CCIN_detection” (Chinese Composite INsulator) dataset is constructed. Secondly, to improve the detection accuracy of different sizes of insulator, multi-scale feature detection headers, a structure of multi-scale feature fusion, and the spatial pyramid pooling (SPP) model are adopted to the MTI-YOLO network. Finally, the proposed MTI-YOLO network and the compared networks are trained and tested on the “CCIN_detection” dataset. The average precision (AP) of our proposed network is 17% and 9% higher than YOLO-tiny and YOLO-v2. Compared with YOLO-tiny and YOLO-v2, the running time of the proposed network is slightly higher. Furthermore, the memory usage of the proposed network is 25.6% and 38.9% lower than YOLO-v2 and YOLO-v3, respectively. Experimental results and analysis validate that the proposed network achieves good performance in both complex backgrounds and bright illumination conditions.

Download Full-text

Multi-scale Feature Fusion Single Shot Object Detector Based on DenseNet

Intelligent Robotics and Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-030-27541-9_37 ◽

2019 ◽

pp. 450-460 ◽

Cited By ~ 1

Author(s):

Minghao Zhai ◽

Junchen Liu ◽

Wei Zhang ◽

Chen Liu ◽

Wei Li ◽

...

Keyword(s):

Feature Fusion ◽

Single Shot ◽

Scale Feature ◽

Multi Scale

Download Full-text

Global and Local Multi-scale Feature Fusion for Object Detection and Semantic Segmentation

2019 IEEE Intelligent Vehicles Symposium (IV) ◽

10.1109/ivs.2019.8813786 ◽

2019 ◽

Author(s):

Young-Chul Lim ◽

Minsung Kang

Keyword(s):

Object Detection ◽

Feature Fusion ◽

Semantic Segmentation ◽

Scale Feature ◽

Multi Scale ◽

Global And Local

Download Full-text

A Novel Multi-Scale Feature Fusion Method for Region Proposal Network in Fast Object Detection

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2020070107 ◽

2020 ◽

Vol 16 (3) ◽

pp. 132-145

Author(s):

Gang Liu ◽

Chuyi Wang

Keyword(s):

Object Detection ◽

Multiple Scales ◽

Feature Fusion ◽

Uniform Space ◽

Fusion Method ◽

Well Performance ◽

Feature Maps ◽

Neural Network Models ◽

Scale Feature ◽

Multi Scale

Neural network models have been widely used in the field of object detecting. The region proposal methods are widely used in the current object detection networks and have achieved well performance. The common region proposal methods hunt the objects by generating thousands of the candidate boxes. Compared to other region proposal methods, the region proposal network (RPN) method improves the accuracy and detection speed with several hundred candidate boxes. However, since the feature maps contains insufficient information, the ability of RPN to detect and locate small-sized objects is poor. A novel multi-scale feature fusion method for region proposal network to solve the above problems is proposed in this article. The proposed method is called multi-scale region proposal network (MS-RPN) which can generate suitable feature maps for the region proposal network. In MS-RPN, the selected feature maps at multiple scales are fine turned respectively and compressed into a uniform space. The generated fusion feature maps are called refined fusion features (RFFs). RFFs incorporate abundant detail information and context information. And RFFs are sent to RPN to generate better region proposals. The proposed approach is evaluated on PASCAL VOC 2007 and MS COCO benchmark tasks. MS-RPN obtains significant improvements over the comparable state-of-the-art detection models.

Download Full-text

Multi Scale Object Detection Based on Single Shot Multibox Detector with Feature Fusion and Inception Network

The Journal of Korean Institute of Information Technology ◽

10.14801/jkiit.2018.16.10.93 ◽

2018 ◽

Vol 16 (10) ◽

pp. 93-100 ◽

Cited By ~ 1

Author(s):

Md Foysal Haque ◽

Dae-Seong Kang

Keyword(s):

Object Detection ◽

Feature Fusion ◽

Single Shot ◽

Multi Scale

Download Full-text

BMF-CNN: an object detection method based on multi-scale feature fusion in VHR remote sensing images

Remote Sensing Letters ◽

10.1080/2150704x.2019.1706007 ◽

2019 ◽

Vol 11 (3) ◽

pp. 215-224

Author(s):

Zhong Dong ◽

Baojun Lin

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Detection Method ◽

Feature Fusion ◽

Remote Sensing Images ◽

Scale Feature ◽

Multi Scale

Download Full-text

Multi-Scale Bidirectional Feature Fusion for One-Stage Oriented Object Detection in Aerial Images

10.1109/igarss47720.2021.9555142 ◽

2021 ◽

Author(s):

Lei Pei ◽

Gong Cheng ◽

Xuxiang Sun ◽

Qingyang Li ◽

Meili Zhang ◽

...

Keyword(s):

Object Detection ◽

Feature Fusion ◽

Aerial Images ◽

Multi Scale ◽

One Stage ◽

Oriented Object

Download Full-text

Improved YOLO-V3 with DenseNet for Multi-Scale Remote Sensing Target Detection

Sensors ◽

10.3390/s20154276 ◽

2020 ◽

Vol 20 (15) ◽

pp. 4276 ◽

Cited By ~ 2

Author(s):

Danqing Xu ◽

Yiquan Wu

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Target Detection ◽

Poor Performance ◽

Aerial Images ◽

Single Shot ◽

Multi Scale ◽

Dense Distribution ◽

Different Dimensions ◽

Small Targets

Remote sensing targets have different dimensions, and they have the characteristics of dense distribution and a complex background. This makes remote sensing target detection difficult. With the aim at detecting remote sensing targets at different scales, a new You Only Look Once (YOLO)-V3-based model was proposed. YOLO-V3 is a new version of YOLO. Aiming at the defect of poor performance of YOLO-V3 in detecting remote sensing targets, we adopted DenseNet (Densely Connected Network) to enhance feature extraction capability. Moreover, the detection scales were increased to four based on the original YOLO-V3. The experiment on RSOD (Remote Sensing Object Detection) dataset and UCS-AOD (Dataset of Object Detection in Aerial Images) dataset showed that our approach performed better than Faster-RCNN, SSD (Single Shot Multibox Detector), YOLO-V3, and YOLO-V3 tiny in terms of accuracy. Compared with original YOLO-V3, the mAP (mean Average Precision) of our approach increased from 77.10% to 88.73% in the RSOD dataset. In particular, the mAP of detecting targets like aircrafts, which are mainly made up of small targets increased by 12.12%. In addition, the detection speed was not significantly reduced. Generally speaking, our approach achieved higher accuracy and gave considerations to real-time performance simultaneously for remote sensing target detection.

Download Full-text

Multi-Scale Feature Fusion Based Adaptive Object Detection for UAV

Acta Optica Sinica ◽

10.3788/aos202040.1015002 ◽

2020 ◽

Vol 40 (10) ◽

pp. 1015002

Author(s):

刘芳 Liu Fang ◽

吴志威 Wu Zhiwei ◽

杨安喆 Yang Anzhe ◽

韩笑 Han Xiao

Keyword(s):

Object Detection ◽

Feature Fusion ◽

Scale Feature ◽

Multi Scale

Download Full-text