Small Object Detection Based on Deep Learning

Small object detection is an interesting topic in computer vision. With the rapid development in deep learning, it has drawn attention of several researchers with innovations in approaches to join a race. These innovations proposed comprise region proposals, divided grid cell, multiscale feature maps, and new loss function. As a result, performance of object detection has recently had significant improvements. However, most of the state-of-the-art detectors, both in one-stage and two-stage approaches, have struggled with detecting small objects. In this study, we evaluate current state-of-the-art models based on deep learning in both approaches such as Fast RCNN, Faster RCNN, RetinaNet, and YOLOv3. We provide a profound assessment of the advantages and limitations of models. Specifically, we run models with different backbones on different datasets with multiscale objects to find out what types of objects are suitable for each model along with backbones. Extensive empirical evaluation was conducted on 2 standard datasets, namely, a small object dataset and a filtered dataset from PASCAL VOC 2007. Finally, comparative results and analyses are then presented.

Download Full-text

Small Object Detection From Video and Classification Using Deep Learning

Advances in Systems, Control and Automations - Lecture Notes in Electrical Engineering ◽

10.1007/978-981-15-8685-9_10 ◽

2021 ◽

pp. 101-107

Author(s):

R. Arthi ◽

Jai Ahuja ◽

Sachin Kumar ◽

Pushpendra Thakur ◽

Tanay Sharma

Keyword(s):

Deep Learning ◽

Object Detection ◽

Small Object ◽

Small Object Detection

Download Full-text

Improved YOLOv3 with duplex FPN for object detection based on deep learning

International Journal of Electrical Engineering Education ◽

10.1177/0020720920983524 ◽

2021 ◽

pp. 002072092098352

Author(s):

Seokyong Shin ◽

Hyunho Han ◽

Sang Hun Lee

Keyword(s):

Deep Learning ◽

Object Detection ◽

Autonomous Vehicles ◽

Detection Accuracy ◽

Small Object ◽

Feature Maps ◽

Low Level ◽

Small Object Detection ◽

High Level ◽

Networks Structure

YOLOv3 is a deep learning-based real-time object detector and is mainly used in applications such as video surveillance and autonomous vehicles. In this paper, we proposed an improved YOLOv3 (You Only Look Once version 3) applied Duplex FPN, which enhanced large object detection by utilizing low-level feature information. The conventional YOLOv3 improved the small object detection performance by applying FPN (Feature Pyramid Networks) structure to YOLOv2. However, YOLOv3 with an FPN structure specialized in detecting small objects, so it is difficult to detect large objects. Therefore, this paper proposed an improved YOLOv3 applied Duplex FPN, which can utilize low-level location information in high-level feature maps instead of the existing FPN structure of YOLOv3. This improved the detection accuracy of large objects. Also, an extra detection layer was added to the top-level feature map to prevent failure of detection of parts of large objects. Further, dimension clusters of each detection layer were reassigned to learn quickly how to accurately detect objects. The proposed method was compared and analyzed in the PASCAL VOC dataset. The experimental results showed that the bounding box accuracy of large objects improved owing to the Duplex FPN and extra detection layer, and the proposed method succeeded in detecting large objects that the existing YOLOv3 did not.

Download Full-text

Deep learning for small object detection in images

10.32469/10355/79470 ◽

2020 ◽

Author(s):

◽

Yang Liu

Keyword(s):

Deep Learning ◽

Object Detection ◽

State Of The Art ◽

Aerial Imagery ◽

Small Object ◽

Learning Models ◽

Learning Methods ◽

Bounding Boxes ◽

Small Object Detection ◽

Instance Segmentation

[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT REQUEST OF AUTHOR.] With the rapid development of deep learning in computer vision, especially deep convolutional neural networks (CNNs), significant advances have been made in recent years on object recognition and detection in images. Highly accurate detection results have been achieved for large objects, whereas detection accuracy on small objects remains to be low. This dissertation focuses on investigating deep learning methods for small object detection in images and proposing new methods with improved performance. First, we conducted a comprehensive review of existing deep learning methods for small object detections, in which we summarized and categorized major techniques and models, identified major challenges, and listed some future research directions. Existing techniques were categorized into using contextual information, combining multiple feature maps, creating sufficient positive examples, and balancing foreground and background examples. Methods developed in four related areas, generic object detection, face detection, object detection in aerial imagery, and segmentation, were summarized and compared. In addition, the performances of several leading deep learning methods for small object detection, including YOLOv3, Faster R-CNN, and SSD, were evaluated based on three large benchmark image datasets of small objects. Experimental results showed that Faster R-CNN performed the best, while YOLOv3 was a close second. Furthermore, a new deep learning method, called Retina-context Net, was proposed and outperformed state-of-the art one-stage deep learning models, including SSD, YOLOv3 and RetinaNet, on the COCO and SUN benchmark datasets. Secondly, we created a new dataset for bird detection, called Little Birds in Aerial Imagery (LBAI), from real-life aerial imagery. LBAI contains birds with sizes ranging from 10 by 10 pixels to 40 by 40 pixels. We adapted and applied several state-of-the-art deep learning models to LBAI, including object detection models such as YOLOv2, SSH, and Tiny Face, and instance segmentation models such as U-Net and Mask R-CNN. Our empirical results illustrated the strength and weakness of these methods, showing that SSH performed the best for easy cases, whereas Tiny Face performed the best for hard cases with cluttered backgrounds. Among small instance segmentation methods, U-Net achieved slightly better performance than Mask R-CNN. Thirdly, we proposed a new graph neural network-based object detection algorithm, called GODM, to take the spatial information of candidate objects into consideration in small object detection. Instead of detecting small objects independently as the existing deep learning methods do, GODM treats the candidate bounding boxes generated by existing object detectors as nodes and creates edges based on the spatial or semantic relationship between the candidate bounding boxes. GODM contains four major components: node feature generation, graph generation, node class labelling, and graph convolutional neural network model. Several graph generation methods were proposed. Experimental results on the LBDA dataset show that GODM outperformed existing state-of-the-art object detector Faster R-CNN significantly, up to 12% better in accuracy. Finally, we proposed a new computer vision-based grass analysis using machine learning. To deal with the variation of lighting condition, a two-stage segmentation strategy is proposed for grass coverage computation based on a blackboard background. On a real world dataset we collected from natural environments, the proposed method was robust to varying environments, lighting, and colors. For grass detection and coverage computation, the error rate was just 3%.

Download Full-text

A survey and performance evaluation of deep learning methods for small object detection

Expert Systems with Applications ◽

10.1016/j.eswa.2021.114602 ◽

2021 ◽

Vol 172 ◽

pp. 114602 ◽

Cited By ~ 1

Author(s):

Yang Liu ◽

Peng Sun ◽

Nickolas Wergeles ◽

Yi Shang

Keyword(s):

Performance Evaluation ◽

Deep Learning ◽

Object Detection ◽

Small Object ◽

Learning Methods ◽

And Performance ◽

Small Object Detection

Download Full-text

Deep learning-based small object detection

Journal of the Institute of Electronics and Information Engineers ◽

10.5573/ieie.2018.55.7.57 ◽

2018 ◽

Vol 55 (7) ◽

pp. 57-66

Author(s):

Jun Ho Choi ◽

Tae Young Han ◽

Seung Hyun Lee ◽

Byung Cheol Song

Keyword(s):

Deep Learning ◽

Object Detection ◽

Small Object ◽

Small Object Detection

Download Full-text

Multi-View Object Detection Based on Deep Learning

Applied Sciences ◽

10.3390/app8091423 ◽

2018 ◽

Vol 8 (9) ◽

pp. 1423 ◽

Cited By ~ 15

Author(s):

Cong Tang ◽

Yongshun Ling ◽

Xing Yang ◽

Wei Jin ◽

Chao Zheng

Keyword(s):

Deep Learning ◽

Object Detection ◽

Regression Models ◽

Detection Methods ◽

Detection Accuracy ◽

Single Shot ◽

Small Object ◽

Object Retrieval ◽

Detection Approach ◽

Small Object Detection

A multi-view object detection approach based on deep learning is proposed in this paper. Classical object detection methods based on regression models are introduced, and the reasons for their weak ability to detect small objects are analyzed. To improve the performance of these methods, a multi-view object detection approach is proposed, and the model structure and working principles of this approach are explained. Additionally, the object retrieval ability and object detection accuracy of both the multi-view methods and the corresponding classical methods are evaluated and compared based on a test on a small object dataset. The experimental results show that in terms of object retrieval capability, Multi-view YOLO (You Only Look Once: Unified, Real-Time Object Detection), Multi-view YOLOv2 (based on an updated version of YOLO), and Multi-view SSD (Single Shot Multibox Detector) achieve AF (average F-measure) scores that are higher than those of their classical counterparts by 0.177, 0.06, and 0.169, respectively. Moreover, in terms of the detection accuracy, when difficult objects are not included, the mAP (mean average precision) scores of the multi-view methods are higher than those of the classical methods by 14.3%, 7.4%, and 13.1%, respectively. Thus, the validity of the approach proposed in this paper has been verified. In addition, compared with state-of-the-art methods based on region proposals, multi-view detection methods are faster while achieving mAPs that are approximately the same in small object detection.

Download Full-text

Small Object Detection Based on Deep Learning

Small Object Detection of Table Tennis Based on Deep Learning Network

A Survey of Small Object Detection Based on Deep Learning

A novel two-stage deep learning-based small-object detection using hyperspectral images

An Evaluation of Deep Learning Methods for Small Object Detection

Small Object Detection From Video and Classification Using Deep Learning

Improved YOLOv3 with duplex FPN for object detection based on deep learning

Deep learning for small object detection in images

A survey and performance evaluation of deep learning methods for small object detection

Deep learning-based small object detection

Multi-View Object Detection Based on Deep Learning

Export Citation Format