Sea-floor characterization related to object-detection performance of sonar systems: A case study

Object detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be detected, less intra-class variance, less lighting and background variance, constrained or even fixed camera viewpoints, etc. In these cases, we hypothesize that smaller networks could be used without deteriorating the accuracy. However, there are multiple reasons why this does not happen in practice. Firstly, overparameterized networks tend to learn better, and secondly, transfer learning is usually used to reduce the necessary amount of training data. In this paper, we investigate how much we can reduce the computational complexity of a standard object detection network in such constrained object detection problems. As a case study, we focus on a well-known single-shot object detector, YoloV2, and combine three different techniques to reduce the computational complexity of the model without reducing its accuracy on our target dataset. To investigate the influence of the problem complexity, we compare two datasets: a prototypical academic (Pascal VOC) and a real-life operational (LWIR person detection) dataset. The three optimization steps we exploited are: swapping all the convolutions for depth-wise separable convolutions, perform pruning and use weight quantization. The results of our case study indeed substantiate our hypothesis that the more constrained a problem is, the more the network can be optimized. On the constrained operational dataset, combining these optimization techniques allowed us to reduce the computational complexity with a factor of 349, as compared to only a factor 9.8 on the academic dataset. When running a benchmark on an Nvidia Jetson AGX Xavier, our fastest model runs more than 15 times faster than the original YoloV2 model, whilst increasing the accuracy by 5% Average Precision (AP).

Download Full-text

Augmenting Crop Detection for Precision Agriculture with Deep Visual Transfer Learning—A Case Study of Bale Detection

Remote Sensing ◽

10.3390/rs13010023 ◽

2020 ◽

Vol 13 (1) ◽

pp. 23

Author(s):

Wei Zhao ◽

William Yamada ◽

Tianxin Li ◽

Matthew Digman ◽

Troy Runge

Keyword(s):

Object Detection ◽

Transfer Learning ◽

Precision Agriculture ◽

Crop Production ◽

Domain Adaptation ◽

Training Data ◽

Detection Accuracy ◽

Detection Model ◽

Agriculture Products

In recent years, precision agriculture has been researched to increase crop production with less inputs, as a promising means to meet the growing demand of agriculture products. Computer vision-based crop detection with unmanned aerial vehicle (UAV)-acquired images is a critical tool for precision agriculture. However, object detection using deep learning algorithms rely on a significant amount of manually prelabeled training datasets as ground truths. Field object detection, such as bales, is especially difficult because of (1) long-period image acquisitions under different illumination conditions and seasons; (2) limited existing prelabeled data; and (3) few pretrained models and research as references. This work increases the bale detection accuracy based on limited data collection and labeling, by building an innovative algorithms pipeline. First, an object detection model is trained using 243 images captured with good illimitation conditions in fall from the crop lands. In addition, domain adaptation (DA), a kind of transfer learning, is applied for synthesizing the training data under diverse environmental conditions with automatic labels. Finally, the object detection model is optimized with the synthesized datasets. The case study shows the proposed method improves the bale detecting performance, including the recall, mean average precision (mAP), and F measure (F1 score), from averages of 0.59, 0.7, and 0.7 (the object detection) to averages of 0.93, 0.94, and 0.89 (the object detection + DA), respectively. This approach could be easily scaled to many other crop field objects and will significantly contribute to precision agriculture.

Download Full-text

A computer vision-based object detection and counting for COVID-19 protocol compliance: a case study of Jakarta

2020 International Conference on ICT for Smart Society (ICISS) ◽

10.1109/iciss50791.2020.9307594 ◽

2020 ◽

Author(s):

Muhammad Lanang Afkaar Ar ◽

Sulthan Muzakki Adytia S ◽

Yudhistira Nugraha ◽

Farizah Rizka R ◽

Andy Ernesto ◽

...

Keyword(s):

Computer Vision ◽

Object Detection ◽

Detection And Counting ◽

Protocol Compliance

Download Full-text

Small Object Detection in Remote Sensing Images with Residual Feature Aggregation-Based Super-Resolution and Object Detector Network

Remote Sensing ◽

10.3390/rs13091854 ◽

2021 ◽

Vol 13 (9) ◽

pp. 1854

Author(s):

Syed Muhammad Arsalan Bashir ◽

Yi Wang

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Resolution Enhancement ◽

Super Resolution ◽

Detection Performance ◽

Image Resolution ◽

Small Object ◽

Remote Sensing Images ◽

Feature Aggregation ◽

Image Super Resolution

This paper deals with detecting small objects in remote sensing images from satellites or any aerial vehicle by utilizing the concept of image super-resolution for image resolution enhancement using a deep-learning-based detection method. This paper provides a rationale for image super-resolution for small objects by improving the current super-resolution (SR) framework by incorporating a cyclic generative adversarial network (GAN) and residual feature aggregation (RFA) to improve detection performance. The novelty of the method is threefold: first, a framework is proposed, independent of the final object detector used in research, i.e., YOLOv3 could be replaced with Faster R-CNN or any object detector to perform object detection; second, a residual feature aggregation network was used in the generator, which significantly improved the detection performance as the RFA network detected complex features; and third, the whole network was transformed into a cyclic GAN. The image super-resolution cyclic GAN with RFA and YOLO as the detection network is termed as SRCGAN-RFA-YOLO, which is compared with the detection accuracies of other methods. Rigorous experiments on both satellite images and aerial images (ISPRS Potsdam, VAID, and Draper Satellite Image Chronology datasets) were performed, and the results showed that the detection performance increased by using super-resolution methods for spatial resolution enhancement; for an IoU of 0.10, AP of 0.7867 was achieved for a scale factor of 16.

Download Full-text

A Note on Advantages of the Fuzzy Gabor Filter in Object and Text Detection

Symmetry ◽

10.3390/sym13040678 ◽

2021 ◽

Vol 13 (4) ◽

pp. 678

Author(s):

Vladimir Tadic ◽

Tatjana Loncar-Turukalo ◽

Akos Odry ◽

Zeljen Trpovski ◽

Attila Toth ◽

...

Keyword(s):

Object Detection ◽

Detection Method ◽

Gabor Filter ◽

Low Cost ◽

Fuzzy Optimization ◽

Detection Performance ◽

Text Detection ◽

License Plate ◽

2D Gabor Filter ◽

Fine Tune

This note presents a fuzzy optimization of Gabor filter-based object and text detection. The derivation of a 2D Gabor filter and the guidelines for the fuzzification of the filter parameters are described. The fuzzy Gabor filter proved to be a robust text an object detection method in low-quality input images as extensively evaluated in the problem of license plate localization. The extended set of examples confirmed that the fuzzy optimized Gabor filter with adequately fuzzified parameters detected the desired license plate texture components and highly improved the object detection when compared to the classic Gabor filter. The robustness of the proposed approach was further demonstrated on other images of various origin containing text and different textures, captured using low-cost or modest quality acquisition procedures. The possibility to fine tune the fuzzification procedure to better suit certain applications offers the potential to further boost detection performance.

Download Full-text

The effects of super-resolution on object detection performance in an aerial image

2020 7th NAFOSTED Conference on Information and Computer Science (NICS) ◽

10.1109/nics51282.2020.9335859 ◽

2020 ◽

Author(s):

Ngan T. Truong ◽

Nguyen D. Vo ◽

Khang Nguyen

Keyword(s):

Object Detection ◽

Super Resolution ◽

Detection Performance ◽

Aerial Image

Download Full-text

Comparing lesion detection performance for PET image reconstruction algorithms: A case study

1996 IEEE Nuclear Science Symposium. Conference Record ◽

10.1109/nssmic.1996.587921 ◽

2002 ◽

Cited By ~ 1

Author(s):

M.T. Chan ◽

R.M. Leahy ◽

E.U. Mumcuoglu ◽

S.R. Cherry ◽

J. Czernin ◽

...

Keyword(s):

Image Reconstruction ◽

Detection Performance ◽

Lesion Detection ◽

Reconstruction Algorithms ◽

Pet Image Reconstruction ◽

Image Reconstruction Algorithms

Download Full-text

Justifying the Importance of Color Cues in Object Detection: A Case Study on Pedestrian

The Era of Interactive Media ◽

10.1007/978-1-4614-3501-3_32 ◽

2012 ◽

pp. 387-397 ◽

Cited By ~ 3

Author(s):

Qingyuan Wang ◽

Junbiao Pang ◽

Lei Qin ◽

Shuqiang Jiang ◽

Qingming Huang

Keyword(s):

Object Detection ◽

Color Cues

Download Full-text

Object Detection Via Flexible Anchor Generation

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001421550120 ◽

2021 ◽

Author(s):

Pengxin Ding ◽

Huan Zhou ◽

Jinxia Shang ◽

Xiang Zou ◽

Minghui Wang

Keyword(s):

Object Detection ◽

Detection Performance ◽

Experimental Results ◽

Object Size ◽

The Difference ◽

Simple Network

This paper designs a method that can generate anchors of various shapes for the object detection framework. This method has the characteristics of novelty and flexibility. Different from the previous anchors generated by a pre-defined manner, our anchors are generated dynamically by an anchor generator. Specially, the anchor generator is not fixed but learned from the hand-designed anchors, which means that our anchor generator is able to work well in various scenes. In the inference time, the weights of anchor generator are estimated by a simple network where the input is some hand-designed anchor. In addition, in order to make the difference between the number of positive and negative samples smaller, we use an adaptive IOU threshold related to the object size to solve this problem. At the same time, we proved that our proposed method is effective and conducted a lot of experiments on the COCO dataset. Experimental results show that after replacing the anchor generation method in the previous object detectors (such as SSD, mask RCNN, and Retinanet) with our proposed method, the detection performance of the model has been greatly improved compared to before the replacement, which proves our method is effective.

Download Full-text

CNN-based Object Detection on Low Precision Hardware: Racing Car Case Study

2019 IEEE Intelligent Vehicles Symposium (IV) ◽

10.1109/ivs.2019.8814001 ◽

2019 ◽

Cited By ~ 1

Author(s):

Nicolo De Rita ◽

Alessandro Aimar ◽

Tobi Delbruck

Keyword(s):

Object Detection

Download Full-text