An Embeddable Algorithm for Automatic Garbage Detection Based on Complex Marine Environment

With the continuous development of artificial intelligence, embedding object detection algorithms into autonomous underwater detectors for marine garbage cleanup has become an emerging application area. Considering the complexity of the marine environment and the low resolution of the images taken by underwater detectors, this paper proposes an improved algorithm based on Mask R-CNN, with the aim of achieving high accuracy marine garbage detection and instance segmentation. First, the idea of dilated convolution is introduced in the Feature Pyramid Network to enhance feature extraction ability for small objects. Secondly, the spatial-channel attention mechanism is used to make features learn adaptively. It can effectively focus attention on detection objects. Third, the re-scoring branch is added to improve the accuracy of instance segmentation by scoring the predicted masks based on the method of Generalized Intersection over Union. Finally, we train the proposed algorithm in this paper on the Transcan dataset, evaluating its effectiveness by various metrics and comparing it with existing algorithms. The experimental results show that compared to the baseline provided by the Transcan dataset, the algorithm in this paper improves the mAP indexes on the two tasks of garbage detection and instance segmentation by 9.6 and 5.0, respectively, which significantly improves the algorithm performance. Thus, it can be better applied in the marine environment and achieve high precision object detection and instance segmentation.

Download Full-text

Small object detection combining attention mechanism and a novel FPN

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-211905 ◽

2021 ◽

pp. 1-13

Author(s):

Junying Chen ◽

Shipeng Liu ◽

Liang Zhao ◽

Dengfeng Chen ◽

Weihua Zhang

Keyword(s):

Object Detection ◽

Small Object ◽

Extraction Ability ◽

Feature Pyramid ◽

Face Datasets ◽

New Feature ◽

Small Object Detection ◽

Low Sensitivity ◽

Objects Detection ◽

Transmission Ability

Since small objects occupy less pixels in the image and are difficult to recognize. Small object detection has always been a research difficulty in the field of computer vision. Aiming at the problems of low sensitivity and poor detection performance of YOLOv3 for small objects. AFYOLO, which is more sensitive to small objects detection was proposed in this paper. Firstly, the DenseNet module is introduced into the low-level layers of backbone to enhance the transmission ability of objects information. At the same time, a new mechanism combining channel attention and spatial attention is introduced to improve the feature extraction ability of the backbone. Secondly, a new feature pyramid network (FPN) is proposed to better obtain the features of small objects. Finally, ablation studies on ImageNet classification task and MS-COCO object detection task verify the effectiveness of the proposed attention module and FPN. The results on Wider Face datasets show that the AP of the proposed method is 11.89%higher than that of YOLOv3 and 8.59%higher than that of YOLOv4. All of results show that AFYOLO has better ability for small object detection.

Download Full-text

Dynamic Object Detection Algorithm Based on Lightweight Shared Feature Pyramid

Remote Sensing ◽

10.3390/rs13224610 ◽

2021 ◽

Vol 13 (22) ◽

pp. 4610

Author(s):

Li Zhu ◽

Zihao Xie ◽

Jing Luo ◽

Yuhang Qi ◽

Liman Liu ◽

...

Keyword(s):

Object Detection ◽

Feature Fusion ◽

Computational Cost ◽

Detection Algorithm ◽

Dynamic Object ◽

Shared Feature ◽

Detection Model ◽

Detection Algorithms ◽

Adaptive Inference ◽

Feature Pyramid

Current object detection algorithms perform inference on all samples at a fixed computational cost in the inference stage, which wastes computing resources and is not flexible. To solve this problem, a dynamic object detection algorithm based on a lightweight shared feature pyramid is proposed, which performs adaptive inference according to computing resources and the difficulty of samples, greatly improving the efficiency of inference. Specifically, a lightweight shared feature pyramid network and lightweight detection head is proposed to reduce the amount of computation and parameters in the feature fusion part and detection head of the dynamic object detection model. On the PASCAL VOC dataset, under the two conditions of “anytime prediction” and “budgeted batch object detection”, the performance, computation amount and parameter amount are better than the dynamic object detection models constructed by networks such as ResNet, DenseNet and MSDNet.

Download Full-text

A High-precision Forest Fire Smoke Detection Approach Based on DRGNet to Remote Sensing Through Uavs

10.21203/rs.3.rs-960274/v1 ◽

2021 ◽

Author(s):

Jialei Zhan ◽

Yaowen Hu ◽

Guoxiong Zhou ◽

Yanfeng Wang ◽

Weiwei Cai ◽

...

Keyword(s):

Forest Fire ◽

High Accuracy ◽

Detection Methods ◽

Detection Accuracy ◽

Smoke Detection ◽

High Transparency ◽

Dilated Convolution ◽

Fire Smoke ◽

Feature Pyramid ◽

Global Optimal

Abstract The occurrence of forest fires can lead to ecological damage, property loss, and human casualties. Current forest fire smoke detection methods do not sufficiently consider the characteristics of smoke with high transparency and no clear edges and have low detection accuracy, which cannot meet the needs of complex aerial forest fire smoke detection tasks. In this paper, we propose Dual-ResNet50-vd with SoftPool based on a recursive feature pyramid with deconvolution and dilated convolution and global optimal nonmaximum suppression (DRGNet) for high-accuracy detection of forest fire smoke. First, the Dual-ResNet50-vd module is proposed to enhance the extraction of smoke features with high transparency and no clear edges, and SoftPool is used to retain more feature information of smoke. Then, a recursive feature pyramid with deconvolution and dilated convolution (RDDFPN) is proposed to fuse shallow visual features and deep semantic information in the channel dimension to improve the accuracy of long-range aerial smoke detection. Finally, global optimal nonmaximum suppression (GO-NMS) sets the objective function to globally optimize the selection of anchor frames to adapt to the aerial photography of multiple smoke locations in forest fire scenes. The experimental results show that the DRGNet parametric number on the UAV-IoT platform is as low as 53.48 M, mAP reaches 79.03%, mAP50 reaches 90.26%, mAP75 reaches 82.35%, FPS reaches 122.5, and GFLOPs reaches 55.78. Compared with other mainstream methods, it has the advantages of real-time detection and high accuracy.

Download Full-text

A Deep Detection Network Based on Interaction of Instance Segmentation and Object Detection for SAR Images

Remote Sensing ◽

10.3390/rs13132582 ◽

2021 ◽

Vol 13 (13) ◽

pp. 2582

Author(s):

Zitong Wu ◽

Biao Hou ◽

Bo Ren ◽

Zhongle Ren ◽

Shuang Wang ◽

...

Keyword(s):

Object Detection ◽

Multiple Scales ◽

Interaction Space ◽

Sar Images ◽

Ship Detection ◽

Detection Algorithms ◽

Level Information ◽

Bounding Boxes ◽

The Relationship ◽

Instance Segmentation

Ship detection is a challenging task for synthetic aperture radar (SAR) images. Ships have arbitrary directionality and multiple scales in SAR images. Furthermore, there is a lot of clutter near the ships. Traditional detection algorithms are not robust to these situations and easily cause redundancy in the detection area. With the continuous improvement in resolution, the traditional algorithms cannot achieve high-precision ship detection in SAR images. An increasing number of deep learning algorithms have been applied to SAR ship detection. In this study, a new ship detection network, known as the instance segmentation assisted ship detection network (ISASDNet), is presented. ISASDNet is a two-stage detection network with two branches. A branch is called an object branch and can extract object-level information to obtain positioning bounding boxes and classification results. Another branch called the pixel branch can be utilized for instance segmentation. In the pixel branch, the designed global relational inference layer maps the features to interaction space to learn the relationship between ship and background. The global reasoning module (GRM) based on global relational inference layers can better extract the instance segmentation results of ships. A mask assisted ship detection module (MASDM) is behind the two branches. The MASDM can improve detection results by interacting with the outputs of the two branches. In addition, a strategy is designed to extract the mask of SAR ships, which enables ISASDNet to perform object detection training and instance segmentation training at the same time. Experiments carried out two different datasets demonstrated the superiority of ISASDNet over other networks.

Download Full-text

Multi Angle Rotation Object Detection for Remote Sensing Image Based on Modified Feature Pyramid Networks

International Journal of Remote Sensing ◽

10.1080/01431161.2021.1910371 ◽

2021 ◽

Vol 42 (14) ◽

pp. 5257-5280

Author(s):

Lianyu Cao ◽

Xiaolu Zhang ◽

Zhaoshun Wang ◽

Guangyu Ding

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Remote Sensing Image ◽

Angle Rotation ◽

Feature Pyramid

Download Full-text

A dual U-Net algorithm for automating feature extraction from satellite imagery

The Journal of Defense Modeling and Simulation Applications Methodology Technology ◽

10.1177/1548512920983549 ◽

2021 ◽

pp. 154851292098354

Author(s):

Samuel Humphries ◽

Trevor Parker ◽

Bryan Jonas ◽

Bryan Adams ◽

Nicholas J Clark

Keyword(s):

Neural Networks ◽

Object Detection ◽

Satellite Images ◽

Detection Algorithm ◽

Military Operations ◽

Detection Algorithms ◽

Us Military ◽

Standard Product ◽

Different Types ◽

Road Intersections

Quick identification of building and roads is critical for execution of tactical US military operations in an urban environment. To this end, a gridded, referenced, satellite images of an objective, often referred to as a gridded reference graphic or GRG, has become a standard product developed during intelligence preparation of the environment. At present, operational units identify key infrastructure by hand through the work of individual intelligence officers. Recent advances in Convolutional Neural Networks, however, allows for this process to be streamlined through the use of object detection algorithms. In this paper, we describe an object detection algorithm designed to quickly identify and label both buildings and road intersections present in an image. Our work leverages both the U-Net architecture as well the SpaceNet data corpus to produce an algorithm that accurately identifies a large breadth of buildings and different types of roads. In addition to predicting buildings and roads, our model numerically labels each building by means of a contour finding algorithm. Most importantly, the dual U-Net model is capable of predicting buildings and roads on a diverse set of test images and using these predictions to produce clean GRGs.

Download Full-text

Bi-directional skip connection feature pyramid network and Sub-pixel convolution for high-quality object detection

Neurocomputing ◽

10.1016/j.neucom.2021.01.021 ◽

2021 ◽

Author(s):

Shuqi Xiong ◽

Xiaohong Wu ◽

Honggang Chen ◽

Linbo Qing ◽

Tong Chen ◽

...

Keyword(s):

Object Detection ◽

High Quality ◽

Feature Pyramid

Download Full-text

Feature pyramid of bi-directional stepped concatenation for small object detection

Multimedia Tools and Applications ◽

10.1007/s11042-021-10718-1 ◽

2021 ◽

Author(s):

Qiyuan Zheng ◽

Ying Chen

Keyword(s):

Object Detection ◽

Small Object ◽

Feature Pyramid ◽

Small Object Detection

Download Full-text

Attention-Based Feature Pyramid Network for Object Detection

Proceedings of the 2019 8th International Conference on Computing and Pattern Recognition ◽

10.1145/3373509.3373529 ◽

2019 ◽

Author(s):

Ziyuan Liu ◽

Ping Gong ◽

Jing Wang

Keyword(s):

Object Detection ◽

Feature Pyramid

Download Full-text

Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet

Sensors ◽

10.3390/s21051820 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1820

Author(s):

Xiaotao Shao ◽

Qing Wang ◽

Wei Yang ◽

Yun Chen ◽

Yi Xie ◽

...

Keyword(s):

Semantic Information ◽

Detection System ◽

Pedestrian Detection ◽

Detection Accuracy ◽

The Public ◽

Scale Feature ◽

Detection Algorithms ◽

Multi Scale ◽

Art Works ◽

Feature Pyramid

The existing pedestrian detection algorithms cannot effectively extract features of heavily occluded targets which results in lower detection accuracy. To solve the heavy occlusion in crowds, we propose a multi-scale feature pyramid network based on ResNet (MFPN) to enhance the features of occluded targets and improve the detection accuracy. MFPN includes two modules, namely double feature pyramid network (FPN) integrated with ResNet (DFR) and repulsion loss of minimum (RLM). We propose the double FPN which improves the architecture to further enhance the semantic information and contours of occluded pedestrians, and provide a new way for feature extraction of occluded targets. The features extracted by our network can be more separated and clearer, especially those heavily occluded pedestrians. Repulsion loss is introduced to improve the loss function which can keep predicted boxes away from the ground truths of the unrelated targets. Experiments carried out on the public CrowdHuman dataset, we obtain 90.96% AP which yields the best performance, 5.16% AP gains compared to the FPN-ResNet50 baseline. Compared with the state-of-the-art works, the performance of the pedestrian detection system has been boosted with our method.

Download Full-text