A Survey on Deep Learning Based Methods and Datasets for Monocular 3D Object Detection

Seong-heum Kim; Youngbae Hwang

doi:10.3390/electronics10040517

A Survey on Deep Learning Based Methods and Datasets for Monocular 3D Object Detection

Electronics ◽

10.3390/electronics10040517 ◽

2021 ◽

Vol 10 (4) ◽

pp. 517

Author(s):

Seong-heum Kim ◽

Youngbae Hwang

Keyword(s):

Deep Learning ◽

Object Detection ◽

Low Cost ◽

Detection Methods ◽

Future Research ◽

3D Object ◽

Practical Applications ◽

Depth Sensors ◽

Significant Research ◽

3D Object Detection

Owing to recent advancements in deep learning methods and relevant databases, it is becoming increasingly easier to recognize 3D objects using only RGB images from single viewpoints. This study investigates the major breakthroughs and current progress in deep learning-based monocular 3D object detection. For relatively low-cost data acquisition systems without depth sensors or cameras at multiple viewpoints, we first consider existing databases with 2D RGB photos and their relevant attributes. Based on this simple sensor modality for practical applications, deep learning-based monocular 3D object detection methods that overcome significant research challenges are categorized and summarized. We present the key concepts and detailed descriptions of representative single-stage and multiple-stage detection solutions. In addition, we discuss the effectiveness of the detection models on their baseline benchmarks. Finally, we explore several directions for future research on monocular 3D object detection.

Download Full-text

A comprehensive survey of LIDAR-based 3D object detection methods with Deep learning for autonomous driving

Computers & Graphics ◽

10.1016/j.cag.2021.07.003 ◽

2021 ◽

Author(s):

Georgios Zamanakos ◽

Lazaros Tsochatzidis ◽

Angelos Amanatiadis ◽

Ioannis Pratikakis

Keyword(s):

Deep Learning ◽

Object Detection ◽

Autonomous Driving ◽

Detection Methods ◽

3D Object ◽

Comprehensive Survey ◽

3D Object Detection

Download Full-text

Deep Learning on 3D Object Detection for Automatic Plug-in Charging Using a Mobile Manipulator

10.1109/icra48506.2021.9561106 ◽

2021 ◽

Author(s):

Zhengxue Zhou ◽

Leihui Li ◽

Riwei Wang ◽

Xuping Zhang

Keyword(s):

Deep Learning ◽

Object Detection ◽

Mobile Manipulator ◽

3D Object ◽

3D Object Detection

Download Full-text

Optimization of PointPillars (A Deep Learning Network for LiDAR-based 3D Object Detection) on Intel Platform

10.1109/icpics52425.2021.9524176 ◽

2021 ◽

Author(s):

Shengxian Liu ◽

Qing Xu ◽

Hua Ma ◽

Jessica Du ◽

Ming Lei ◽

...

Keyword(s):

Deep Learning ◽

Object Detection ◽

3D Object ◽

Learning Network ◽

Deep Learning Network ◽

3D Object Detection

Download Full-text

A Novel Regional Fusion Network for 3D Object Detection based on RGB Images and Point Clouds

10.5121/csit.2021.111812 ◽

2021 ◽

Author(s):

Hung-Hao Chen ◽

Chia-Hung Wang ◽

Hsueh-Wei Chen ◽

Pei-Yung Hsiao ◽

Li-Chen Fu ◽

...

Keyword(s):

Object Detection ◽

Receptive Fields ◽

Point Clouds ◽

Detection Methods ◽

Lidar Data ◽

3D Object ◽

Multi Scale ◽

Interest Level ◽

Rgb Images ◽

3D Object Detection

The current fusion-based methods transform LiDAR data into bird’s eye view (BEV) representations or 3D voxel, leading to information loss and heavy computation cost of 3D convolution. In contrast, we directly consume raw point clouds and perform fusion between two modalities. We employ the concept of region proposal network to generate proposals from two streams, respectively. In order to make two sensors compensate the weakness of each other, we utilize the calibration parameters to project proposals from one stream onto the other. With the proposed multi-scale feature aggregation module, we are able to combine the extracted regionof-interest-level (RoI-level) features of RGB stream from different receptive fields, resulting in fertilizing feature richness. Experiments on KITTI dataset show that our proposed network outperforms other fusion-based methods with meaningful improvements as compared to 3D object detection methods under challenging setting.

Download Full-text

3D Object Detection and Tracking Methods using Deep Learning for Computer Vision Applications

10.1109/rteict52294.2021.9573964 ◽

2021 ◽

Author(s):

E Shreyas ◽

Manav Hiren Sheth ◽

Mohana

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Object Detection ◽

3D Object ◽

Object Detection And Tracking ◽

Detection And Tracking ◽

Computer Vision Applications ◽

3D Object Detection

Download Full-text

Deep Learning of Local RGB-D Patches for 3D Object Detection and 6D Pose Estimation

Computer Vision – ECCV 2016 - Lecture Notes in Computer Science ◽

10.1007/978-3-319-46487-9_13 ◽

2016 ◽

pp. 205-220 ◽

Cited By ~ 65

Author(s):

Wadim Kehl ◽

Fausto Milletari ◽

Federico Tombari ◽

Slobodan Ilic ◽

Nassir Navab

Keyword(s):

Deep Learning ◽

Object Detection ◽

Pose Estimation ◽

3D Object ◽

3D Object Detection

Download Full-text

A Survey on 3D Object Detection Methods for Autonomous Driving Applications

IEEE Transactions on Intelligent Transportation Systems ◽

10.1109/tits.2019.2892405 ◽

2019 ◽

Vol 20 (10) ◽

pp. 3782-3795 ◽

Cited By ~ 38

Author(s):

Eduardo Arnold ◽

Omar Y. Al-Jarrah ◽

Mehrdad Dianati ◽

Saber Fallah ◽

David Oxtoby ◽

...

Keyword(s):

Object Detection ◽

Autonomous Driving ◽

Detection Methods ◽

3D Object ◽

3D Object Detection

Download Full-text

A Survey on Monocular 3D Object Detection Algorithms Based on Deep Learning

Journal of Physics Conference Series ◽

10.1088/1742-6596/1518/1/012049 ◽

2020 ◽

Vol 1518 ◽

pp. 012049

Author(s):

Junhui Wu ◽

Dong Yin ◽

Jie Chen ◽

Yusheng Wu ◽

Huiping Si ◽

...

Keyword(s):

Deep Learning ◽

Object Detection ◽

3D Object ◽

Detection Algorithms ◽

3D Object Detection

Download Full-text

Outdoor Mobile Mapping and AI-Based 3D Object Detection with Low-Cost RGB-D Cameras: The Use Case of On-Street Parking Statistics

Remote Sensing ◽

10.3390/rs13163099 ◽

2021 ◽

Vol 13 (16) ◽

pp. 3099

Author(s):

Stephan Nebiker ◽

Jonas Meyer ◽

Stefan Blaser ◽

Manuela Ammann ◽

Severin Rhyner

Keyword(s):

Object Detection ◽

Smart Cities ◽

Low Cost ◽

Point Clouds ◽

Mobile Mapping ◽

3D Object ◽

Depth Measurement ◽

Detection Algorithms ◽

3D Point Clouds ◽

3D Object Detection

A successful application of low-cost 3D cameras in combination with artificial intelligence (AI)-based 3D object detection algorithms to outdoor mobile mapping would offer great potential for numerous mapping, asset inventory, and change detection tasks in the context of smart cities. This paper presents a mobile mapping system mounted on an electric tricycle and a procedure for creating on-street parking statistics, which allow government agencies and policy makers to verify and adjust parking policies in different city districts. Our method combines georeferenced red-green-blue-depth (RGB-D) imagery from two low-cost 3D cameras with state-of-the-art 3D object detection algorithms for extracting and mapping parked vehicles. Our investigations demonstrate the suitability of the latest generation of low-cost 3D cameras for real-world outdoor applications with respect to supported ranges, depth measurement accuracy, and robustness under varying lighting conditions. In an evaluation of suitable algorithms for detecting vehicles in the noisy and often incomplete 3D point clouds from RGB-D cameras, the 3D object detection network PointRCNN, which extends region-based convolutional neural networks (R-CNNs) to 3D point clouds, clearly outperformed all other candidates. The results of a mapping mission with 313 parking spaces show that our method is capable of reliably detecting parked cars with a precision of 100% and a recall of 97%. It can be applied to unslotted and slotted parking and different parking types including parallel, perpendicular, and angle parking.

Download Full-text

Scale-Aware Attention-Based PillarsNet (SAPN) Based 3D Object Detection for Point Cloud

Mathematical Problems in Engineering ◽

10.1155/2020/3927365 ◽

2020 ◽

Vol 2020 ◽

pp. 1-12

Author(s):

Xiang Song ◽

Weiqin Zhan ◽

Xiaoyu Che ◽

Huilin Jiang ◽

Biao Yang

Keyword(s):

Object Detection ◽

Autonomous Navigation ◽

Point Clouds ◽

Detection Performance ◽

Detection Methods ◽

Feature Maps ◽

3D Object ◽

3D Point Clouds ◽

Detection Approach ◽

3D Object Detection

Three-dimensional object detection can provide precise positions of objects, which can be beneficial to many robotics applications, such as self-driving cars, housekeeping robots, and autonomous navigation. In this work, we focus on accurate object detection in 3D point clouds and propose a new detection pipeline called scale-aware attention-based PillarsNet (SAPN). SAPN is a one-stage 3D object detection approach similar to PointPillar. However, SAPN achieves better performance than PointPillar by introducing the following strategies. First, we extract multiresolution pillar-level features from the point clouds to make the detection approach more scale-aware. Second, a spatial-attention mechanism is used to highlight the object activations in the feature maps, which can improve detection performance. Finally, SE-attention is employed to reweight the features fed into the detection head, which performs 3D object detection in a multitask learning manner. Experiments on the KITTI benchmark show that SAPN achieved similar or better performance compared with several state-of-the-art LiDAR-based 3D detection methods. The ablation study reveals the effectiveness of each proposed strategy. Furthermore, strategies used in this work can be embedded easily into other LiDAR-based 3D detection approaches, which improve their detection performance with slight modifications.

Download Full-text