Real-Time 3D object detection using improved convolutional neural network based on image-driven point cloud

Real Time ◽

Point Cloud ◽

Point Clouds ◽

3D Point Cloud ◽

3D Object ◽

3D Object Detection ◽

Instance Segmentation

Background: While detecting the object directly from the 3D point cloud, the natural 3D patterns and invariance of 3D data are often obscure. Objective: In this work, we aimed at studying the 3D object detection from discrete, disordered and sparse 3D point clouds. Methods: The CNN is composed of the frustum sequence module, 3D instance segmentation module S-NET, 3D point cloud transformation module T-NET, and 3D boundary box estimation module E-NET. The search space of the object is determined by the frustum sequence module. The instance segmentation of the point cloud is performed by the 3D instance segmentation module. The 3D coordinates of the object are confirmed by the transformation module and the 3D bounding box estimation module. Results: Evaluated on KITTI benchmark dataset, our method outperforms the state of the art by remarkable margins while having real-time capability. Conclusion: We achieve real-time 3D object detection by proposing an improved convolutional neural network (CNN) based on image-driven point clouds.

A 3D Convolutional Neural Network Towards Real-Time Amodal 3D Object Detection

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) ◽

10.1109/iros.2018.8593837 ◽

2018 ◽

Author(s):

Hao Sun ◽

Zehui Meng ◽

Xinxin Du ◽

Marcelo H. Ang

Keyword(s):

Neural Network ◽

Object Detection ◽

Real Time ◽

3D Object ◽

3D Object Detection Algorithm for Panoramic Images With Multi-Scale Convolutional Neural Network

IEEE Access ◽

10.1109/access.2019.2955995 ◽

2019 ◽

Vol 7 ◽

pp. 171461-171470

Author(s):

Dianwei Wang ◽

Yanhui He ◽

Ying Liu ◽

Daxiang Li ◽

Shiqian Wu ◽

...

Keyword(s):

Neural Network ◽

Object Detection ◽

Detection Algorithm ◽

3D Object ◽

Multi Scale ◽

Panoramic Images ◽

PIXOR: Real-time 3D Object Detection from Point Clouds

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition ◽

10.1109/cvpr.2018.00798 ◽

2018 ◽

Cited By ~ 141

Author(s):

Bin Yang ◽

Wenjie Luo ◽

Raquel Urtasun

Keyword(s):

Object Detection ◽

Real Time ◽

Point Clouds ◽

3D Object ◽

Optimization of the PointPillars network for 3D object detection in point clouds

10.36227/techrxiv.12593555.v1 ◽

2020 ◽

Author(s):

Joanna Stanisz ◽

Konrad Lis ◽

Tomasz Kryjak ◽

Marek Gorgon

Keyword(s):

Object Detection ◽

Point Cloud ◽

Main Part ◽

Point Clouds ◽

Lidar Data ◽

Detection Accuracy ◽

3D Object ◽

Fold Reduction ◽

Low Energy Consumption ◽

In this paper we present our research on the optimisation of a deep neural network for 3D object detection in a point cloud. Techniques like quantisation and pruning available in the Brevitas and PyTorch tools were used. We performed the experiments for the PointPillars network, which offers a reasonable compromise between detection accuracy and calculation complexity. The aim of this work was to propose a variant of the network which we will ultimately implement in an FPGA device. This will allow for real-time LiDAR data processing with low energy consumption. The obtained results indicate that even a significant quantisation from 32-bit floating point to 2-bit integer in the main part of the algorithm, results in 5%-9% decrease of the detection accuracy, while allowing for almost a 16-fold reduction in size of the model.

Convolutional Neural Network Using for Multi-Sensor 3D Object Detection

Journal of Physics Conference Series ◽

10.1088/1742-6596/1979/1/012020 ◽

2021 ◽

Vol 1979 (1) ◽

pp. 012020

Author(s):

Gadug Sudhansu ◽

A N Mohamed Zabeeulla ◽

M N Nachappa

Keyword(s):

Neural Network ◽

Object Detection ◽

3D Object ◽

A Two-Phase Cross-Modality Fusion Network for Robust 3D Object Detection

Sensors ◽

10.3390/s20216043 ◽

2020 ◽

Vol 20 (21) ◽

pp. 6043

Author(s):

Yujun Jiao ◽

Zhishuai Yin

Keyword(s):

Object Detection ◽

Point Cloud ◽

Point Clouds ◽

Second Phase ◽

Two Phase ◽

3D Object ◽

Rgb Images ◽

Fusion Scheme ◽

3D Object Detection ◽

Level Fusion

A two-phase cross-modality fusion detector is proposed in this study for robust and high-precision 3D object detection with RGB images and LiDAR point clouds. First, a two-stream fusion network is built into the framework of Faster RCNN to perform accurate and robust 2D detection. The visible stream takes the RGB images as inputs, while the intensity stream is fed with the intensity maps which are generated by projecting the reflection intensity of point clouds to the front view. A multi-layer feature-level fusion scheme is designed to merge multi-modal features across multiple layers in order to enhance the expressiveness and robustness of the produced features upon which region proposals are generated. Second, a decision-level fusion is implemented by projecting 2D proposals to the space of the point cloud to generate 3D frustums, on the basis of which the second-phase 3D detector is built to accomplish instance segmentation and 3D-box regression on the filtered point cloud. The results on the KITTI benchmark show that features extracted from RGB images and intensity maps complement each other, and our proposed detector achieves state-of-the-art performance on 3D object detection with a substantially lower running time as compared to available competitors.

Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception

Sensors ◽

10.3390/s19040893 ◽

2019 ◽

Vol 19 (4) ◽

pp. 893 ◽

Cited By ~ 6

Author(s):

Li Wang ◽

Ruifeng Li ◽

Hezi Shi ◽

Jingwen Sun ◽

Lijun Zhao ◽

...

Keyword(s):

Neural Network ◽

Object Detection ◽

Environmental Perception ◽

Service Robots ◽

Service Robot ◽

Abstract Concepts ◽

3D Object ◽

Long Time ◽

Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abstract concepts, such as objects and scenes. Moreover, the 2D object detection based on images always fails to provide the actual position and size of an object, which is quite important for a robot’s operation. In this paper, we focus on the 3D object detection to regress the object’s category, 3D size, and spatial position through a convolutional neural network (CNN). We propose a multi-channel CNN for 3D object detection, which fuses three input channels including RGB, depth, and bird’s eye view (BEV) images. We also propose a method to generate 3D proposals based on 2D ones in the RGB image and semantic prior. Training and test are conducted on the modified NYU V2 dataset and SUN RGB-D dataset in order to verify the effectiveness of the algorithm. We also carry out the actual experiments in a service robot to utilize the proposed 3D object detection method to enhance the environmental perception of the robot.