PNNPU: A 11.9 TOPS/W High-speed 3D Point Cloud-based Neural Network Processor with Block-based Point Processing for Regular DRAM Access

Author(s):  
Sangjin Kim ◽  
Juhyoung Lee ◽  
Dongseok Im ◽  
Hoi-Jun Yoo


Author(s):  
Zhiyong Gao ◽  
Jianhong Xiang

Background: When detecting objects directly from a 3D point cloud, the natural 3D patterns and invariances of the data are often obscured. Objective: In this work, we aimed to study 3D object detection from discrete, disordered, and sparse 3D point clouds. Methods: The CNN is composed of the frustum sequence module, the 3D instance segmentation module S-NET, the 3D point cloud transformation module T-NET, and the 3D bounding box estimation module E-NET. The search space of the object is determined by the frustum sequence module. Instance segmentation of the point cloud is performed by the 3D instance segmentation module. The 3D coordinates of the object are confirmed by the transformation module and the 3D bounding box estimation module. Results: Evaluated on the KITTI benchmark dataset, our method outperforms the state of the art by remarkable margins while retaining real-time capability. Conclusion: We achieve real-time 3D object detection by proposing an improved convolutional neural network (CNN) based on image-driven point clouds.
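
As a rough illustration of the modular pipeline this abstract describes, the sketch below chains per-point networks for segmentation (S-NET), coordinate transformation (T-NET), and box estimation (E-NET) over points inside a single image-driven frustum. The module classes, layer sizes, and the 512-point frustum are hypothetical placeholders, not the authors' architecture.

```python
# A minimal sketch (not the authors' code) of the modular pipeline described above.
# Shapes and layer sizes are illustrative assumptions.
import torch
import torch.nn as nn

class PointMLP(nn.Module):
    """Shared per-point MLP followed by max-pooling, PointNet-style."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(in_dim, 64), nn.ReLU(),
                                 nn.Linear(64, out_dim))
    def forward(self, pts):                       # pts: (B, N, in_dim)
        feat = self.mlp(pts)                      # (B, N, out_dim)
        return feat.max(dim=1).values             # global feature (B, out_dim)

class FrustumDetector(nn.Module):
    """Chains the segmentation, transformation and box-estimation modules."""
    def __init__(self):
        super().__init__()
        self.s_net = nn.Sequential(nn.Linear(3, 64), nn.ReLU(),
                                   nn.Linear(64, 1))   # per-point foreground logit
        self.t_net = PointMLP(3, 3)                    # predicts a 3D centre shift
        self.e_net = PointMLP(3, 7)                    # box: centre(3) + size(3) + yaw(1)

    def forward(self, frustum_pts):                # (B, N, 3) points inside one frustum
        seg_logit = self.s_net(frustum_pts)        # instance segmentation scores
        mask = torch.sigmoid(seg_logit) > 0.5
        obj_pts = frustum_pts * mask               # zero out background points
        shift = self.t_net(obj_pts)                # coordinate-frame correction
        centred = obj_pts - shift.unsqueeze(1)
        box = self.e_net(centred)                  # 3D bounding-box parameters
        return seg_logit, box

if __name__ == "__main__":
    model = FrustumDetector()
    pts = torch.randn(2, 512, 3)                   # two frustums of 512 points each
    seg, box = model(pts)
    print(seg.shape, box.shape)                    # (2, 512, 1) and (2, 7)
```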


Nutrients ◽  
2018 ◽  
Vol 10 (12) ◽  
pp. 2005 ◽  
Author(s):  
Frank Lo ◽  
Yingnan Sun ◽  
Jianing Qiu ◽  
Benny Lo

An objective dietary assessment system can help users to understand their dietary behavior and enable targeted interventions to address underlying health problems. To accurately quantify dietary intake, measurement of the portion size or food volume is required. For volume estimation, previous research has mostly focused on model-based or stereo-based approaches, which rely on manual intervention or require users to capture multiple frames from different viewing angles, both of which can be tedious. In this paper, a view synthesis approach based on deep learning is proposed to reconstruct 3D point clouds of food items and estimate the volume from a single depth image. A distinct neural network is designed to use a depth image from one viewing angle to predict the depth image that would be captured from the corresponding opposite viewing angle. The whole 3D point cloud map is then reconstructed by fusing the initial data points with the synthesized points of the object items through the proposed point cloud completion and Iterative Closest Point (ICP) algorithms. Furthermore, a database of depth images of food items captured from different viewing angles is constructed by image rendering and used to validate the proposed neural network. The methodology is then evaluated by comparing the volume estimated from the synthesized 3D point cloud with the ground-truth volume of the object items.
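
The fusion-and-measurement step described here can be illustrated with a small self-contained sketch: a basic point-to-point ICP aligns a synthesized opposite-view cloud to the observed cloud, and the merged cloud's convex hull gives a volume estimate. The toy clouds, the hand-rolled ICP, and the convex-hull volume are illustrative assumptions, not the paper's completion network or evaluation protocol.

```python
# A minimal sketch, not the paper's implementation: fuse an observed cloud with a
# synthesized opposite-view cloud via basic ICP and estimate volume from the convex hull.
import numpy as np
from scipy.spatial import ConvexHull, cKDTree

def icp_align(source, target, iters=20):
    """Rigidly align `source` (N, 3) to `target` (M, 3) with point-to-point ICP."""
    src = source.copy()
    for _ in range(iters):
        idx = cKDTree(target).query(src)[1]      # nearest-neighbour correspondences
        tgt = target[idx]
        src_c, tgt_c = src.mean(0), tgt.mean(0)  # best-fit rigid transform via SVD (Kabsch)
        H = (src - src_c).T @ (tgt - tgt_c)
        U, _, Vt = np.linalg.svd(H)
        R = Vt.T @ U.T
        if np.linalg.det(R) < 0:                 # avoid reflections
            Vt[-1] *= -1
            R = Vt.T @ U.T
        t = tgt_c - R @ src_c
        src = src @ R.T + t
    return src

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # toy stand-ins for the observed front-view cloud and the synthesized back-view cloud
    front = rng.uniform(-1, 1, (500, 3))
    back = front @ np.array([[0.999, -0.04, 0], [0.04, 0.999, 0], [0, 0, 1]]) + 0.02
    fused = np.vstack([front, icp_align(back, front)])
    print("estimated volume:", ConvexHull(fused).volume)
```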


2020 ◽  
Vol 10 (2) ◽  
pp. 617
Author(s):  
Jo ◽  
Moon

In this paper, a Collision Grid Map (CGM) generated from 3D point cloud data is proposed to predict collisions between cattle and the end effector of a manipulator in a barn environment. The Collision Grid Map, generated from x-y plane locations and depth (z) data in the 3D point cloud, is applied to a Convolutional Neural Network to predict collision situations. Raw 3D point cloud data suffer from the permutation-invariance problem: the same points presented in different orders are not learned efficiently when fed directly to a Convolutional Neural Network. The Collision Grid Map is therefore generated from the point cloud data based on a probability method. The Collision Grid Map scheme is composed of two channels: the first channel is constructed from location data in the x-y plane, and the second channel is composed of depth data in the z-direction. A 3D point cloud is measured in a barn environment and used to create a Collision Grid Map, which is then applied to the Convolutional Neural Network to predict collisions with cattle. The experimental results show that the proposed scheme is reliable and robust in a barn environment.
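
A minimal sketch of how such a two-channel grid might be rasterised from a point cloud is shown below, assuming a hypothetical 64x64 resolution and workspace bounds: channel 0 holds a normalised x-y occupancy, channel 1 the closest depth per cell. It illustrates the idea only and is not the paper's probability-based construction.

```python
# Minimal two-channel Collision-Grid-Map sketch; resolution and bounds are assumptions.
import numpy as np

def collision_grid_map(points, bounds=((-2.0, 2.0), (-2.0, 2.0)), res=64, z_max=3.0):
    """points: (N, 3) array of x, y, z samples from the depth sensor."""
    (xmin, xmax), (ymin, ymax) = bounds
    grid = np.zeros((2, res, res), dtype=np.float32)
    grid[1] = z_max                                    # start every cell at "far away"
    # map x-y coordinates to integer cell indices
    xi = np.clip(((points[:, 0] - xmin) / (xmax - xmin) * res).astype(int), 0, res - 1)
    yi = np.clip(((points[:, 1] - ymin) / (ymax - ymin) * res).astype(int), 0, res - 1)
    counts = np.zeros((res, res), dtype=np.float32)
    np.add.at(counts, (xi, yi), 1.0)
    grid[0] = counts / max(counts.max(), 1.0)          # normalised hit count as occupancy
    np.minimum.at(grid[1], (xi, yi), points[:, 2])     # closest obstacle depth per cell
    return grid                                        # (2, res, res), ready for a CNN

if __name__ == "__main__":
    pts = np.random.default_rng(0).uniform([-2, -2, 0], [2, 2, 3], (10_000, 3))
    print(collision_grid_map(pts).shape)               # (2, 64, 64)
```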


Author(s):  
W. Yuan ◽  
X. Yuan ◽  
Z. Fan ◽  
Z. Guo ◽  
X. Shi ◽  
...  

Abstract. Building Change Detection (BCD) via multi-temporal remote sensing images is essential for various applications such as urban monitoring, urban planning, and disaster assessment. However, most building change detection approaches extract features only from the remote sensing images themselves for change index determination, and therefore cannot detect subtle changes in small buildings. Given co-registered multi-temporal remote sensing images, illumination variations and misregistration errors also lead to inaccurate change detection results. This study investigates the applicability of multi-feature fusion, combining 2D features extracted directly from remote sensing images with 3D features extracted from the 3D point cloud generated by dense image matching (DIM), for accurate building change index generation. This paper introduces a graph neural network (GNN) based end-to-end learning framework for building change detection. The proposed framework includes feature extraction, feature fusion, and change index prediction. It starts with a pre-trained VGG-16 network as a backbone and uses a five-layer U-net architecture for feature map extraction. The extracted 2D and 3D features are fed into the GNN-based feature fusion part. In the GNN part, we introduce a flexible, attention-based context aggregation mechanism to address the illumination variations and misregistration errors, enabling the framework to jointly reason about image-based texture information and the depth information introduced by the DIM-generated 3D point cloud. After that, the affinity matrix generated by the GNN is used for change index determination through the Hungarian algorithm. The experiment conducted on a dataset covering the Setagaya-Ku area of Tokyo shows that the change map generated by the proposed method achieved a precision of 0.762 and an F1-score of 0.68 at the pixel level. Compared to traditional image-based change detection methods, our approach learns a prior over geometric structure from the real 3D world, which makes it robust to misregistration errors. Compared to CNN-based methods, the proposed method learns to fuse 2D and 3D features to represent more comprehensive information for building change index determination. The experimental comparison demonstrates that the proposed approach outperforms both traditional methods and CNN-based methods.
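
The two mechanisms named in this abstract, attention-based fusion of 2D and 3D features and Hungarian assignment over an affinity matrix, can be sketched in a few lines. The feature dimensions, the toy affinity construction, and the change-flag threshold below are illustrative assumptions, not the authors' network.

```python
# Minimal sketch of attention-style 2D/3D feature fusion plus Hungarian assignment.
import numpy as np
from scipy.optimize import linear_sum_assignment

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_fuse(feat_2d, feat_3d):
    """feat_2d, feat_3d: (N, D) features for N locations; returns fused (N, D) features."""
    stacked = np.stack([feat_2d, feat_3d], axis=1)                               # (N, 2, D)
    scores = stacked @ stacked.mean(axis=1, keepdims=True).transpose(0, 2, 1)    # (N, 2, 1)
    weights = softmax(scores, axis=1)                  # attention over the two modalities
    return (weights * stacked).sum(axis=1)             # (N, D) fused descriptor

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    f2d_t1, f3d_t1 = rng.normal(size=(6, 16)), rng.normal(size=(6, 16))
    f2d_t2, f3d_t2 = rng.normal(size=(6, 16)), rng.normal(size=(6, 16))
    fused_t1 = attention_fuse(f2d_t1, f3d_t1)          # epoch-1 descriptors
    fused_t2 = attention_fuse(f2d_t2, f3d_t2)          # epoch-2 descriptors
    affinity = fused_t1 @ fused_t2.T                   # (6, 6) similarity matrix
    rows, cols = linear_sum_assignment(-affinity)      # maximise total affinity
    changed = affinity[rows, cols] < 0.0               # low-affinity matches flag a change
    print(list(zip(rows, cols, changed)))
```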


Author(s):  
Zhaoyun Sun ◽  
Xueli Hao ◽  
Wei Li ◽  
Ju Huyan ◽  
Hongchao Sun

To overcome the limitations of pavement skid resistance prediction based on the friction coefficient, a Genetic-Algorithm-Improved Neural Network (GAI-NN) was developed in this study. First, three-dimensional (3D) point-cloud data of an asphalt pavement surface were obtained using a smart sensor (Gocator 3110). The friction coefficient of the pavement was then obtained using a pendulum friction tester. The 3D point-cloud dataset was analyzed to recover missing data and perform denoising; in particular, missing data were filled using cubic-spline interpolation. Parameters for texture characterization were defined, and methods for computing them were developed. Finally, the GAI-NN model was developed via modification of the network weights and thresholds. The test results indicated that, using 3D pavement surface texture data, the GAI-NN was capable of predicting the pavement friction coefficient with sufficient accuracy, yielding an error of 12.1%.
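
Two of the steps described here, cubic-spline recovery of missing profile data and genetic-algorithm tuning of a small network's weights and thresholds, are sketched below on synthetic data. The profile, the four texture parameters, the network size, and the GA hyper-parameters are all illustrative assumptions, not the study's dataset or model.

```python
# Minimal sketch: spline-fill a texture profile, then GA-tune a tiny regression network.
import numpy as np
from scipy.interpolate import CubicSpline

rng = np.random.default_rng(0)

# 1. cubic-spline recovery of missing profile samples
x = np.arange(100, dtype=float)
profile = np.sin(x / 10.0) + 0.05 * rng.normal(size=100)
missing = rng.choice(100, size=15, replace=False)
valid = np.setdiff1d(np.arange(100), missing)
profile[missing] = CubicSpline(valid, profile[valid])(missing)

# 2. toy training data: 4 texture parameters -> friction coefficient
X = rng.normal(size=(200, 4))
y = 0.6 + 0.1 * np.tanh(X @ np.array([0.4, -0.3, 0.2, 0.1]))

def predict(w, X):
    """One hidden layer of 8 units; `w` is the flat genome of weights and biases."""
    W1, b1, W2, b2 = w[:32].reshape(4, 8), w[32:40], w[40:48], w[48]
    return np.tanh(X @ W1 + b1) @ W2 + b2

def fitness(w):
    return -np.mean((predict(w, X) - y) ** 2)          # higher is better

# 3. genetic algorithm over the 49-value genome (elitism + Gaussian mutation)
pop = rng.normal(scale=0.5, size=(60, 49))
for gen in range(200):
    scores = np.array([fitness(w) for w in pop])
    parents = pop[np.argsort(scores)[-20:]]            # keep the best 20
    children = parents[rng.integers(0, 20, 40)] + rng.normal(scale=0.05, size=(40, 49))
    pop = np.vstack([parents, children])
best = pop[np.argmax([fitness(w) for w in pop])]
print("relative error: %.1f%%" % (100 * np.mean(np.abs(predict(best, X) - y) / y)))
```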

