ON-ROAD INFORMATION EXTRACTION FROM LIDAR DATA VIA MULTIPLE FEATURE MAPS

Author(s):  
H. Wu ◽  
Z. Xie ◽  
C. Wen ◽  
C. Wang ◽  
J. Li

Abstract. On-road information, including road boundaries, road markings, and road cracks, provides significant guidance and warnings to all road users. Recently, the extraction of on-road information from LiDAR data has been widely studied. However, for LiDAR data with lower accuracy and higher noise, detailed information, such as road boundaries, is difficult to extract correctly. Furthermore, most previous studies have not explored efficiently extracting multiple kinds of on-road information within a single framework. In this paper, we propose a new framework that can simultaneously extract multiple kinds of on-road information from high-accuracy LiDAR data and can also more robustly extract detailed road boundaries from low-accuracy LiDAR data. First, we propose a Curb-Aware Ground Filter to extract ground points with rich curb structure features. Second, we transform the vertical density, elevation gradient, and intensity features of the ground points into multiple feature maps and extract multiple kinds of on-road information from these feature maps by employing a semantic segmentation network. Experimental results on three datasets with different data accuracies demonstrate that our method outperforms other recent competitive methods.
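The feature-map construction described in the abstract can be sketched as a rasterization of point attributes onto a 2D grid. The cell size, the use of maximum elevation as the basis for the gradient channel, and the per-cell mean intensity are illustrative assumptions, not the authors' exact implementation:

```python
import numpy as np

def points_to_feature_maps(points, cell=0.1):
    """Rasterize ground points (x, y, z, intensity) into 2D feature maps.

    Returns three H x W channels: vertical point density, elevation
    gradient magnitude, and mean intensity per cell. Grid resolution and
    channel definitions are illustrative, not the paper's exact setup.
    """
    xy = points[:, :2]
    origin = xy.min(axis=0)
    idx = np.floor((xy - origin) / cell).astype(int)  # cell index per point
    h, w = idx.max(axis=0) + 1
    density = np.zeros((h, w))
    z_max = np.full((h, w), -np.inf)
    i_sum = np.zeros((h, w))
    for (r, c), z, i in zip(idx, points[:, 2], points[:, 3]):
        density[r, c] += 1                 # vertical density: points per cell
        z_max[r, c] = max(z_max[r, c], z)  # highest elevation in the cell
        i_sum[r, c] += i
    i_mean = np.where(density > 0, i_sum / np.maximum(density, 1), 0.0)
    z_max[~np.isfinite(z_max)] = 0.0
    # elevation gradient: finite differences over the elevation map
    gy, gx = np.gradient(z_max)
    grad = np.hypot(gx, gy)
    return np.stack([density, grad, i_mean])  # (3, H, W) input for a CNN
```

The stacked (3, H, W) array can then be fed to an off-the-shelf semantic segmentation network exactly like a 3-channel image.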

2019 ◽  
Vol 13 (3) ◽  
pp. 285-293
Author(s):  
Hakan Aytaylan ◽  
Seniha Esen Yuksel

Author(s):  
Jingtan Li ◽  
Maolin Xu ◽  
Hongling Xiu

As the resolution of remote sensing images continues to increase, high-resolution remote sensing images are widely used in many fields. Among their applications, information extraction is one of the basic uses of remote sensing images. Faced with massive volumes of high-resolution remote sensing image data, traditional target recognition methods struggle to cope. Therefore, this paper proposes a remote sensing image information extraction method based on the U-net network. First, the U-net semantic segmentation network is trained on the training set while the validation set is used to monitor training, and finally the test set is used for evaluation. The experimental results show that U-net can be applied to the extraction of buildings.
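Because U-net-style networks take fixed-size inputs, massive high-resolution scenes are in practice processed tile by tile. The abstract does not describe the authors' preprocessing, so the following tiling step is a common-practice sketch with illustrative patch and stride values:

```python
import numpy as np

def tile_image(image, patch=256, stride=256):
    """Split a large remote sensing image (H, W, C) into fixed-size patches.

    Each patch is fed to the segmentation network independently; the
    returned (top, left) coordinates allow the predicted masks to be
    stitched back into a full-scene building map.
    """
    h, w = image.shape[:2]
    patches, coords = [], []
    for top in range(0, h - patch + 1, stride):
        for left in range(0, w - patch + 1, stride):
            patches.append(image[top:top + patch, left:left + patch])
            coords.append((top, left))
    return np.stack(patches), coords
```

With an overlapping stride (stride < patch), border artifacts between tiles can be averaged out at stitching time.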


2019 ◽  
Vol 9 (13) ◽  
pp. 2686 ◽  
Author(s):  
Jianming Zhang ◽  
Chaoquan Lu ◽  
Jin Wang ◽  
Lei Wang ◽  
Xiao-Guang Yue

In civil engineering, the stability of concrete is of great significance to the safety of people's lives and property, so it is necessary to detect concrete damage effectively. In this paper, we treat crack detection on concrete surfaces as a semantic segmentation task that distinguishes background from cracks at the pixel level. Inspired by Fully Convolutional Networks (FCN), we propose a fully convolutional network based on dilated convolution for concrete crack detection, which consists of an encoder and a decoder. Specifically, we first use a residual network to extract feature maps from the input image, design dilated convolutions with different dilation rates to extract feature maps with different receptive fields, and fuse the features extracted from the multiple branches. Then, we use stacked deconvolutions to up-sample the fused feature maps. Finally, we use the SoftMax function to classify the feature maps at the pixel level. To verify the validity of the model, we introduce the commonly used evaluation indicators for semantic segmentation: Pixel Accuracy (PA), Mean Pixel Accuracy (MPA), Mean Intersection over Union (MIoU), and Frequency Weighted Intersection over Union (FWIoU). The experimental results show that the proposed model converges faster and generalizes better on the test set by introducing dilated convolutions with different dilation rates and a multi-branch fusion strategy. Our model achieves a PA of 96.84%, an MPA of 92.55%, an MIoU of 86.05%, and an FWIoU of 94.22% on the test set, which is superior to other models.
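The four evaluation indicators named above are all derived from a per-class confusion matrix. A minimal sketch of their standard definitions (the two-class crack/background setting is an example, not the authors' evaluation code):

```python
import numpy as np

def segmentation_metrics(conf):
    """Compute PA, MPA, MIoU, and FWIoU from a confusion matrix.

    conf[i, j] counts pixels of ground-truth class i predicted as class j.
    """
    conf = conf.astype(float)
    diag = np.diag(conf)            # correctly classified pixels per class
    gt = conf.sum(axis=1)           # pixels per ground-truth class
    pred = conf.sum(axis=0)         # pixels per predicted class
    total = conf.sum()
    pa = diag.sum() / total                  # Pixel Accuracy
    mpa = np.mean(diag / gt)                 # Mean Pixel Accuracy
    iou = diag / (gt + pred - diag)          # per-class IoU
    miou = iou.mean()                        # Mean Intersection over Union
    fwiou = ((gt / total) * iou).sum()       # Frequency Weighted IoU
    return pa, mpa, miou, fwiou
```

FWIoU weights each class IoU by its pixel frequency, which is why it tracks PA closely on background-dominated crack images while MIoU penalizes poor crack recovery more heavily.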


Sensors ◽  
2020 ◽  
Vol 20 (6) ◽  
pp. 1737 ◽  
Author(s):  
Tae-young Ko ◽  
Seung-ho Lee

This paper proposes a novel method of semantic segmentation, consisting of a modified dilated residual network, an atrous pyramid pooling module, and backpropagation, that is applicable to augmented reality (AR). In the proposed method, the modified dilated residual network extracts a feature map from the original images while maintaining spatial information. The atrous pyramid pooling module places convolutions in parallel and layers feature maps in a pyramid shape to extract objects occupying small areas of the image; these are converted into one channel using a 1 × 1 convolution. Backpropagation compares the semantic segmentation obtained through convolution from the final feature map with the ground truth provided by a database. Losses can be reduced by applying backpropagation to the modified dilated residual network to update the weights. The proposed method was compared with other methods on the Cityscapes and PASCAL VOC 2012 databases. It achieved accuracies of 82.8 and 89.8 mean intersection over union (mIoU) and frame rates of 61 and 64.3 frames per second (fps) on the Cityscapes and PASCAL VOC 2012 databases, respectively. These results demonstrate the applicability of the proposed method to natural AR applications running in real time, because the frame rate is greater than 60 fps.
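The reason parallel atrous branches see objects at different scales without extra parameters is that a dilated k × k kernel with dilation rate d spans an effective extent of k + (k − 1)(d − 1). A small sketch (the kernel size and rates below are illustrative, not the paper's exact configuration):

```python
def effective_kernel(k, d):
    """Effective spatial extent of a k x k convolution with dilation d."""
    return k + (k - 1) * (d - 1)

# Parallel 3x3 branches with growing dilation rates cover growing
# receptive fields at the same parameter cost, which is what a pyramid
# of atrous convolutions exploits before the 1 x 1 fusion convolution.
branches = {d: effective_kernel(3, d) for d in (1, 6, 12, 18)}
```

With rates (1, 6, 12, 18), the same 3 × 3 kernel spans 3, 13, 25, and 37 pixels respectively, so small objects and large context are captured in one parallel pass.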


Sensors ◽  
2020 ◽  
Vol 20 (12) ◽  
pp. 3568 ◽  
Author(s):  
Takayuki Shinohara ◽  
Haoyi Xiu ◽  
Masashi Matsuoka

In the computer vision field, many 3D deep learning models that directly process 3D point clouds (proposed after PointNet) have been published. Moreover, deep learning-based techniques have demonstrated state-of-the-art performance on supervised learning tasks for 3D point cloud data, such as classification and segmentation tasks on open competition datasets. Furthermore, many researchers have attempted to apply these deep learning-based techniques to 3D point clouds observed by aerial laser scanners (ALSs). However, most of these studies were developed for 3D point clouds without radiometric information. In this paper, we investigate the possibility of using a deep learning method to solve the semantic segmentation task for airborne full-waveform light detection and ranging (lidar) data, which consists of geometric information and radiometric waveform data. We propose a data-driven semantic segmentation model called the full-waveform network (FWNet), which handles the waveform of full-waveform lidar data without any conversion process, such as projection onto a 2D grid or the calculation of handcrafted features. FWNet is based on a PointNet-style architecture, which extracts the local and global features of each input waveform along with its corresponding geographical coordinates. The classifier then consists of 1D convolutional layers, which predict the class vector corresponding to the input waveform from the extracted local and global features. Our trained FWNet achieved higher recall, precision, and F1 scores on unseen test data than previously proposed methods in the full-waveform lidar data analysis domain: specifically, a mean recall of 0.73, a mean precision of 0.81, and a mean F1 score of 0.76. We further performed an ablation study assessing the contribution of each part of our proposed method to the above-mentioned metrics. Moreover, we investigated the effectiveness of our PointNet-based local and global feature extraction by visualizing the feature vectors. In this way, we have shown that our network for local and global feature extraction enables training for semantic segmentation without requiring expert knowledge of full-waveform lidar data or translation into 2D images or voxels.
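The PointNet-style local/global split described above can be sketched with plain matrices: a shared per-point linear map (equivalent to a 1D convolution with kernel size 1) yields local features, and an order-invariant max pooling over all points yields a global feature that is concatenated back onto every point. Shapes and weights below are illustrative, not FWNet's actual layer sizes:

```python
import numpy as np

def pointnet_features(points, w):
    """Sketch of PointNet-style local and global feature extraction.

    points: (N, F) per-point inputs (e.g. coordinates + waveform samples)
    w:      (F, D) weights shared across all points
    Returns (N, 2D) per-point features: local features concatenated with
    the tiled global feature, ready for a 1D convolutional classifier.
    """
    local = np.maximum(points @ w, 0.0)          # (N, D) shared MLP + ReLU
    global_feat = local.max(axis=0)              # (D,) symmetric max pool
    tiled = np.tile(global_feat, (len(points), 1))
    return np.concatenate([local, tiled], axis=1)
```

Because max pooling is symmetric, the global half of the output is identical for any ordering of the input points, which is what lets such a network consume raw, unordered waveform points directly.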

