A Comparative Study of Real-Time Semantic Segmentation for Autonomous Driving

As the techniques of autonomous driving become increasingly valued and universal, real-time semantic segmentation has become very popular and challenging in the field of deep learning and computer vision in recent years. However, in order to apply the deep learning model to edge devices accompanying sensors on vehicles, we need to design a structure that has the best trade-off between accuracy and inference time. In previous works, several methods sacrificed accuracy to obtain a faster inference time, while others aimed to find the best accuracy under the condition of real time. Nevertheless, the accuracies of previous real-time semantic segmentation methods still have a large gap compared to general semantic segmentation methods. As a result, we propose a network architecture based on a dual encoder and a self-attention mechanism. Compared with preceding works, we achieved a 78.6% mIoU with a speed of 39.4 FPS with a 1024 × 2048 resolution on a Cityscapes test submission.

Download Full-text

A Comparative Study of Recent Real Time Semantic Segmentation Algorithms for Visual Semantic SLAM

2020 IEEE International Conference on Big Data and Smart Computing (BigComp) ◽

10.1109/bigcomp48618.2020.00-22 ◽

2020 ◽

Author(s):

Zeeshan Javed ◽

Gon-Woo Kim

Keyword(s):

Comparative Study ◽

Real Time ◽

Semantic Segmentation ◽

Segmentation Algorithms

Download Full-text

RTSeg: Real-Time Semantic Segmentation Comparative Study

2018 25th IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip.2018.8451495 ◽

2018 ◽

Cited By ~ 29

Author(s):

Mennatullah Siam ◽

Mostafa Gamal ◽

Moemen Abdel-Razek ◽

Senthil Yogamani ◽

Martin Jagersand

Keyword(s):

Comparative Study ◽

Real Time ◽

Semantic Segmentation

Download Full-text

Real-Time LiDAR Point Cloud Semantic Segmentation for Autonomous Driving

Electronics ◽

10.3390/electronics11010011 ◽

2021 ◽

Vol 11 (1) ◽

pp. 11

Author(s):

Xing Xie ◽

Lin Bai ◽

Xinming Huang

Keyword(s):

Real Time ◽

Power Efficiency ◽

Point Cloud ◽

Processing Time ◽

State Of The Art ◽

Semantic Segmentation ◽

Autonomous Driving ◽

Geometric Information ◽

Embedded Platform ◽

Gpu Implementation

LiDAR has been widely used in autonomous driving systems to provide high-precision 3D geometric information about the vehicle’s surroundings for perception, localization, and path planning. LiDAR-based point cloud semantic segmentation is an important task with a critical real-time requirement. However, most of the existing convolutional neural network (CNN) models for 3D point cloud semantic segmentation are very complex and can hardly be processed at real-time on an embedded platform. In this study, a lightweight CNN structure was proposed for projection-based LiDAR point cloud semantic segmentation with only 1.9 M parameters that gave an 87% reduction comparing to the state-of-the-art networks. When evaluated on a GPU, the processing time was 38.5 ms per frame, and it achieved a 47.9% mIoU score on Semantic-KITTI dataset. In addition, the proposed CNN is targeted on an FPGA using an NVDLA architecture, which results in a 2.74x speedup over the GPU implementation with a 46 times improvement in terms of power efficiency.

Download Full-text

Point Cloud Semantic Segmentation with Cross-Correction Features

10.21203/rs.3.rs-1218117/v1 ◽

2022 ◽

Author(s):

Yuehua Zhao ◽

Ma Jie ◽

Chong Nannan ◽

Wen Junjie

Keyword(s):

Real Time ◽

Point Cloud ◽

Large Scale ◽

Spatial Information ◽

Semantic Segmentation ◽

Autonomous Driving ◽

Basic Unit ◽

Semantic Features ◽

Point Cloud Segmentation ◽

Scale Point

Abstract Real time large scale point cloud segmentation is an important but challenging task for practical application like autonomous driving. Existing real time methods have achieved acceptance performance by aggregating local information. However, most of them only exploit local spatial information or local semantic information dependently, few considering the complementarity of both. In this paper, we propose a model named Spatial-Semantic Incorporation Network (SSI-Net) for real time large scale point cloud segmentation. A Spatial-Semantic Cross-correction (SSC) module is introduced in SSI-Net as a basic unit. High quality contextual features can be learned through SSC by correct and update semantic features using spatial cues, and vice verse. Adopting the plug-and-play SSC module, we design SSI-Net as an encoder-decoder architecture. To ensure efficiency, it also adopts a random sample based hierarchical network structure. Extensive experiments on several prevalent datasets demonstrate that our method can achieve state-of-the-art performance.

Download Full-text

Real-time object detection and semantic segmentation for autonomous driving

MIPPR 2017: Automatic Target Recognition and Navigation ◽

10.1117/12.2288713 ◽

2018 ◽

Author(s):

Weichao Xu ◽

Baojun Li ◽

Sun Liu ◽

Wei Qiu

Keyword(s):

Object Detection ◽

Real Time ◽

Semantic Segmentation ◽

Autonomous Driving

Download Full-text

Implementation of a Lightweight Semantic Segmentation Algorithm in Road Obstacle Detection

Sensors ◽

10.3390/s20247089 ◽

2020 ◽

Vol 20 (24) ◽

pp. 7089

Author(s):

Bushi Liu ◽

Yongbo Lv ◽

Yang Gu ◽

Wanjun Lv

Keyword(s):

Real Time ◽

Spatial Information ◽

Feature Fusion ◽

Semantic Segmentation ◽

Spatial Location ◽

Autonomous Driving ◽

Obstacle Detection ◽

Depth Information ◽

Long Time ◽

Deep Learning Network

Due to deep learning’s accurate cognition of the street environment, the convolutional neural network has achieved dramatic development in the application of street scenes. Considering the needs of autonomous driving and assisted driving, in a general way, computer vision technology is used to find obstacles to avoid collisions, which has made semantic segmentation a research priority in recent years. However, semantic segmentation has been constantly facing new challenges for quite a long time. Complex network depth information, large datasets, real-time requirements, etc., are typical problems that need to be solved urgently in the realization of autonomous driving technology. In order to address these problems, we propose an improved lightweight real-time semantic segmentation network, which is based on an efficient image cascading network (ICNet) architecture, using multi-scale branches and a cascaded feature fusion unit to extract rich multi-level features. In this paper, a spatial information network is designed to transmit more prior knowledge of spatial location and edge information. During the course of the training phase, we append an external loss function to enhance the learning process of the deep learning network system as well. This lightweight network can quickly perceive obstacles and detect roads in the drivable area from images to satisfy autonomous driving characteristics. The proposed model shows substantial performance on the Cityscapes dataset. With the premise of ensuring real-time performance, several sets of experimental comparisons illustrate that SP-ICNet enhances the accuracy of road obstacle detection and provides nearly ideal prediction outputs. Compared to the current popular semantic segmentation network, this study also demonstrates the effectiveness of our lightweight network for road obstacle detection in autonomous driving.

Download Full-text

DNS: A multi-scale deconvolution semantic segmentation network for joint detection and segmentation

MATEC Web of Conferences ◽

10.1051/matecconf/201927702005 ◽

2019 ◽

Vol 277 ◽

pp. 02005

Author(s):

Ning Feng ◽

Le Dong ◽

Qianni Zhang ◽

Ning Zhang ◽

Xi Wu ◽

...

Keyword(s):

Image Analysis ◽

Object Detection ◽

Real Time ◽

Medical Image ◽

Medical Image Analysis ◽

Semantic Segmentation ◽

Autonomous Driving ◽

Joint Detection ◽

Multi Scale ◽

Segmentation Task

Real-time semantic segmentation has become crucial in many applications such as medical image analysis and autonomous driving. In this paper, we introduce a single semantic segmentation network, called DNS, for joint object detection and segmentation task. We take advantage of multi-scale deconvolution mechanism to perform real time computations. To this goal, down-scale and up-scale streams are utilized to combine the multi-scale features for the final detection and segmentation task. By using the proposed DNS, not only the tradeoff between accuracy and cost but also the balance of detection and segmentation performance are settled. Experimental results for PASCAL VOC datasets show competitive performance for joint object detection and segmentation task.

Download Full-text

REAL-TIME SEMANTIC SLAM WITH DCNN-BASED FEATURE POINT DETECTION, MATCHING AND DENSE POINT CLOUD AGGREGATION

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b2-2021-399-2021 ◽

2021 ◽

Vol XLIII-B2-2021 ◽

pp. 399-404

Author(s):

B. Vishnyakov ◽

I. Sgibnev ◽

V. Sheverdin ◽

A. Sorokin ◽

P. Masalov ◽

...

Keyword(s):

Neural Networks ◽

Real Time ◽

Semantic Segmentation ◽

Autonomous Driving ◽

Scene Reconstruction ◽

Deep Convolutional Neural Networks ◽

Dense Point ◽

Robotic Vehicle ◽

Semantic Scene ◽

Point Detection

Abstract. In this paper we present the semantic SLAM method based on a bundle of deep convolutional neural networks. It provides real-time dense semantic scene reconstruction for the autonomous driving system of an off-road robotic vehicle. Most state-of-the-art neural networks require large computing resources that go beyond the capabilities of many robotic platforms. We propose an architecture for 3D semantic scene reconstruction on top of the recent progress in computer vision by integrating SuperPoint, SuperGlue, Bi3D, DeepLabV3+, RTM3D and additional module with pre-processing, inference and postprocessing operations performed on GPU. We also updated our simulated dataset for semantic segmentation and added disparity images.

Download Full-text

A Comparative Study for Open Set Semantic Segmentation Methods

10.21528/cbic2021-65 ◽

2021 ◽

Author(s):

Anderson Brilhador ◽

Matheus Gutoski ◽

André Eugênio Lazzaretti ◽

Heitor Silvério Lopes

Keyword(s):

Comparative Study ◽

Semantic Segmentation ◽

Autonomous Driving ◽

Synthetic Dataset ◽

Open World ◽

Open Set ◽

Segmentation Methods ◽

Unseen Objects ◽

World Environment ◽

And Robotics

Typical semantic segmentation methods do not recognize unknown pixels during the test or deployment stage. This capability is critical for open-world environment applications where unseen objects appear all the time. Recently, to solve those limitations, Open Set Semantic Segmentation (OSSS) was introduced. This task aims to produce known and unknown pixels semantic segments. However, due to its recent introduction, few works are found in the literature, and consequently, few datasets are publicly available. This work carried out a comparative study between the existing OSSS methods on a new synthetic dataset of images and the well-known PASCAL VOC 2012 dataset. The compared methods include SoftMax-T, OpenMax-based, and OpenIPCS. The results are encouraging and show some of the advantages and main limitations of each technique. However, in general, they demonstrate that the problem of OSSS remains open and demands further research aiming at real applications, such as autonomous driving and robotics.

Download Full-text