Limited Receptive Field Network for Real-Time Driving Scene Semantic Segmentation

Mixed reality (MR) is rapidly becoming a vital tool, not just in gaming, but also in education, medicine, construction and environmental management. The term refers to systems in which computer-generated content is superimposed over objects in a real-world environment across one or more sensory modalities. Although most of us have heard of the use of MR in computer games, it also has applications in military and aviation training, as well as tourism, healthcare and more. In addition, it has the potential for use in architecture and design, where buildings can be superimposed in existing locations to render 3D generations of plans. However, one major challenge that remains in MR development is the issue of real-time occlusion. This refers to hiding 3D virtual objects behind real articles. Dr Tomohiro Fukuda, who is based at the Division of Sustainable Energy and Environmental Engineering, Graduate School of Engineering at Osaka University in Japan, is an expert in this field. Researchers, led by Dr Tomohiro Fukuda, are tackling the issue of occlusion in MR. They are currently developing a MR system that realises real-time occlusion by harnessing deep learning to achieve an outdoor landscape design simulation using a semantic segmentation technique. This methodology can be used to automatically estimate the visual environment prior to and after construction projects.

Download Full-text

A lightweight network with attention decoder for real-time semantic segmentation

The Visual Computer ◽

10.1007/s00371-021-02115-4 ◽

2021 ◽

Author(s):

Kang Wang ◽

Jinfu Yang ◽

Shuai Yuan ◽

Mingai Li

Keyword(s):

Real Time ◽

Semantic Segmentation

Download Full-text

Real-time 2D–3D door detection and state classification on a low-power device

SN Applied Sciences ◽

10.1007/s42452-021-04588-3 ◽

2021 ◽

Vol 3 (5) ◽

Author(s):

João Gaspar Ramôa ◽

Vasco Lopes ◽

Luís A. Alexandre ◽

S. Mogo

Keyword(s):

Low Power ◽

Real Time ◽

Object Classification ◽

Semantic Segmentation ◽

Detection Algorithm ◽

Power Device ◽

Indoor Environments ◽

State Classification ◽

Segmentation Algorithms ◽

Indoor Spaces

AbstractIn this paper, we propose three methods for door state classification with the goal to improve robot navigation in indoor spaces. These methods were also developed to be used in other areas and applications since they are not limited to door detection as other related works are. Our methods work offline, in low-powered computers as the Jetson Nano, in real-time with the ability to differentiate between open, closed and semi-open doors. We use the 3D object classification, PointNet, real-time semantic segmentation algorithms such as, FastFCN, FC-HarDNet, SegNet and BiSeNet, the object detection algorithm, DetectNet and 2D object classification networks, AlexNet and GoogleNet. We built a 3D and RGB door dataset with images from several indoor environments using a 3D Realsense camera D435. This dataset is freely available online. All methods are analysed taking into account their accuracy and the speed of the algorithm in a low powered computer. We conclude that it is possible to have a door classification algorithm running in real-time on a low-power device.

Download Full-text

Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation

International Journal of Computer Vision ◽

10.1007/s11263-021-01433-3 ◽

2021 ◽

Author(s):

Peng Sun ◽

Jiaxiang Wu ◽

Songyuan Li ◽

Peiwen Lin ◽

Junzhou Huang ◽

...

Keyword(s):

Real Time ◽

Semantic Segmentation ◽

Joint Decision ◽

Feature Aggregation

Download Full-text

Real-time Semantic Segmentation with Context Aggregation Network

ISPRS Journal of Photogrammetry and Remote Sensing ◽

10.1016/j.isprsjprs.2021.06.006 ◽

2021 ◽

Vol 178 ◽

pp. 124-134

Author(s):

Michael Ying Yang ◽

Saumya Kumaar ◽

Ye Lyu ◽

Francesco Nex

Keyword(s):

Real Time ◽

Semantic Segmentation

Download Full-text

SPMNet: A light-weighted network with separable pyramid module for real-time semantic segmentation

Journal of Experimental & Theoretical Artificial Intelligence ◽

10.1080/0952813x.2021.1908432 ◽

2021 ◽

pp. 1-12

Author(s):

Shiwei Gao ◽

Changzhu Zhang ◽

Zhuping Wang ◽

Hao Zhang ◽

Chao Huang

Keyword(s):

Real Time ◽

Semantic Segmentation ◽

Weighted Network

Download Full-text

RSNet: Rail semantic segmentation network for extracting aerial railroad images

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-210349 ◽

2021 ◽

pp. 1-18

Author(s):

R.S. Rampriya ◽

Sabarinathan ◽

R. Suganya

Keyword(s):

Real Time ◽

Visual Processing ◽

Feature Fusion ◽

Semantic Segmentation ◽

Vital Role ◽

Obstacle Detection ◽

Aerial Images ◽

Computationally Efficient ◽

Fusion Algorithm ◽

Uav Images

In the near future, combo of UAV (Unmanned Aerial Vehicle) and computer vision will play a vital role in monitoring the condition of the railroad periodically to ensure passenger safety. The most significant module involved in railroad visual processing is obstacle detection, in which caution is obstacle fallen near track gage inside or outside. This leads to the importance of detecting and segment the railroad as three key regions, such as gage inside, rails, and background. Traditional railroad segmentation methods depend on either manual feature selection or expensive dedicated devices such as Lidar, which is typically less reliable in railroad semantic segmentation. Also, cameras mounted on moving vehicles like a drone can produce high-resolution images, so segmenting precise pixel information from those aerial images has been challenging due to the railroad surroundings chaos. RSNet is a multi-level feature fusion algorithm for segmenting railroad aerial images captured by UAV and proposes an attention-based efficient convolutional encoder for feature extraction, which is robust and computationally efficient and modified residual decoder for segmentation which considers only essential features and produces less overhead with higher performance even in real-time railroad drone imagery. The network is trained and tested on a railroad scenic view segmentation dataset (RSSD), which we have built from real-time UAV images and achieves 0.973 dice coefficient and 0.94 jaccard on test data that exhibits better results compared to the existing approaches like a residual unit and residual squeeze net.

Download Full-text