A Multi-level Feature Fusion Network for Real-time Semantic Segmentation

In the near future, combo of UAV (Unmanned Aerial Vehicle) and computer vision will play a vital role in monitoring the condition of the railroad periodically to ensure passenger safety. The most significant module involved in railroad visual processing is obstacle detection, in which caution is obstacle fallen near track gage inside or outside. This leads to the importance of detecting and segment the railroad as three key regions, such as gage inside, rails, and background. Traditional railroad segmentation methods depend on either manual feature selection or expensive dedicated devices such as Lidar, which is typically less reliable in railroad semantic segmentation. Also, cameras mounted on moving vehicles like a drone can produce high-resolution images, so segmenting precise pixel information from those aerial images has been challenging due to the railroad surroundings chaos. RSNet is a multi-level feature fusion algorithm for segmenting railroad aerial images captured by UAV and proposes an attention-based efficient convolutional encoder for feature extraction, which is robust and computationally efficient and modified residual decoder for segmentation which considers only essential features and produces less overhead with higher performance even in real-time railroad drone imagery. The network is trained and tested on a railroad scenic view segmentation dataset (RSSD), which we have built from real-time UAV images and achieves 0.973 dice coefficient and 0.94 jaccard on test data that exhibits better results compared to the existing approaches like a residual unit and residual squeeze net.

Download Full-text

Real-Time Semantic Segmentation Algorithm Based on Feature Fusion Technology

Laser & Optoelectronics Progress ◽

10.3788/lop57.021011 ◽

2020 ◽

Vol 57 (2) ◽

pp. 021011

Author(s):

蔡雨 Cai Yu ◽

黄学功 Huang Xuegong ◽

张志安 Zhang Zhian ◽

朱新年 Zhu Xinnian ◽

马祥 Ma Xiang

Keyword(s):

Real Time ◽

Feature Fusion ◽

Semantic Segmentation ◽

Segmentation Algorithm ◽

Fusion Technology

Download Full-text

EFRNet: A Lightweight Network with Efficient Feature Fusion and Refinement for Real-Time Semantic Segmentation

2021 IEEE International Conference on Multimedia and Expo (ICME) ◽

10.1109/icme51207.2021.9428371 ◽

2021 ◽

Author(s):

Kuayue Zhang ◽

Qingmin Liao ◽

Juncheng Zhang ◽

Shaojun Liu ◽

Haoyu Ma ◽

...

Keyword(s):

Real Time ◽

Feature Fusion ◽

Semantic Segmentation

Download Full-text

RDFNet: RGB-D Multi-level Residual Feature Fusion for Indoor Semantic Segmentation

2017 IEEE International Conference on Computer Vision (ICCV) ◽

10.1109/iccv.2017.533 ◽

2017 ◽

Cited By ~ 11

Author(s):

Seungyong Lee ◽

Seong-Jin Park ◽

Ki-Sang Hong

Keyword(s):

Feature Fusion ◽

Semantic Segmentation ◽

Multi Level

Download Full-text

Multi-level feature fusion model-based real-time person re-identification for forensics

Journal of Real-Time Image Processing ◽

10.1007/s11554-019-00908-4 ◽

2019 ◽

Vol 17 (1) ◽

pp. 73-81 ◽

Cited By ~ 2

Author(s):

Shiqin Wang ◽

Xin Xu ◽

Lei Liu ◽

Jing Tian

Keyword(s):

Real Time ◽

Feature Fusion ◽

Fusion Model ◽

Model Based ◽

Multi Level

Download Full-text

MFENet: Multi-level feature enhancement network for real-time semantic segmentation

Neurocomputing ◽

10.1016/j.neucom.2020.02.019 ◽

2020 ◽

Vol 393 ◽

pp. 54-65 ◽

Cited By ~ 1

Author(s):

Boxiang Zhang ◽

Wenhui Li ◽

Yuming Hui ◽

Jiayun Liu ◽

Yuanyuan Guan

Keyword(s):

Real Time ◽

Semantic Segmentation ◽

Feature Enhancement ◽

Multi Level

Download Full-text

Implementation of a Lightweight Semantic Segmentation Algorithm in Road Obstacle Detection

Sensors ◽

10.3390/s20247089 ◽

2020 ◽

Vol 20 (24) ◽

pp. 7089

Author(s):

Bushi Liu ◽

Yongbo Lv ◽

Yang Gu ◽

Wanjun Lv

Keyword(s):

Real Time ◽

Spatial Information ◽

Feature Fusion ◽

Semantic Segmentation ◽

Spatial Location ◽

Autonomous Driving ◽

Obstacle Detection ◽

Depth Information ◽

Long Time ◽

Deep Learning Network

Due to deep learning’s accurate cognition of the street environment, the convolutional neural network has achieved dramatic development in the application of street scenes. Considering the needs of autonomous driving and assisted driving, in a general way, computer vision technology is used to find obstacles to avoid collisions, which has made semantic segmentation a research priority in recent years. However, semantic segmentation has been constantly facing new challenges for quite a long time. Complex network depth information, large datasets, real-time requirements, etc., are typical problems that need to be solved urgently in the realization of autonomous driving technology. In order to address these problems, we propose an improved lightweight real-time semantic segmentation network, which is based on an efficient image cascading network (ICNet) architecture, using multi-scale branches and a cascaded feature fusion unit to extract rich multi-level features. In this paper, a spatial information network is designed to transmit more prior knowledge of spatial location and edge information. During the course of the training phase, we append an external loss function to enhance the learning process of the deep learning network system as well. This lightweight network can quickly perceive obstacles and detect roads in the drivable area from images to satisfy autonomous driving characteristics. The proposed model shows substantial performance on the Cityscapes dataset. With the premise of ensuring real-time performance, several sets of experimental comparisons illustrate that SP-ICNet enhances the accuracy of road obstacle detection and provides nearly ideal prediction outputs. Compared to the current popular semantic segmentation network, this study also demonstrates the effectiveness of our lightweight network for road obstacle detection in autonomous driving.

Download Full-text

Semantics-guided multi-level RGB-D feature fusion for indoor semantic segmentation

2017 IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip.2017.8296484 ◽

2017 ◽

Cited By ~ 1

Author(s):

Yabei Li ◽

Junge Zhang ◽

Yanhua Cheng ◽

Kaiqi Huang ◽

Tieniu Tan

Keyword(s):

Feature Fusion ◽

Semantic Segmentation ◽

Multi Level

Download Full-text

MBFFNet: Multi-Branch Feature Fusion Network for Colonoscopy

Frontiers in Bioengineering and Biotechnology ◽

10.3389/fbioe.2021.696251 ◽

2021 ◽

Vol 9 ◽

Author(s):

Houcheng Su ◽

Bin Lin ◽

Xiaoshuang Huang ◽

Jiao Li ◽

Kailin Jiang ◽

...

Keyword(s):

Real Time ◽

Feature Fusion ◽

Rapid Development ◽

Semantic Segmentation ◽

Medical Image Segmentation ◽

Feature Maps ◽

Improve Model ◽

Segmentation Methods ◽

Rectal Polyps ◽

Good Potential

Colonoscopy is currently one of the main methods for the detection of rectal polyps, rectal cancer, and other diseases. With the rapid development of computer vision, deep learning–based semantic segmentation methods can be applied to the detection of medical lesions. However, it is challenging for current methods to detect polyps with high accuracy and real-time performance. To solve this problem, we propose a multi-branch feature fusion network (MBFFNet), which is an accurate real-time segmentation method for detecting colonoscopy. First, we use UNet as the basis of our model architecture and adopt stepwise sampling with channel multiplication to integrate features, which decreases the number of flops caused by stacking channels in UNet. Second, to improve model accuracy, we extract features from multiple layers and resize feature maps to the same size in different ways, such as up-sampling and pooling, to supplement information lost in multiplication-based up-sampling. Based on mIOU and Dice loss with cross entropy (CE), we conduct experiments in both CPU and GPU environments to verify the effectiveness of our model. The experimental results show that our proposed MBFFNet is superior to the selected baselines in terms of accuracy, model size, and flops. mIOU, F score, and Dice loss with CE reached 0.8952, 0.9450, and 0.1602, respectively, which were better than those of UNet, UNet++, and other networks. Compared with UNet, the flop count decreased by 73.2%, and the number of participants also decreased. The actual segmentation effect of MBFFNet is only lower than that of PraNet, the number of parameters is 78.27% of that of PraNet, and the flop count is 0.23% that of PraNet. In addition, experiments on other types of medical tasks show that MBFFNet has good potential for general application in medical image segmentation.

Download Full-text