Multi-scale Semantic Segmentation Enriched Features for Pedestrian Detection

Urban object segmentation and classification tasks are critical data processing steps in scene understanding, intelligent vehicles and 3D high-precision maps. Semantic segmentation of 3D point clouds is the foundational step in object recognition. To identify the intersecting objects and improve the accuracy of classification, this paper proposes a segment-based classification method for 3D point clouds. This method firstly divides points into multi-scale supervoxels and groups them by proposed inverse node graph (IN-Graph) construction, which does not need to define prior information about the node, it divides supervoxels by judging the connection state of edges between them. This method reaches minimum global energy by graph cutting, obtains the structural segments as completely as possible, and retains boundaries at the same time. Then, the random forest classifier is utilized for supervised classification. To deal with the mislabeling of scattered fragments, higher-order CRF with small-label cluster optimization is proposed to refine the classification results. Experiments were carried out on mobile laser scan (MLS) point dataset and terrestrial laser scan (TLS) points dataset, and the results show that overall accuracies of 97.57% and 96.39% were obtained in the two datasets. The boundaries of objects were retained well, and the method achieved a good result in the classification of cars and motorcycles. More experimental analyses have verified the advantages of the proposed method and proved the practicability and versatility of the method.

Download Full-text

Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet

Sensors ◽

10.3390/s21051820 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1820

Author(s):

Xiaotao Shao ◽

Qing Wang ◽

Wei Yang ◽

Yun Chen ◽

Yi Xie ◽

...

Keyword(s):

Semantic Information ◽

Detection System ◽

Pedestrian Detection ◽

Detection Accuracy ◽

The Public ◽

Scale Feature ◽

Detection Algorithms ◽

Multi Scale ◽

Art Works ◽

Feature Pyramid

The existing pedestrian detection algorithms cannot effectively extract features of heavily occluded targets which results in lower detection accuracy. To solve the heavy occlusion in crowds, we propose a multi-scale feature pyramid network based on ResNet (MFPN) to enhance the features of occluded targets and improve the detection accuracy. MFPN includes two modules, namely double feature pyramid network (FPN) integrated with ResNet (DFR) and repulsion loss of minimum (RLM). We propose the double FPN which improves the architecture to further enhance the semantic information and contours of occluded pedestrians, and provide a new way for feature extraction of occluded targets. The features extracted by our network can be more separated and clearer, especially those heavily occluded pedestrians. Repulsion loss is introduced to improve the loss function which can keep predicted boxes away from the ground truths of the unrelated targets. Experiments carried out on the public CrowdHuman dataset, we obtain 90.96% AP which yields the best performance, 5.16% AP gains compared to the FPN-ResNet50 baseline. Compared with the state-of-the-art works, the performance of the pedestrian detection system has been boosted with our method.

Download Full-text

Semantic Segmentation for Pedestrian Detection from Motion in Temporal Domain

2020 25th International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr48806.2021.9411958 ◽

2021 ◽

Author(s):

Guo Cheng ◽

Jiang Yu Zheng

Keyword(s):

Pedestrian Detection ◽

Semantic Segmentation ◽

Temporal Domain

Download Full-text

MLFNet-Point Cloud Semantic Segmentation Convolution Network Based on Multi-scale Feature Fusion

IEEE Access ◽

10.1109/access.2021.3057612 ◽

2021 ◽

pp. 1-1

Author(s):

Jingfang Yang ◽

Bochang Zou ◽

Huadong Qiu ◽

Zhi Li

Keyword(s):

Point Cloud ◽

Feature Fusion ◽

Semantic Segmentation ◽

Scale Feature ◽

Multi Scale

Download Full-text

Real-time Semantic Segmentation Based on Multi-scale Feature Map Joint Pyramid Upsamping

10.1109/aeeca52519.2021.9574190 ◽

2021 ◽

Author(s):

Liang Chao ◽

Wang Xiaoyu ◽

Song Yu ◽

Jiang Changhong

Keyword(s):

Real Time ◽

Semantic Segmentation ◽

Feature Map ◽

Scale Feature ◽

Multi Scale

Download Full-text

Simultaneous Segmentation of Fetal Hearts and Lungs for Medical Ultrasound Images via an Efficient Multi-scale Model Integrated With Attention Mechanism

Ultrasonic Imaging ◽

10.1177/01617346211042526 ◽

2021 ◽

pp. 016173462110425

Author(s):

Jianing Xi ◽

Jiangang Chen ◽

Zhao Wang ◽

Dean Ta ◽

Bing Lu ◽

...

Keyword(s):

Congenital Anomaly ◽

Large Scale ◽

Automatic Segmentation ◽

Receptive Fields ◽

Semantic Segmentation ◽

Attention Mechanism ◽

Scale Model ◽

Ultrasound Images ◽

Multi Scale ◽

Task Irrelevant

Large scale early scanning of fetuses via ultrasound imaging is widely used to alleviate the morbidity or mortality caused by congenital anomalies in fetal hearts and lungs. To reduce the intensive cost during manual recognition of organ regions, many automatic segmentation methods have been proposed. However, the existing methods still encounter multi-scale problem at a larger range of receptive fields of organs in images, resolution problem of segmentation mask, and interference problem of task-irrelevant features, obscuring the attainment of accurate segmentations. To achieve semantic segmentation with functions of (1) extracting multi-scale features from images, (2) compensating information of high resolution, and (3) eliminating the task-irrelevant features, we propose a multi-scale model with skip connection framework and attention mechanism integrated. The multi-scale feature extraction modules are incorporated with additive attention gate units for irrelevant feature elimination, through a U-Net framework with skip connections for information compensation. The performance of fetal heart and lung segmentation indicates the superiority of our method over the existing deep learning based approaches. Our method also shows competitive performance stability during the task of semantic segmentations, showing a promising contribution on ultrasound based prognosis of congenital anomaly in the early intervention, and alleviating the negative effects caused by congenital anomaly.

Download Full-text

Multi-scale Context Intertwining for Semantic Segmentation

Computer Vision – ECCV 2018 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-01219-9_37 ◽

2018 ◽

pp. 622-638 ◽

Cited By ~ 19

Author(s):

Di Lin ◽

Yuanfeng Ji ◽

Dani Lischinski ◽

Daniel Cohen-Or ◽

Hui Huang

Keyword(s):

Semantic Segmentation ◽

Multi Scale

Download Full-text

Multi-scale Spatial Location Preference for Semantic Segmentation

MultiMedia Modeling - Lecture Notes in Computer Science ◽

10.1007/978-3-030-37731-1_48 ◽

2019 ◽

pp. 593-604

Author(s):

Qiuyuan Han ◽

Jin Zheng

Keyword(s):

Semantic Segmentation ◽

Spatial Location ◽

Multi Scale

Download Full-text

Semantic Segmentation Network for Surface Defect Detection of Automobile Wheel Hub Fusing High-Resolution Feature and Multi-Scale Feature

Applied Sciences ◽

10.3390/app112210508 ◽

2021 ◽

Vol 11 (22) ◽

pp. 10508

Author(s):

Chaowei Tang ◽

Xinxin Feng ◽

Haotian Wen ◽

Xu Zhou ◽

Yanqing Shao ◽

...

Keyword(s):

High Resolution ◽

Defect Detection ◽

Automobile Industry ◽

Surface Defect ◽

Semantic Segmentation ◽

The Body ◽

Multi Scale ◽

Surface Defect Detection ◽

Edge Features ◽

Automobile Wheel

Surface defect detection of an automobile wheel hub is important to the automobile industry because these defects directly affect the safety and appearance of automobiles. At present, surface defect detection networks based on convolutional neural network use many pooling layers when extracting features, reducing the spatial resolution of features and preventing the accurate detection of the boundary of defects. On the basis of DeepLab v3+, we propose a semantic segmentation network for the surface defect detection of an automobile wheel hub. To solve the gridding effect of atrous convolution, the high-resolution network (HRNet) is used as the backbone network to extract high-resolution features, and the multi-scale features extracted by the Atrous Spatial Pyramid Pooling (ASPP) of DeepLab v3+ are superimposed. On the basis of the optical flow, we decouple the body and edge features of the defects to accurately detect the boundary of defects. Furthermore, in the upsampling process, a decoder can accurately obtain detection results by fusing the body, edge, and multi-scale features. We use supervised training to optimize these features. Experimental results on four defect datasets (i.e., wheels, magnetic tiles, fabrics, and welds) show that the proposed network has better F1 score, average precision, and intersection over union than SegNet, Unet, and DeepLab v3+, proving that the proposed network is effective for different defect detection scenarios.

Download Full-text

Multi-scale Semantic Segmentation Enriched Features for Pedestrian Detection

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

An Inverse Node Graph-Based Method for the Urban Scene Segmentation of 3D Point Clouds

Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet

Semantic Segmentation for Pedestrian Detection from Motion in Temporal Domain

MLFNet-Point Cloud Semantic Segmentation Convolution Network Based on Multi-scale Feature Fusion

Real-time Semantic Segmentation Based on Multi-scale Feature Map Joint Pyramid Upsamping

Simultaneous Segmentation of Fetal Hearts and Lungs for Medical Ultrasound Images via an Efficient Multi-scale Model Integrated With Attention Mechanism

Multi-scale Context Intertwining for Semantic Segmentation

Multi-scale Spatial Location Preference for Semantic Segmentation

Semantic Segmentation Network for Surface Defect Detection of Automobile Wheel Hub Fusing High-Resolution Feature and Multi-Scale Feature

Export Citation Format