Switchable Deep Network for Pedestrian Detection

Pedestrian detection is the core of the driver assistance system, which collects the road conditions through the radars or cameras on the vehicle, judges whether there is a pedestrian in front of the vehicle, supports decisions such as raising the alarm, automatically slowing down, or emergency stopping to keep pedestrians safe, and improves the security when the vehicle is moving. Suffering from weather, lighting, clothing, large pose variations, and occlusion, the current pedestrian detection still has a certain distance from the practical applications. In recent years, deep networks have shown excellent performance for image detection, recognition, and classification. Some researchers employed deep network for pedestrian detection and achieve great progress, but deep networks need huge computational resources, which make it difficult to put into practical applications. In real scenarios of autonomous vehicles, the computation ability is limited. Thus, the shallow networks such as UDN (Unified Deep Networks) is a better choice, since it performs well while consuming less computation resources. Based on UDN, this paper proposes a new deep network model named two-stream UDN, which augments another branch for solving traditional UDN’s indistinction of the difference between trees/telegraph poles and pedestrians. The new branch accepts the upper third part of the pedestrian image as input, and the partial image has less deformation, stable features, and more distinguished characters from other objects. For the proposed two-stream UDN, multi-input features including the HOG (Histogram of Oriented Gradients) feature, Sobel feature, color feature, and foreground regions extracted by GrabCut segmentation algorithms are fed. Compared with the original input of UDN, the multi-input features are more conducive for pedestrian detection, since the fused HOG features and significant objects are more significant for pedestrian detection. Two-stream UDN is trained through two steps. First, the two sub-networks are trained until converge; then, we fuse results of the two subnets as the final result and feed it back to the two subnets to fine tune network parameters synchronously. To improve the performance, Swish is adopted as the activation function to obtain a faster training speed, and positive samples are mirrored and rotated with small angles to make the positive and negative samples more balanced.

Download Full-text

Part-based deep network for pedestrian detection in surveillance videos

2015 Visual Communications and Image Processing (VCIP) ◽

10.1109/vcip.2015.7457855 ◽

2015 ◽

Cited By ~ 1

Author(s):

Qi Chen ◽

Wenhui Jiang ◽

Yanyun Zhao ◽

Zhicheng Zhao

Keyword(s):

Pedestrian Detection ◽

Surveillance Videos ◽

Deep Network

Download Full-text

PedJointNet: Joint Head-Shoulder and Full Body Deep Network for Pedestrian Detection

IEEE Access ◽

10.1109/access.2019.2910201 ◽

2019 ◽

Vol 7 ◽

pp. 47687-47697

Author(s):

Chih-Yang Lin ◽

Hong-Xia Xie ◽

Hua Zheng

Keyword(s):

Pedestrian Detection ◽

Deep Network ◽

Full Body

Download Full-text

Deep network aided by guiding network for pedestrian detection

Pattern Recognition Letters ◽

10.1016/j.patrec.2017.02.018 ◽

2017 ◽

Vol 90 ◽

pp. 43-49 ◽

Cited By ~ 8

Author(s):

Sang-Il Jung ◽

Ki-Sang Hong

Keyword(s):

Pedestrian Detection ◽

Deep Network

Download Full-text

Pedestrian Detection Based on Two-Stream UDN

10.20944/preprints202001.0029.v1 ◽

2020 ◽

Author(s):

Wentong Wang ◽

Lichun Wang ◽

Xufei Ge ◽

Jinghua Li ◽

Baocai Yin

Keyword(s):

Autonomous Vehicle ◽

Pedestrian Detection ◽

Activation Function ◽

Practical Applications ◽

Deep Network ◽

The Road ◽

Slowing Down ◽

Deep Networks ◽

The Difference ◽

Computational Resources

Pedestrian detection is the core of driver assistance system, which collects the road conditions through the radars or cameras on the vehicle, judges whether there is a pedestrian in front of the vehicle, supports decisions such as raising the alarm, automatically slowing down or emergency stopping to keep pedestrians safe, and improves the security when the vehicle is moving. Suffered from weather, lighting, clothing, large pose variations and occlusion, the current pedestrian detection still has a certain distance from the practical applications. In recent years, deep networks have shown excellent performance for image detection, recognition and classification. Some researchers employed deep network for pedestrian detection and achieve great progress, but deep networks need huge computational resources which make it difficult to put into practical applications. In real scenarios of autonomous vehicle, the computation ability is limited. Thus, the shallow networks such as UDN (Unified Deep Networks) is a better choice since it performs well on consuming less computation resources. Base on UDN, this paper proposes a new deep network model named as two-stream UDN, which augments another branch for solving traditional UDN’s indistinction of the difference between trees / telegraph poles and pedestrians. The new branch accepts the upper third part of the pedestrian image as input, and the partial image has less deformation, stable features and more distinguished characters from other objects. For the proposed two-stream UDN, multi-input features including HOG feature, Sobel feature, color feature and foreground regions extracted by GrabCut segmentation algorithms are fed. Compared with the original input of UDN, the multi-input features are more conducive for pedestrian detection since the fused HOG features and significant objects are more significant for pedestrian detection. Two-stream UDN is trained through two steps: First, the two sub-networks are trained until converge; then we fuse results of the two subnets as the final result and feed it back to the two subnets to fine tune network parameters synchronously. To improve the performance, Softplus is adopted as activation function to obtain faster training speed, and positive samples are mirrored and rotated with small angle to make positive and negative samples more balanced.

Download Full-text