Human head detection based on multi-stage CNN with voting strategy

Author(s):  
Ting Rui ◽  
Jianchao Fei ◽  
You Zhou ◽  
Jian Tang ◽  
Husheng Fang
Electronics ◽  
2021 ◽  
Vol 10 (13) ◽  
pp. 1565
Author(s):  
Junwen Liu ◽  
Yongjun Zhang ◽  
Jianbin Xie ◽  
Yan Wei ◽  
Zewei Wang ◽  
...  

Pedestrian detection for complex scenes suffers from pedestrian occlusion issues, such as occlusions between pedestrians. As well-known, compared with the variability of the human body, the shape of a human head and their shoulders changes minimally and has high stability. Therefore, head detection is an important research area in the field of pedestrian detection. The translational invariance of neural network enables us to design a deep convolutional neural network, which means that, even if the appearance and location of the target changes, it can still be recognized effectively. However, the problems of scale invariance and high miss detection rates for small targets still exist. In this paper, a feature extraction network DR-Net based on Darknet-53 is proposed to improve the information transmission rate between convolutional layers and to extract more semantic information. In addition, the MDC (mixed dilated convolution) with different sampling rates of dilated convolution is embedded to improve the detection rate of small targets. We evaluated our method on three publicly available datasets and achieved excellent results. The AP (Average Precision) value on the Brainwash dataset, HollywoodHeads dataset, and SCUT-HEAD dataset reached 92.1%, 84.8%, and 90% respectively.


Author(s):  
Jian-Chao Fei ◽  
Jun-Hua Zou ◽  
Ting Rui ◽  
You Zhou ◽  
Qi-Yu Gao
Keyword(s):  

2013 ◽  
Vol 427-429 ◽  
pp. 1696-1699
Author(s):  
Xiang Yang Liu ◽  
Shao Song Zhu ◽  
Su Qing Wu ◽  
Zhi Wei Shen

For human head pose analysis based on videos, pose is usually estimated on the head patch provided by a tracking module. However, head tracking is very sensitive to the large changes of pose. Therefore, this work locates the head patch in the videos by head detection. Firstly, we use the Adaboost algorithm to detect the human head in the video. Secondly, we present a dimensionality reduction method to process the head patch. Finally, we use the nearest neighbor method to estimate the head pose. The experiment results show: accurate head detecting helps to estimate the head pose. This method can be used for complex conditions of accurate head pose estimation.


Sensors ◽  
2021 ◽  
Vol 21 (17) ◽  
pp. 5848
Author(s):  
Mohamed Chouai ◽  
Petr Dolezel ◽  
Dominik Stursa ◽  
Zdenek Nemec

In the field of computer vision, object detection consists of automatically finding objects in images by giving their positions. The most common fields of application are safety systems (pedestrian detection, identification of behavior) and control systems. Another important application is head/person detection, which is the primary material for road safety, rescue, surveillance, etc. In this study, we developed a new approach based on two parallel Deeplapv3+ to improve the performance of the person detection system. For the implementation of our semantic segmentation model, a working methodology with two types of ground truths extracted from the bounding boxes given by the original ground truths was established. The approach has been implemented in our two private datasets as well as in a public dataset. To show the performance of the proposed system, a comparative analysis was carried out on two deep learning semantic segmentation state-of-art models: SegNet and U-Net. By achieving 99.14% of global accuracy, the result demonstrated that the developed strategy could be an efficient way to build a deep neural network model for semantic segmentation. This strategy can be used, not only for the detection of the human head but also be applied in several semantic segmentation applications.


Sign in / Sign up

Export Citation Format

Share Document