Deep convolution network for dense crowd counting

2020 ◽  
Vol 14 (4) ◽  
pp. 621-627
Author(s):  
Wei Zhang ◽  
Yongjie Wang ◽  
Yanyan Liu ◽  
Jianghua Zhu
Author(s):  
Victor Hugo Roldao Reis ◽  
Silvio Jamil F. Guimaraes ◽  
Zenilton Kleber Goncalves do Patrocinio
Keyword(s):  

2020 ◽  
Vol 79 (25-26) ◽  
pp. 17837-17858 ◽  
Author(s):  
Anurag Pandey ◽  
Mayank Pandey ◽  
Navjot Singh ◽  
Abha Trivedi
Keyword(s):  

IEEE Access ◽  
2021 ◽  
Vol 9 ◽  
pp. 136032-136044
Author(s):  
Liangjun Huang ◽  
Luning Zhu ◽  
Shihui Shen ◽  
Qing Zhang ◽  
Jianwei Zhang

Optik ◽  
2015 ◽  
Vol 126 (1) ◽  
pp. 123-130 ◽  
Author(s):  
Xuemin Hu ◽  
Hong Zheng ◽  
Yuzhang Chen ◽  
Long Chen

Author(s):  
Yaocong Hu ◽  
Huan Chang ◽  
Fudong Nian ◽  
Yan Wang ◽  
Teng Li

Author(s):  
Greg Olmschenk ◽  
Xuan Wang ◽  
Hao Tang ◽  
Zhigang Zhu

Gatherings of thousands to millions of people frequently occur for an enormous variety of educational, social, sporting, and political events, and automated counting of these high-density crowds is useful for safety, management, and measuring significance of an event. In this work, we show that the regularly accepted labeling scheme of crowd density maps for training deep neural networks may not be the most effective one. We propose an alternative inverse k-nearest neighbor (i[Formula: see text]NN) map mechanism that, even when used directly in existing state-of-the-art network structures, shows superior performance. We also provide new network architecture mechanisms that we demonstrate in our own MUD-i[Formula: see text]NN network architecture, which uses multi-scale drop-in replacement upsampling via transposed convolutions to take full advantage of the provided i[Formula: see text]NN labeling. This upsampling combined with the i[Formula: see text]NN maps further improves crowd counting accuracy. We further analyze several variations of the i[Formula: see text]NN labeling mechanism, which apply transformations on the [Formula: see text]NN measure before generating the map, in order to consider the impact of camera perspective views, image resolutions, and the changing rates of the mapping functions. To alleviate the effects of crowd density changes in each image, we also introduce an attenuation mechanism in the i[Formula: see text]NN mapping. Experimentally, we show that inverse square root [Formula: see text]NN map variation (iR[Formula: see text]NN) provides the best performance. Discussions are provided on computational complexity, label resolutions, the gains in mapping and upsampling, and details of critical cases such as various crowd counts, uneven crowd densities, and crowd occlusions.


Sign in / Sign up

Export Citation Format

Share Document