CNN Architectures for Human Pose Estimation from a Very Low Resolution Depth Image

Author(s):  
Piotr Szczuko
2018 ◽  
Vol 12 (6) ◽  
pp. 919-924 ◽  
Author(s):  
Qingqiang Wu ◽  
Guanghua Xu ◽  
Min Li ◽  
Longting Chen ◽  
Xin Zhang ◽  
...  

2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Haiquan Wang ◽  
Xiangyang Wang ◽  
Yijie Shi ◽  
Yanping Li ◽  
Chunhua Qian ◽  
...  

Currently, human pose estimation (HPE) methods mainly rely on the design framework of Convolutional Neural Networks (CNNs). These CNNs typically consist of high-to-low-resolution subnetworks (encoder) to learn semantic information and low-to-high subnetworks (decoder) to raise the resolution for keypoint localization. Because too low-resolution feature maps in encoder will inevitably lose some spatial information, which cannot be recovered in the upsampling stages, keeping high spatial resolution features is critical for human pose estimation. On the other hand, due to scale variation of human body parts, multiscale features are also very important for human pose estimation. In this paper, a novel backbone network is proposed specifically for HPE, named High Spatial Resolution and Multiscale Networks (HSR-MSNet), which maintain high spatial resolution features in deeper layers of the encoder and meanwhile construct multiscale features within one single residual block via subgroup splitting and fusion of feature maps. Experiments show that our approach outperforms other state-of-the-art methods with more accurate keypoint locations on COCO dataset.


2011 ◽  
Vol 33 (6) ◽  
pp. 1413-1419
Author(s):  
Yan-chao Su ◽  
Hai-zhou Ai ◽  
Shi-hong Lao

Sign in / Sign up

Export Citation Format

Share Document