scholarly journals Full-resolution encoder-decoder networks with multi-scale feature fusion for human pose estimation

Author(s):  
Jie Ou ◽  
Mingjian Chen ◽  
Hong Wu
2020 ◽  
Vol 31 (7-8) ◽  
Author(s):  
Rui Wang ◽  
Jiangwei Tong ◽  
Xiangyang Wang

IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 71158-71166 ◽  
Author(s):  
Rui Wang ◽  
Zhongzheng Cao ◽  
Xiangyang Wang ◽  
Zhi Liu ◽  
Xiaoqiang Zhu

2019 ◽  
Vol 16 (04) ◽  
pp. 1941003
Author(s):  
Chunsheng Guo ◽  
Jialuo Zhou ◽  
Wenlong Du ◽  
Xuguang Zhang

Human pose estimation is a fundamental but challenging task in computer vision. The estimation of human pose mainly depends on the global information of the keypoint type and the local information of the keypoint location. However, the consistency of the cascading process makes it difficult for each stacking network to form a differentiation and collaboration mechanism. In order to solve these problems, this paper introduces a new human pose estimation framework called Multi-Scale Collaborative (MSC) network. The pre-processing network forms feature maps of different sizes, and dispatches them to various locations of the stack network, with small-scale features reaching the front-end stacking network and large-scale features reaching the back-end stacking network. A new loss function is proposed for MSC network. Different keypoints have different weight coefficients of loss function at different scales, and the keypoint weight coefficients are dynamically adjusted from the top hourglass network to the bottom hourglass network. Experimental results show that the proposed method is competitive in MPII and LSP challenge leaderboard among the state-of-the-art methods.


Sign in / Sign up

Export Citation Format

Share Document