Deep Reinforcement Learning for Active Human Pose Estimation

Erik Gärtner; Aleksis Pirinen; Cristian Sminchisescu

doi:10.1609/aaai.v34i07.6714

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6714 ◽

2020 ◽

Vol 34 (07) ◽

pp. 10835-10844

Author(s):

Erik Gärtner ◽

Aleksis Pirinen ◽

Cristian Sminchisescu

Keyword(s):

Reinforcement Learning ◽

Pose Estimation ◽

Estimation Methods ◽

Estimation Accuracy ◽

Human Pose Estimation ◽

Transition Functions ◽

Processing Step ◽

Complex Scenes ◽

Human Pose ◽

3D Human Pose Estimation

Most 3D human pose estimation methods assume that input – be it images of a scene collected from one or several viewpoints, or from a video – is given. Consequently, they focus on estimates leveraging prior knowledge and measurement by fusing information spatially and/or temporally, whenever available. In this paper we address the problem of an active observer with freedom to move and explore the scene spatially – in ‘time-freeze’ mode – and/or temporally, by selecting informative viewpoints that improve its estimation accuracy. Towards this end, we introduce Pose-DRL, a fully trainable deep reinforcement learning-based active pose estimation architecture which learns to select appropriate views, in space and time, to feed an underlying monocular pose estimator. We evaluate our model using single- and multi-target estimators, with strong results in both settings. Our system further learns automatic stopping conditions in time and transition functions to the next temporal processing step in videos. In extensive experiments with the Panoptic multi-view setup, and for complex scenes containing multiple people, we show that our model learns to select viewpoints that yield significantly more accurate pose estimates compared to strong multi-view baselines.

Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A Survey

Electronics ◽

10.3390/electronics10182267 ◽

2021 ◽

Vol 10 (18) ◽

pp. 2267

Author(s):

Dejun Zhang ◽

Yiqi Wu ◽

Mingyue Guo ◽

Yilin Chen

Keyword(s):

Deep Learning ◽

Pose Estimation ◽

Literature Survey ◽

Estimation Methods ◽

Human Pose Estimation ◽

Extensive Literature ◽

Learning Methods ◽

Human Pose ◽

3D Human Pose Estimation ◽

Research Studies

The rise of deep learning technology has broadly promoted the practical application of artificial intelligence in production and daily life. In computer vision, many human-centered applications, such as video surveillance, human-computer interaction, and digital entertainment, rely heavily on accurate and efficient human pose estimation techniques. Inspired by the remarkable achievements in learning-based 2D human pose estimation, numerous research studies are devoted to 3D human pose estimation via deep learning methods. Against this backdrop, this paper provides an extensive survey of recent literature on deep learning methods for 3D human pose estimation, in order to show how these research studies have developed, track the latest research trends, and analyze the characteristics of the types of methods devised. The literature is reviewed, along with the general pipeline of 3D human pose estimation, which consists of human body modeling, learning-based pose estimation, and regularization for refinement. Unlike existing reviews of the same topic, this paper focuses on deep learning-based methods. Learning-based pose estimation is discussed in two categories: single-person and multi-person. Each is further divided by data type into image-based methods and video-based methods. Moreover, because data is so significant for learning-based methods, this paper surveys the 3D human pose estimation methods according to a taxonomy of supervision form. Finally, this paper lists the current, widely used datasets and compares the performance of the reviewed methods. Based on this literature survey, it can be concluded that each branch of 3D human pose estimation starts with fully-supervised methods, and there is still much room for multi-person pose estimation based on other supervision methods from both image and video. Despite the significant development of 3D human pose estimation via deep learning, the inherent ambiguity and occlusion problems remain challenging issues that need to be better addressed.

3D Human Pose Estimation Based on a Fully Connected Neural Network With Adversarial Learning Prior Knowledge

Frontiers in Physics ◽

10.3389/fphy.2021.629288 ◽

2021 ◽

Vol 9 ◽

Author(s):

Lu Meng ◽

Hengshang Gao

Keyword(s):

Prior Knowledge ◽

Pose Estimation ◽

Optical Sensors ◽

Estimation Methods ◽

Generative Adversarial Networks ◽

Human Pose Estimation ◽

Natural Connection ◽

Human Pose ◽

Fully Connected ◽

3D Human Pose Estimation

3D human pose estimation is increasingly used in real-world applications such as sports guidance, limb rehabilitation training, augmented reality, and intelligent security. Most existing human pose estimation methods are designed around an RGB image obtained by a single optical sensor, such as a digital camera. Some prior knowledge is available, such as bone proportions and the angle limits of joint hinge motion. However, the existing methods do not consider the correlation between different joints from multi-view images, and most of them adopt fixed spatial prior constraints, resulting in poor generalization. Therefore, it is essential to build a multi-view image acquisition system using optical sensors and customized algorithms for 3D reconstruction of the human pose in the image. Inspired by generative adversarial networks (GAN), we used a data-driven method to learn the implicit spatial prior information and classified joints according to their natural connection characteristics. To accelerate the proposed method, we proposed a fully connected network with skip connections and used the SMPL model to perform the 3D human body reconstruction. Experimental results showed that, compared with other state-of-the-art methods, the proposed method had the smallest average joint error, indicating the best performance. Moreover, the running time of the proposed method was 1.3 seconds per frame, which may not meet real-time requirements but is still much faster than most existing methods.

Context-Aware Network for 3D Human Pose Estimation from Monocular RGB Image

2019 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn.2019.8852263 ◽

2019 ◽

Author(s):

Binyi Yin ◽

Dongbo Zhang ◽

Shuai Li ◽

Aimin Hao ◽

Hong Qin

Keyword(s):

Pose Estimation ◽

Human Pose Estimation ◽

Context Aware ◽

Human Pose ◽

RGB Image ◽

3D Human Pose Estimation

Deep 3D human pose estimation: A review

Computer Vision and Image Understanding ◽

10.1016/j.cviu.2021.103225 ◽

2021 ◽

pp. 103225

Author(s):

Jinbao Wang ◽

Shujie Tan ◽

Xiantong Zhen ◽

Shuo Xu ◽

Feng Zheng ◽

...

Keyword(s):

Pose Estimation ◽

Human Pose Estimation ◽

Human Pose ◽

3D Human Pose Estimation

Temporally Consistent 3D Human Pose Estimation Using Dual 360° Cameras

2021 IEEE Winter Conference on Applications of Computer Vision (WACV) ◽

10.1109/wacv48630.2021.00013 ◽

2021 ◽

Author(s):

Matthew Shere ◽

Hansung Kim ◽

Adrian Hilton

Keyword(s):

Pose Estimation ◽

Human Pose Estimation ◽

Human Pose ◽

3D Human Pose Estimation

Multi-scale Recalibration with Advanced Geometry Constraints for 3D Human Pose Estimation

2020 IEEE 6th International Conference on Computer and Communications (ICCC) ◽

10.1109/iccc51575.2020.9345270 ◽

2020 ◽

Author(s):

Meng Xiao ◽

Hailun Xia ◽

Ziwei Xie ◽

Chunyan Feng

Keyword(s):

Pose Estimation ◽

Human Pose Estimation ◽

Multi Scale ◽

Human Pose ◽

3D Human Pose Estimation

Demo Abstract: Vision-aided 3D Human Pose Estimation with RFID

2020 16th International Conference on Mobility, Sensing and Networking (MSN) ◽

10.1109/msn50589.2020.00104 ◽

2020 ◽

Author(s):

Chao Yang ◽

Xuyu Wang ◽

Shiwen Mao

Keyword(s):

Pose Estimation ◽

Human Pose Estimation ◽

Human Pose ◽

3D Human Pose Estimation

A survey on monocular 3D human pose estimation

Virtual Reality & Intelligent Hardware ◽

10.1016/j.vrih.2020.04.005 ◽

2020 ◽

Vol 2 (6) ◽

pp. 471-500

Author(s):

Xiaopeng Ji ◽

Qi Fang ◽

Junting Dong ◽

Qing Shuai ◽

Wen Jiang ◽

...

Keyword(s):

Pose Estimation ◽

Human Pose Estimation ◽

Human Pose ◽

3D Human Pose Estimation

Automatic Calibration of the Fisheye Camera for Egocentric 3D Human Pose Estimation from a Single Image

2021 IEEE Winter Conference on Applications of Computer Vision (WACV) ◽

10.1109/wacv48630.2021.00181 ◽

2021 ◽

Author(s):

Yahui Zhang ◽

Shaodi You ◽

Theo Gevers

Keyword(s):

Pose Estimation ◽

Human Pose Estimation ◽

Automatic Calibration ◽

Single Image ◽

Fisheye Camera ◽

Human Pose ◽

3D Human Pose Estimation

Rotational Adjoint Methods for Learning-Free 3D Human Pose Estimation from IMU Data

2020 25th International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr48806.2021.9413050 ◽

2021 ◽

Author(s):

Caterina Buizza ◽

Yiannis Demiris

Keyword(s):

Pose Estimation ◽

Human Pose Estimation ◽

Adjoint Methods ◽

Human Pose ◽

3D Human Pose Estimation
