3D Interacting Hand Pose and Shape Estimation from a Single RGB Image

2021 ◽  
Author(s):  
Chengying Gao ◽  
Yujia Yang ◽  
Wensheng Li
Keyword(s):  
2019 ◽  
Vol 4 (4) ◽  
pp. 4239-4246
Author(s):  
Yafei Gao ◽  
Yida Wang ◽  
Pietro Falco ◽  
Nassir Navab ◽  
Federico Tombari
Keyword(s):  

2020 ◽  
Vol 10 (2) ◽  
pp. 618
Author(s):  
Xianghan Wang ◽  
Jie Jiang ◽  
Yanming Guo ◽  
Lai Kang ◽  
Yingmei Wei ◽  
...  

Precise 3D hand pose estimation can be used to improve the performance of human–computer interaction (HCI). Specifically, computer-vision-based hand pose estimation can make this process more natural. Most traditional computer-vision-based hand pose estimation methods use depth images as the input, which requires complicated and expensive acquisition equipment. Estimation through a single RGB image is more convenient and less expensive. Previous methods based on RGB images utilize only 2D keypoint score maps to recover 3D hand poses but ignore the hand texture features and the underlying spatial information in the RGB image, which leads to a relatively low accuracy. To address this issue, we propose a channel fusion attention mechanism that combines 2D keypoint features and RGB image features at the channel level. In particular, the proposed method replans weights by using cascading RGB images and 2D keypoint features, which enables rational planning and the utilization of various features. Moreover, our method improves the fusion performance of different types of feature maps. Multiple contrast experiments on public datasets demonstrate that the accuracy of our proposed method is comparable to the state-of-the-art accuracy.


2020 ◽  
Vol 1631 ◽  
pp. 012014
Author(s):  
Qi Wu ◽  
Joya Chen ◽  
Zhiming Yao ◽  
Xu Zhou ◽  
Jianguo Wang ◽  
...  

Author(s):  
John Yang ◽  
Hyung Jin Chang ◽  
Seungeui Lee ◽  
Nojun Kwak
Keyword(s):  

2020 ◽  
Vol 2020 ◽  
pp. 1-13
Author(s):  
Shiming Dai ◽  
Wei Liu ◽  
Wenji Yang ◽  
Lili Fan ◽  
Jihao Zhang

3D hand pose estimation can provide basic information about gestures, which has an important significance in the fields of Human-Machine Interaction (HMI) and Virtual Reality (VR). In recent years, 3D hand pose estimation from a single depth image has made great research achievements due to the development of depth cameras. However, 3D hand pose estimation from a single RGB image is still a highly challenging problem. In this work, we propose a novel four-stage cascaded hierarchical CNN (4CHNet), which leverages hierarchical network to decompose hand pose estimation into finger pose estimation and palm pose estimation, extracts separately finger features and palm features, and finally fuses them to estimate 3D hand pose. Compared with direct estimation methods, the hand feature information extracted by the hierarchical network is more representative. Furthermore, concatenating various stages of the network for end-to-end training can make each stage mutually beneficial and progress. The experimental results on two public datasets demonstrate that our 4CHNet can significantly improve the accuracy of 3D hand pose estimation from a single RGB image.


Sign in / Sign up

Export Citation Format

Share Document