3D Human Skeleton Keypoint Detection Using RGB and Depth Image

Jeongseok Jeong; Byeongjun Park; Kyoungro Yoon

doi:10.5370/kiee.2021.70.9.1354

A 2.5D Thinning Algorithm for Human Skeleton Extraction from a Single Depth Image

2019 Chinese Automation Congress (CAC) ◽

10.1109/cac48633.2019.8996274 ◽

2019 ◽

Author(s):

Yang Zhao ◽

Jing He ◽

Hong Cheng ◽

Zicheng Liu

Keyword(s):

Depth Image ◽

Skeleton Extraction ◽

Human Skeleton ◽

Thinning Algorithm

Get full-text (via PubEx)

Using depth image processing and human skeleton identification methods to reduce uncomfortable light from a digital projector

2012 IEEE 16th International Symposium on Consumer Electronics ◽

10.1109/isce.2012.6241748 ◽

2012 ◽

Cited By ~ 3

Author(s):

Ying-Wen Bai ◽

Ta-Wei Shen ◽

Cheng-Hung Tsai

Keyword(s):

Image Processing ◽

Depth Image ◽

Human Skeleton ◽

Identification Methods

Get full-text (via PubEx)

Formation geological depth image according to refraction and reflection marine seismic data

Geofizicheskiy Zhurnal ◽

10.24028/gzh.0203-3100.v39i6.2017.116375 ◽

2017 ◽

Vol 39 (6) ◽

pp. 106-121

Author(s):

A. O. Verpahovskaya ◽

V. N. Pilipenko ◽

Е. V. Pylypenko

Keyword(s):

Seismic Data ◽

Depth Image ◽

Marine Seismic

Get full-text (via PubEx)

Consonant Classification in Mandarin Based on the Depth Image Feature: A Pilot Study

10.21437/interspeech.2019-1893 ◽

2019 ◽

Author(s):

Han-Chi Hsieh ◽

Wei-Zhong Zheng ◽

Ko-Chiang Chen ◽

Ying-Hui Lai

Keyword(s):

Pilot Study ◽

Image Feature ◽

Depth Image

Get full-text (via PubEx)

Age markers in the human skeleton. Edited by Mehmet Yasar İşcan. Springfield, IL: Charles C Thomas. 1989. xii + 359 pp., figures, tables, index. $64.75 (cloth)

American Journal of Physical Anthropology ◽

10.1002/ajpa.1330830411 ◽

1990 ◽

Vol 83 (4) ◽

pp. 501-502

Author(s):

Michael R. Zimmerman

Keyword(s):

Human Skeleton

Get full-text (via PubEx)

Single Depth View Based Real-Time Reconstruction of Hand-Object Interactions

ACM Transactions on Graphics ◽

10.1145/3451341 ◽

2021 ◽

Vol 40 (3) ◽

pp. 1-12

Author(s):

Hao Zhang ◽

Yuxiao Zhou ◽

Yifei Tian ◽

Jun-Hai Yong ◽

Feng Xu

Keyword(s):

Real Time ◽

Synthetic Data ◽

Real Data ◽

Depth Image ◽

Real Time System ◽

The Real ◽

Time Performance ◽

Contact Constraint ◽

Object Shapes ◽

Object Interactions

Reconstructing hand-object interactions is a challenging task due to strong occlusions and complex motions. This article proposes a real-time system that uses a single depth stream to simultaneously reconstruct hand poses, object shape, and rigid/non-rigid motions. To achieve this, we first train a joint learning network to segment the hand and object in a depth image, and to predict the 3D keypoints of the hand. With most layers shared by the two tasks, computation cost is saved for the real-time performance. A hybrid dataset is constructed here to train the network with real data (to learn real-world distributions) and synthetic data (to cover variations of objects, motions, and viewpoints). Next, the depth of the two targets and the keypoints are used in a uniform optimization to reconstruct the interacting motions. Benefitting from a novel tangential contact constraint, the system not only solves the remaining ambiguities but also keeps the real-time performance. Experiments show that our system handles different hand and object shapes, various interactive motions, and moving cameras.

Get full-text (via PubEx)

Deep Learning for Transient Image Reconstruction from ToF Data

Sensors ◽

10.3390/s21061962 ◽

2021 ◽

Vol 21 (6) ◽

pp. 1962

Author(s):

Enrico Buratto ◽

Adriano Simonetto ◽

Gianluca Agresti ◽

Henrik Schäfer ◽

Pietro Zanuttigh

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Light Response ◽

Real Data ◽

Depth Image ◽

Learning Approach ◽

Multiple Reflections ◽

Noisy Input ◽

Novel Approach ◽

Incoming Light

In this work, we propose a novel approach for correcting multi-path interference (MPI) in Time-of-Flight (ToF) cameras by estimating the direct and global components of the incoming light. MPI is an error source linked to the multiple reflections of light inside a scene; each sensor pixel receives information coming from different light paths which generally leads to an overestimation of the depth. We introduce a novel deep learning approach, which estimates the structure of the time-dependent scene impulse response and from it recovers a depth image with a reduced amount of MPI. The model consists of two main blocks: a predictive model that learns a compact encoded representation of the backscattering vector from the noisy input data and a fixed backscattering model which translates the encoded representation into the high dimensional light response. Experimental results on real data show the effectiveness of the proposed approach, which reaches state-of-the-art performances.

Get full-text (via PubEx)

Human Skeleton Graph Attention Convolutional for Video Action Recognition

2020 5th International Conference on Information Science, Computer Technology and Transportation (ISCTT) ◽

10.1109/isctt51595.2020.00040 ◽

2020 ◽

Author(s):

Deyuan Zhang ◽

Hongwei Gao ◽

Hailong Dai ◽

Xiangbin Shi

Keyword(s):

Action Recognition ◽

Human Skeleton ◽

Skeleton Graph

Get full-text (via PubEx)

Autonomous Identification and Positioning of Trucks during Collaborative Forage Harvesting

Sensors ◽

10.3390/s21041166 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1166

Author(s):

Wei Zhang ◽

Liang Gong ◽

Suyue Chen ◽

Wenjie Wang ◽

Zhonghua Miao ◽

...

Keyword(s):

Field Experiments ◽

Absolute Error ◽

Identification Accuracy ◽

Depth Image ◽

Dynamic Identification ◽

Random Sample Consensus ◽

Harvesting Efficiency ◽

Value Decomposition ◽

Forage Harvester ◽

Ransac Algorithm

In the process of collaborative operation, the unloading automation of the forage harvester is of great significance to improve harvesting efficiency and reduce labor intensity. However, non-standard transport trucks and unstructured field environments make it extremely difficult to identify and properly position loading containers. In this paper, a global model with three coordinate systems is established to describe a collaborative harvesting system. Then, a method based on depth perception is proposed to dynamically identify and position the truck container, including data preprocessing, point cloud pose transformation based on the singular value decomposition (SVD) algorithm, segmentation and projection of the upper edge, edge lines extraction and corner points positioning based on the Random Sample Consensus (RANSAC) algorithm, and fusion and visualization of results on the depth image. Finally, the effectiveness of the proposed method has been verified by field experiments with different trucks. The results demonstrated that the identification accuracy of the container region is about 90%, and the absolute error of center point positioning is less than 100 mm. The proposed method is robust to containers with different appearances and provided a methodological reference for dynamic identification and positioning of containers in forage harvesting.

Get full-text (via PubEx)

Iranian kinect face database (IKFDB): a color-depth based face database collected by kinect v.2 sensor

SN Applied Sciences ◽

10.1007/s42452-020-03999-y ◽

2021 ◽

Vol 3 (1) ◽

Author(s):

Seyed Muhammad Hossein Mousavi ◽

S. Younes Mirinezhad

Keyword(s):

Neural Network ◽

Facial Expression ◽

Facial Expression Recognition ◽

Depth Image ◽

Sensor Technology ◽

Support Vector ◽

Expression Recognition ◽

Face Database ◽

Depth Data ◽

Color Depth

AbstractThis study presents a new color-depth based face database gathered from different genders and age ranges from Iranian subjects. Using suitable databases, it is possible to validate and assess available methods in different research fields. This database has application in different fields such as face recognition, age estimation and Facial Expression Recognition and Facial Micro Expressions Recognition. Image databases based on their size and resolution are mostly large. Color images usually consist of three channels namely Red, Green and Blue. But in the last decade, another aspect of image type has emerged, named “depth image”. Depth images are used in calculating range and distance between objects and the sensor. Depending on the depth sensor technology, it is possible to acquire range data differently. Kinect sensor version 2 is capable of acquiring color and depth data simultaneously. Facial expression recognition is an important field in image processing, which has multiple uses from animation to psychology. Currently, there is a few numbers of color-depth (RGB-D) facial micro expressions recognition databases existing. With adding depth data to color data, the accuracy of final recognition will be increased. Due to the shortage of color-depth based facial expression databases and some weakness in available ones, a new and almost perfect RGB-D face database is presented in this paper, covering Middle-Eastern face type. In the validation section, the database will be compared with some famous benchmark face databases. For evaluation, Histogram Oriented Gradients features are extracted, and classification algorithms such as Support Vector Machine, Multi-Layer Neural Network and a deep learning method, called Convolutional Neural Network or are employed. The results are so promising.

Get full-text (via PubEx)