Scene Reconstruction from a Single Depth Image Using 3D CNN

Author(s): Alessandro Palla, David Moloney, Luca Fanucci
2017 ◽ Vol 39 (6) ◽ pp. 106-121

Author(s): A. O. Verpahovskaya, V. N. Pilipenko, E. V. Pylypenko
2020 ◽ Vol 25 (3) ◽ pp. 265-276

Author(s): K.M. Shepilova, A.V. Sotnikov, A.V. Shipatov, Yu.V. Savchenko, ...
2012 ◽ Vol 38 (9) ◽ pp. 1428

Author(s): Xin LIU, Feng-Mei SUN, Zhan-Yi HU
2019

Author(s): Han-Chi Hsieh, Wei-Zhong Zheng, Ko-Chiang Chen, Ying-Hui Lai
2021 ◽ Vol 40 (3) ◽ pp. 1-12
Author(s): Hao Zhang, Yuxiao Zhou, Yifei Tian, Jun-Hai Yong, Feng Xu

Reconstructing hand-object interactions is a challenging task due to strong occlusions and complex motions. This article proposes a real-time system that uses a single depth stream to simultaneously reconstruct hand poses, object shape, and rigid/non-rigid motions. To achieve this, we first train a joint learning network that segments the hand and object in a depth image and predicts the 3D keypoints of the hand. Because most layers are shared by the two tasks, computational cost is reduced, which supports real-time performance. A hybrid dataset is constructed to train the network, combining real data (to learn real-world distributions) with synthetic data (to cover variations of objects, motions, and viewpoints). Next, the depth of the two targets and the predicted keypoints are used in a unified optimization to reconstruct the interacting motions. Benefiting from a novel tangential contact constraint, the system not only resolves the remaining ambiguities but also maintains real-time performance. Experiments show that our system handles different hand and object shapes, various interactive motions, and moving cameras.
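
The multi-task design described above can be illustrated with a minimal PyTorch sketch: a shared encoder feeds two lightweight heads, one for per-pixel hand/object segmentation and one for 3D hand-keypoint regression. The layer widths, the 21-keypoint hand model, and the three segmentation classes (background/hand/object) are illustrative assumptions, not the authors' published architecture.

```python
import torch
import torch.nn as nn

class JointSegKeypointNet(nn.Module):
    def __init__(self, num_classes=3, num_keypoints=21):
        super().__init__()
        self.num_keypoints = num_keypoints
        # Shared encoder: most layers are reused by both tasks, which is
        # what saves computation for real-time inference.
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
        )
        # Head 1: per-pixel segmentation logits, upsampled to input size.
        self.seg_head = nn.Sequential(
            nn.Conv2d(128, num_classes, 1),
            nn.Upsample(scale_factor=8, mode="bilinear", align_corners=False),
        )
        # Head 2: regress (x, y, z) for each hand keypoint.
        self.kp_head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(128, num_keypoints * 3),
        )

    def forward(self, depth):
        feat = self.encoder(depth)                       # shared features
        seg = self.seg_head(feat)                        # (B, 3, H, W)
        kp = self.kp_head(feat).view(-1, self.num_keypoints, 3)
        return seg, kp

# Joint training step on a mixed real/synthetic batch (dummy tensors here).
net = JointSegKeypointNet()
depth = torch.randn(2, 1, 128, 128)          # depth images
seg_gt = torch.randint(0, 3, (2, 128, 128))  # segmentation labels
kp_gt = torch.randn(2, 21, 3)                # 3D keypoints
seg, kp = net(depth)
loss = nn.CrossEntropyLoss()(seg, seg_gt) + nn.MSELoss()(kp, kp_gt)
loss.backward()
```

Summing the two task losses trains both heads against the shared encoder in one backward pass, which is the usual way such joint networks amortize feature computation across tasks.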


Author(s): Alexander Bigalke, Lasse Hansen, Jasper Diesel, Mattias P. Heinrich

Abstract

Purpose: Body weight is a crucial parameter for patient-specific treatments, particularly in the context of proper drug dosage. Contactless weight estimation from visual sensor data constitutes a promising approach to overcome challenges arising in emergency situations. Machine learning-based methods have recently been shown to perform accurate weight estimation from point cloud data. The proposed methods, however, are designed for controlled conditions in terms of visibility and position of the patient, which limits their practical applicability. In this work, we aim to decouple accurate weight estimation from such specific conditions by predicting the weight of covered patients from voxelized point cloud data.

Methods: We propose a novel deep learning framework comprising two 3D CNN modules that solve the given task in two separate steps. First, we train a 3D U-Net to virtually uncover the patient, i.e., to predict the patient's volumetric surface without a cover. Second, the patient's weight is predicted from this 3D volume by means of a 3D CNN architecture optimized for weight regression.

Results: We evaluate our approach on a lying-pose dataset (SLP) under two different cover conditions. The proposed framework improves on the baseline model by up to 16% and reduces the gap between the accuracy of weight estimates for covered and uncovered patients by up to 52%.

Conclusion: We present a novel pipeline to estimate the weight of patients who are covered by a blanket. Our approach relaxes the specific conditions that were required for accurate weight estimates by previous contactless methods and thus constitutes an important step towards fully automatic weight estimation in clinical practice.
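
The two-step pipeline can be sketched in PyTorch as follows: a small 3D encoder-decoder ("uncovering" network) maps the voxelized point cloud of a covered patient to an uncovered volume, and a second 3D CNN regresses body weight from that volume. The channel widths, the 64^3 voxel grid, and the single skip connection are simplifying assumptions; the paper's 3D U-Net is substantially deeper.

```python
import torch
import torch.nn as nn

class UncoverNet(nn.Module):
    """Step 1: predict the uncovered volumetric surface from a covered volume."""
    def __init__(self):
        super().__init__()
        self.down = nn.Sequential(
            nn.Conv3d(1, 16, 3, stride=2, padding=1), nn.ReLU())
        self.bottleneck = nn.Sequential(
            nn.Conv3d(16, 16, 3, padding=1), nn.ReLU())
        self.up = nn.Sequential(
            nn.ConvTranspose3d(32, 16, 2, stride=2), nn.ReLU(),
            nn.Conv3d(16, 1, 1))  # per-voxel occupancy logits

    def forward(self, covered):
        d = self.down(covered)
        b = self.bottleneck(d)
        return self.up(torch.cat([d, b], dim=1))  # U-Net-style skip connection

class WeightRegressor(nn.Module):
    """Step 2: regress body weight (kg) from the uncovered volume."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv3d(1, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv3d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool3d(1), nn.Flatten(),
            nn.Linear(32, 1))

    def forward(self, volume):
        return self.net(volume)

covered = torch.randn(2, 1, 64, 64, 64)               # voxelized point clouds
uncovered = UncoverNet()(covered)                      # virtual uncovering
weight = WeightRegressor()(torch.sigmoid(uncovered))   # weight regression
print(weight.shape)                                    # (2, 1) predicted weights
```

Decoupling the two modules means the regressor always sees a cover-free volume, which is the mechanism by which the approach narrows the accuracy gap between covered and uncovered patients.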

