Network algorithm real-time depth image 3D human recognition for augmented reality
Author(s):  
Renyong Huang ◽  
Mingyi Sun

2019 ◽  
Vol 9 (19) ◽  
pp. 4019 ◽  
Author(s):  
Sung ◽  
Ma ◽  
Choi ◽  
Hong

Physics education applications using augmented reality technology, which has developed extensively in recent years, face many restrictions in performance and accuracy. The purpose of our research is to develop a real-time simulation system for physics education based on parallel processing. In this paper, we present a video see-through AR (Augmented Reality) system that includes an environment recognizer using the depth image of Microsoft’s Kinect V2 and a real-time soft body simulator based on parallel processing on the GPU (Graphics Processing Unit). Soft body simulation can provide more realistic results than rigid body simulation, so it can be more effective in systems for physics education. We have designed and implemented a system that provides the physical deformation and movement of 3D volumetric objects and uses them in education. To verify the usefulness of the proposed system, we conducted a questionnaire survey of 10 students majoring in physics education; 93% of respondents answered that they would like to use it for education. In the future, we plan to improve the system with a stand-alone AR device that includes one or more cameras.
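The paper's GPU soft-body solver is not public; as a rough illustration of the underlying idea, a soft body can be modeled as particles connected by springs and integrated step by step. The sketch below is a minimal CPU version in one dimension, with hypothetical stiffness and damping values, not the authors' implementation:

```python
# Minimal mass-spring soft-body step (1D sketch; real soft-body simulators
# work on 3D volumetric meshes and run many springs in parallel on the GPU).

def step_soft_body(positions, velocities, springs, dt=0.01, k=50.0, damping=0.98):
    """One semi-implicit Euler step. springs: list of (i, j, rest_length)."""
    forces = [0.0] * len(positions)
    for i, j, rest in springs:
        f = k * ((positions[j] - positions[i]) - rest)  # Hooke's law
        forces[i] += f
        forces[j] -= f
    new_v = [(v + f * dt) * damping for v, f in zip(velocities, forces)]
    new_p = [p + v * dt for p, v in zip(positions, new_v)]
    return new_p, new_v

# A spring stretched to twice its rest length relaxes back toward it:
p, v = [0.0, 2.0], [0.0, 0.0]
for _ in range(500):
    p, v = step_soft_body(p, v, [(0, 1, 1.0)])
```

Each spring's force computation is independent of the others, which is exactly what makes this kind of simulation a good fit for GPU parallelism.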


Electronics ◽  
2020 ◽  
Vol 9 (5) ◽  
pp. 836 ◽  
Author(s):  
Young-Hoon Jin ◽  
In-Tae Hwang ◽  
Won-Hyung Lee

Augmented reality (AR) is a useful visualization technology that displays information by adding virtual images to the real world. In AR systems that require three-dimensional information, point cloud data is easy to use after real-time acquisition; however, it is difficult to measure and visualize objects in real time due to the large amount of data and the matching process. In this paper, we explore a method of estimating pipes from point cloud data and visualizing them in real time through augmented reality devices. In general, pipe estimation in a point cloud uses a Hough transform and is performed through preprocessing steps such as noise filtering, normal estimation, or segmentation. However, this has the disadvantage of slow execution due to the large amount of computation. Therefore, for real-time visualization on augmented reality devices, a fast cylinder matching method using random sample consensus (RANSAC) is required. We propose parallel processing, multiple frames, an adjustable scale, and error correction for real-time visualization. Our method obtains a depth image from the sensor and constructs a uniform point cloud using a voxel grid algorithm; the constructed data is then analyzed with the fast RANSAC-based cylinder matching method. With the spread of various AR devices, this real-time visualization method is expected to be used to identify problems, such as the sagging of pipes, through real-time measurements at plant sites.
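The two core ingredients named above, voxel grid downsampling and RANSAC, can be sketched generically. The demo below fits a 2D line rather than a cylinder (a cylinder's minimal sample and residual are more involved), and all names and thresholds are illustrative, not taken from the paper:

```python
import random
from collections import defaultdict

def voxel_grid_downsample(points, voxel_size):
    """Replace all points falling in one voxel by their centroid (uniform cloud)."""
    buckets = defaultdict(list)
    for p in points:
        key = tuple(int(c // voxel_size) for c in p)
        buckets[key].append(p)
    return [tuple(sum(c) / len(pts) for c in zip(*pts)) for pts in buckets.values()]

def ransac(points, fit, residual, n_sample, threshold, iters=200, seed=0):
    """Generic RANSAC: fit models to random minimal samples and keep the one
    with the largest consensus (inlier) set."""
    rng = random.Random(seed)
    best_model, best_inliers = None, []
    for _ in range(iters):
        model = fit(rng.sample(points, n_sample))
        if model is None:
            continue
        inliers = [p for p in points if residual(model, p) < threshold]
        if len(inliers) > len(best_inliers):
            best_model, best_inliers = model, inliers
    return best_model, best_inliers

# Demo with a 2D line model y = a*x + b (the paper fits cylinders instead):
def fit_line(sample):
    (x1, y1), (x2, y2) = sample
    if x1 == x2:
        return None  # vertical line; skip this sample
    a = (y2 - y1) / (x2 - x1)
    return a, y1 - a * x1

def line_residual(model, p):
    a, b = model
    return abs(p[1] - (a * p[0] + b))

pts = [(float(x), 2.0 * x + 1.0) for x in range(10)] + [(3.0, 40.0)]  # one outlier
model, inliers = ransac(pts, fit_line, line_residual, n_sample=2, threshold=0.1)
```

The outlier is excluded from the consensus set, so the recovered model matches the underlying line. Swapping in a cylinder model only changes `fit` and `residual`; the consensus loop, which dominates the runtime, is what the paper accelerates.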


2015 ◽  
Vol 6 (2) ◽  
Author(s):  
Rujianto Eko Saputro ◽  
Dhanar Intan Surya Saputra

Learning media has always followed available technological developments, from print technology, audio-visual media, and computers to technology combining print and computers. Today, learning media that combines print and computer technology can be realized with Augmented Reality (AR). Augmented Reality (AR) is a technology used to bring the virtual world into the real world in real time. The human digestive organs consist of the mouth, esophagus, stomach, small intestine, and large intestine. Current learning media for introducing the human digestive organs are very monotonous: pictures, books, or other projection aids. Using Augmented Reality, which can bring the virtual world into the real world, these objects can be turned into 3D objects, so the learning method is no longer monotonous and children are motivated to learn more, such as the name and description of each organ.


2018 ◽  
Author(s):  
Kyle Plunkett

This manuscript provides two demonstrations of how Augmented Reality (AR), the projection of virtual information onto a real-world object, can be applied in the classroom and in the laboratory. Using only a smartphone and the free HP Reveal app, content-rich AR notecards were prepared. The physical notecards are based on Organic Chemistry I reactions and show only a reagent and substrate. Upon interacting with the HP Reveal app, an AR video projection shows the product of the reaction as well as a real-time, hand-drawn curved-arrow mechanism of how the product is formed. Thirty AR notecards based on common Organic Chemistry I reactions and mechanisms are provided in the Supporting Information and are available for widespread use. In addition, the HP Reveal app was used to create AR video projections onto laboratory instrumentation so that a virtual expert can guide the user during equipment setup and operation.


2021 ◽  
Vol 40 (3) ◽  
pp. 1-12
Author(s):  
Hao Zhang ◽  
Yuxiao Zhou ◽  
Yifei Tian ◽  
Jun-Hai Yong ◽  
Feng Xu

Reconstructing hand-object interactions is a challenging task due to strong occlusions and complex motions. This article proposes a real-time system that uses a single depth stream to simultaneously reconstruct hand poses, object shape, and rigid/non-rigid motions. To achieve this, we first train a joint learning network to segment the hand and object in a depth image and to predict the 3D keypoints of the hand. With most layers shared by the two tasks, computation cost is saved, preserving real-time performance. A hybrid dataset is constructed to train the network with real data (to learn real-world distributions) and synthetic data (to cover variations of objects, motions, and viewpoints). Next, the depths of the two targets and the keypoints are used in a unified optimization to reconstruct the interacting motions. Benefiting from a novel tangential contact constraint, the system not only resolves the remaining ambiguities but also maintains real-time performance. Experiments show that our system handles different hand and object shapes, various interactive motions, and moving cameras.
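The compute saving from sharing layers between the segmentation and keypoint tasks can be illustrated with a toy multi-task structure. The "backbone" and both heads below are hand-written stand-ins (the paper's network is a CNN; these names and features are purely hypothetical), showing only that the shared part runs once per frame and both heads reuse its output:

```python
# Toy shared-backbone / two-head structure (illustrative only).

def backbone(depth):
    """Stand-in for the shared layers: pooled statistics of the depth patch."""
    flat = [v for row in depth for v in row]
    return {"mean": sum(flat) / len(flat), "max": max(flat)}

def segment_head(feat, depth):
    """Label pixels nearer than the mean depth as foreground (hand/object)."""
    return [[1 if v < feat["mean"] else 0 for v in row] for row in depth]

def keypoint_head(feat):
    """Predict a single pseudo-keypoint depth from the shared feature."""
    return 0.5 * (feat["mean"] + feat["max"])

depth = [[0.4, 0.5, 2.0],
         [0.45, 0.5, 2.1],
         [2.0, 2.2, 2.3]]
feat = backbone(depth)          # shared computation, run once
mask = segment_head(feat, depth)
kp_depth = keypoint_head(feat)
```

Running the backbone once and branching at the heads is the standard way multi-task networks amortize cost; the paper applies the same structure to its segmentation and 3D-keypoint tasks.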


2021 ◽  
Vol 11 (11) ◽  
pp. 4758
Author(s):  
Ana Malta ◽  
Mateus Mendes ◽  
Torres Farinha

Maintenance professionals and other technical staff regularly need to learn to identify new parts in car engines and other equipment. The present work proposes a model of a task assistant based on a deep learning neural network. A YOLOv5 network is used to recognize some of the constituent parts of an automobile. A dataset of car engine images was created, and eight car parts were annotated in the images. The neural network was then trained to detect each part. The results show that YOLOv5s successfully detects the parts in real-time video streams with high accuracy, making it useful as an aid for training professionals to deal with new equipment using augmented reality. The architecture of an object recognition system using augmented reality glasses is also designed.
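Detectors in the YOLO family emit many overlapping candidate boxes per object, which are reduced by non-maximum suppression (NMS). The sketch below shows that post-processing step in plain Python; the box coordinates, scores, and part names are invented for illustration, not from the paper's dataset:

```python
def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0

def nms(detections, iou_thresh=0.5):
    """Greedy NMS: keep the highest-scoring box of each overlapping cluster.
    detections: list of (box, score, class_name)."""
    kept = []
    for det in sorted(detections, key=lambda d: -d[1]):
        if all(iou(det[0], k[0]) < iou_thresh for k in kept):
            kept.append(det)
    return kept

dets = [((0, 0, 10, 10), 0.9, "alternator"),
        ((1, 1, 11, 11), 0.6, "alternator"),   # duplicate of the first box
        ((50, 50, 60, 60), 0.8, "oil_filter")]
kept = nms(dets)
```

The duplicate alternator box is suppressed because its IoU with the higher-scoring box exceeds the threshold, leaving one detection per part.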


Entropy ◽  
2021 ◽  
Vol 23 (5) ◽  
pp. 546
Author(s):  
Zhenni Li ◽  
Haoyi Sun ◽  
Yuliang Gao ◽  
Jiao Wang

Depth maps obtained from sensors are often unsatisfactory because of their low resolution and noise. In this paper, we propose a real-time depth map enhancement system based on a residual network. It uses dual channels to process depth maps and intensity maps respectively, eliminates the preprocessing stage, and achieves real-time processing at more than 30 fps. Furthermore, an FPGA design and implementation for depth sensing is introduced, in which the intensity image and depth image are captured by a dual-camera synchronous acquisition system as the input to the neural network. Experiments on various depth map restoration tasks show that our algorithm outperforms the existing LRMC, DE-CNN, and DDTF algorithms on standard datasets and achieves better depth map super-resolution. System tests confirm that the data throughput of the acquisition system's USB 3.0 interface is stable at 226 Mbps and supports both cameras working at full speed, i.e., 54 fps @ (1280 × 960 + 328 × 248 × 3).
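The residual, intensity-guided formulation can be illustrated without a trained network: the output is the input depth plus a small correction, where the correction weights neighboring depths by intensity similarity. The hand-written smoothing term below is a stand-in for the learned residual branch, with invented parameter values:

```python
def enhance_depth(depth, intensity, alpha=0.25):
    """Residual-style refinement sketch: out = depth + alpha * correction,
    where the correction averages neighbor differences, weighted so that
    neighbors with similar intensity (likely the same surface) count more."""
    h, w = len(depth), len(depth[0])
    out = [row[:] for row in depth]
    for y in range(h):
        for x in range(w):
            acc, wsum = 0.0, 0.0
            for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                if 0 <= ny < h and 0 <= nx < w:
                    wgt = 1.0 / (1.0 + abs(intensity[ny][nx] - intensity[y][x]))
                    acc += wgt * (depth[ny][nx] - depth[y][x])
                    wsum += wgt
            if wsum:
                out[y][x] = depth[y][x] + alpha * acc / wsum  # skip connection
    return out

# A noisy depth patch over a uniform-intensity surface gets smoothed:
noisy = [[1.0, 2.0],
         [1.0, 1.0]]
flat_intensity = [[0.0, 0.0],
                  [0.0, 0.0]]
refined = enhance_depth(noisy, flat_intensity)
```

Because the identity path carries the input depth through unchanged, the branch only has to model the (small) correction, which is the usual motivation for residual formulations.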

