WIRE STRUCTURE IMAGE-BASED 3D RECONSTRUCTION AIDED BY DEEP LEARNING

Abstract. Objects and structures realized by connecting and bending wires are common in modern architecture, furniture design, metal sculpting, etc. The 3D reconstruction of such objects with traditional range- or image-based methods is very difficult and poses challenges due to their unique characteristics such as repeated structures, slim elements, holes, lack of features, self-occlusions, etc. Complete 3D models of such complex structures are normally reconstructed with lots of manual intervention as automated processes fail in providing detailed and accurate 3D reconstruction results.This paper presents the image-based 3D reconstruction of the Shukhov hyperboloid tower in Moscow, a wire structure built in 1922, composed of a series of hyperboloid sections stacked one to another to approximate an overall conical shape. A deep learning approach for image segmentation was developed in order to robustly detect wire structures in images and provide the basis for accurate corresponding problem solutions. The developed WireNet convolution neural network (CNN) model has been used to aid the multi-view stereo (MVS) process and to improve robustness and accuracy of the image-based 3D reconstruction approach, otherwise not feasible without masking the images automatically.

Download Full-text

Exploring Techniques for Photo-realistic Image Generation from 3D Models - A Deep Learning Approach

10.1109/mysurucon52639.2021.9641645 ◽

2021 ◽

Author(s):

Pranoy Ghosh ◽

Krithika M Pai ◽

Manohara Pai M M ◽

Ujjwal Verma ◽

Frederic Rivet ◽

...

Keyword(s):

Deep Learning ◽

3D Models ◽

Learning Approach ◽

Image Generation ◽

Realistic Image

Download Full-text

A fully end-to-end deep learning approach for real-time simultaneous 3D reconstruction and material recognition

2017 18th International Conference on Advanced Robotics (ICAR) ◽

10.1109/icar.2017.8023499 ◽

2017 ◽

Cited By ~ 6

Author(s):

Cheng Zhao ◽

Li Sun ◽

Rustam Stolkin

Keyword(s):

Deep Learning ◽

3D Reconstruction ◽

Real Time ◽

Learning Approach ◽

Material Recognition ◽

End To End

Download Full-text

Deep Learning Approach to Point Cloud Scene Understanding for Automated Scan to 3D Reconstruction

Journal of Computing in Civil Engineering ◽

10.1061/(asce)cp.1943-5487.0000842 ◽

2019 ◽

Vol 33 (4) ◽

pp. 04019027 ◽

Cited By ~ 14

Author(s):

Jingdao Chen ◽

Zsolt Kira ◽

Yong K. Cho

Keyword(s):

Deep Learning ◽

3D Reconstruction ◽

Point Cloud ◽

Scene Understanding ◽

Learning Approach

Download Full-text

A Deep Learning Approach to the Classification of 3D Models under BIM Environment

International Journal of Control and Automation ◽

10.14257/ijca.2016.9.7.17 ◽

2016 ◽

Vol 9 (7) ◽

pp. 179-188 ◽

Cited By ~ 1

Author(s):

Li Wang ◽

Zhikai Zhao ◽

Xuefeng Wu

Keyword(s):

Deep Learning ◽

3D Models ◽

Learning Approach

Download Full-text

MACHINE LEARNING FOR APPROXIMATING UNKNOWN FACE

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b2-2020-857-2020 ◽

2020 ◽

Vol XLIII-B2-2020 ◽

pp. 857-862 ◽

Cited By ~ 1

Author(s):

V. A. Knyaz ◽

V. V. Kniaz ◽

M. M. Novikov ◽

R. M. Galeev

Keyword(s):

Deep Learning ◽

3D Model ◽

3D Models ◽

Learning Approach ◽

Neural Network Training ◽

Adversarial Learning ◽

Network Training ◽

Learning Techniques ◽

Facial Approximation ◽

Automated Technique

Abstract. The problem of facial appearance reconstruction (or facial approximation) basing on a skull is very important as for anthropology and archaeology as for forensics. Recent progress in optical 3D measurements allowed to substitute manual facial reconstruction techniques with computer-aided ones based on digital skull 3D models. Growing amount of data and developing methods for data processing provide a background for creating fully automated technique of face approximation.The performed study addressed to a problem of facial approximation based on skull digital 3D model with deep learning techniques. The skull 3D models used for appearance reconstruction are generated by the original photogrammetric system in automated mode. These 3D models are then used as input for the algorithm for face appearance reconstruction. The paper presents a deep learning approach for facial approximation basing on a skull. It exploits the generative adversarial learning for transition data from one modality (skull) to another modality (face) using digital skull 3D models and face 3D models. A special dataset containing skull 3D models and face 3D models has been collected and adapted for convolutional neural network training and testing. Evaluation results on testing part of the dataset demonstrates high potential of the developed approach in facial approximation.

Download Full-text

Comparison of various Activation Functions A Deep Learning Approach

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i3.122126 ◽

2018 ◽

Vol 6 (3) ◽

pp. 122-126

Author(s):

Mohammed Ibrahim Khan ◽

◽

Akansha Singh ◽

Anand Handa ◽

◽

...

Keyword(s):

Deep Learning ◽

Learning Approach ◽

Activation Functions

Download Full-text

A Deep Learning based Arabic Script Recognition System: Benchmark on KHAT

The International Arab Journal of Information Technology ◽

10.34028/iajit/17/3/3 ◽

2020 ◽

Vol 17 (3) ◽

pp. 299-305 ◽

Cited By ~ 1

Author(s):

Riaz Ahmad ◽

Saeeda Naz ◽

Muhammad Afzal ◽

Sheikh Rashid ◽

Marcus Liwicki ◽

...

Keyword(s):

Deep Learning ◽

Character Recognition ◽

Data Augmentation ◽

Short Term Memory ◽

Recognition System ◽

Learning Approach ◽

Arabic Text ◽

Data Set ◽

Processing Step ◽

Handwritten Arabic

This paper presents a deep learning benchmark on a complex dataset known as KFUPM Handwritten Arabic TexT (KHATT). The KHATT data-set consists of complex patterns of handwritten Arabic text-lines. This paper contributes mainly in three aspects i.e., (1) pre-processing, (2) deep learning based approach, and (3) data-augmentation. The pre-processing step includes pruning of white extra spaces plus de-skewing the skewed text-lines. We deploy a deep learning approach based on Multi-Dimensional Long Short-Term Memory (MDLSTM) networks and Connectionist Temporal Classification (CTC). The MDLSTM has the advantage of scanning the Arabic text-lines in all directions (horizontal and vertical) to cover dots, diacritics, strokes and fine inflammation. The data-augmentation with a deep learning approach proves to achieve better and promising improvement in results by gaining 80.02% Character Recognition (CR) over 75.08% as baseline.

Download Full-text