A generalized method for 3D object location from single 2D images

1992 ◽ Vol 25 (8) ◽ pp. 771-786
Author(s): D. Daniel Sheu, Alan H. Bond
Author(s): Dong-Gi Gwak, Soon-Chul Hwang, Seo-Won Ok, Jung-Sae Yim, Dong Hwan Kim

2010 ◽ Vol 8 (6) ◽ pp. 514-514
Author(s): W. Hayward, A. Pasqualotto

Author(s): T. Peters, C. Brenner, M. Song

Abstract. The goal of this paper is to use transfer learning for semi-supervised semantic segmentation of 2D images: given a pretrained deep convolutional neural network (DCNN), our aim is to adapt it to a new camera-sensor system by enforcing predictions to be consistent for the same object in space. This is enabled by projecting 3D object points into multi-view 2D images. Since every 3D object point is usually mapped to a number of 2D images, each of which undergoes pixelwise classification using the pretrained DCNN, we obtain a number of predictions (labels) for the same object point. This makes it possible to detect and correct outlier predictions. Ultimately, we retrain the DCNN on the corrected dataset in order to adapt the network to the new input data. We demonstrate the effectiveness of our approach on a mobile mapping dataset containing over 10,000 images and more than 1 billion 3D points. Moreover, we manually annotated a subset of the mobile mapping images and show that we were able to raise the mean intersection over union (mIoU) by approximately 10% with Deeplabv3+ using our approach.
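The outlier-correction step described in the abstract can be sketched with a simple majority vote across views. The abstract does not specify the exact correction rule, so the voting scheme, function name, and data layout below are assumptions for illustration only:

```python
from collections import Counter

def correct_labels(point_predictions):
    """Hypothetical sketch: given, for each 3D point, the list of class
    labels predicted in the 2D views it projects into, return a single
    majority-vote label per point. Views whose prediction disagrees with
    the majority are treated as outlier predictions and overruled."""
    corrected = {}
    for point_id, labels in point_predictions.items():
        counts = Counter(labels)
        corrected[point_id] = counts.most_common(1)[0][0]
    return corrected

# Example: 3D point 7 is visible in five views; one view mislabels it.
preds = {7: ["building", "building", "road", "building", "building"]}
print(correct_labels(preds))  # {7: 'building'}
```

The corrected per-point labels could then be projected back into the images to build the retraining set, as the abstract outlines.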


1998
Author(s): Patrick S. P. Wang

This paper is aimed at 3D object understanding from 2D images, including articulated objects in an active vision environment, using interactive and Internet virtual reality techniques. Generally speaking, an articulated object can be divided into two portions: a main rigid portion and an articulated portion. It is more complicated than a rigid object in that the relative positions, shapes, or angles between the main portion and the articulated portion have essentially infinite variations, in addition to the infinite variations of each individual rigid portion due to orientations, rotations, and topological transformations. A new method generalized from linear combination is employed to investigate such problems. It uses very few learning samples, and can describe, understand, and recognize 3D articulated objects while the object's status is being changed in an active vision environment.
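The "linear combination" the abstract generalizes from refers to the idea, associated with Ullman and Basri, that coordinates of feature points in a novel view of a rigid object can be expressed as a linear combination of the corresponding coordinates in a few stored model views. The sketch below is a hypothetical illustration of that base idea with synthetic data, not the paper's own algorithm; the function name and the least-squares fitting choice are assumptions:

```python
import numpy as np

def fit_view_coefficients(model_rows, novel):
    """model_rows: (k, n) array, each row one coordinate (x or y) of the
    same n matched feature points taken from stored model views.
    novel: (n,) corresponding coordinate of those points in a new view.
    Fits novel ~= coeffs @ model_rows + b by least squares and returns
    the k coefficients followed by the offset b."""
    A = np.vstack([model_rows, np.ones(model_rows.shape[1])]).T
    sol, *_ = np.linalg.lstsq(A, novel, rcond=None)
    return sol

# Synthetic check: build a novel coordinate row as a known linear
# combination of three stored rows, then recover the coefficients.
rng = np.random.default_rng(0)
rows = rng.normal(size=(3, 12))            # 3 stored rows, 12 points
true_coeffs = np.array([0.5, -1.2, 2.0])
novel = true_coeffs @ rows + 0.3
sol = fit_view_coefficients(rows, novel)
assert np.allclose(sol[:3], true_coeffs) and np.isclose(sol[3], 0.3)
```

A recognizer built on this idea matches a candidate view by checking whether some coefficient set reproduces it with low residual, which is why very few learning samples can suffice for rigid portions.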

