Visual Localization Using Voting Based Image Retrieval and Particle Filtering in Indoor Scenes

Improved Visual Localization via Graph Filtering

Journal of Imaging ◽

10.3390/jimaging7020020 ◽

2021 ◽

Vol 7 (2) ◽

pp. 20

Author(s):

Carlos Lassance ◽

Yasir Latif ◽

Ravi Garg ◽

Vincent Gripon ◽

Ian Reid

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Visual Localization ◽

Localization Accuracy ◽

Additional Information ◽

Feature Representations ◽

The Mean ◽

Classical Image ◽

Vision Based Localization ◽

Improve Reliability

Vision-based localization is the problem of inferring the pose of the camera given a single image. One commonly used approach relies on image retrieval where the query input is compared against a database of localized support examples and its pose is inferred with the help of the retrieved items. This assumes that images taken from the same places consist of the same landmarks and thus would have similar feature representations. These representations can learn to be robust to different variations in capture conditions like time of the day or weather. In this work, we introduce a framework which aims at enhancing the performance of such retrieval-based localization methods. It consists in taking into account additional information available, such as GPS coordinates or temporal proximity in the acquisition of the images. More precisely, our method consists in constructing a graph based on this additional information that is later used to improve reliability of the retrieval process by filtering the feature representations of support and/or query images. We show that the proposed method is able to significantly improve the localization accuracy on two large scale datasets, as well as the mean average precision in classical image retrieval scenarios.

Download Full-text

Benchmarking Image Retrieval for Visual Localization

2020 International Conference on 3D Vision (3DV) ◽

10.1109/3dv50981.2020.00058 ◽

2020 ◽

Author(s):

Noe Pion ◽

Martin Humenberger ◽

Gabriela Csurka ◽

Yohann Cabon ◽

Torsten Sattler

Keyword(s):

Image Retrieval ◽

Visual Localization

Download Full-text

Indoor Visual Positioning Aided by CNN-Based Image Retrieval: Training-Free, 3D Modeling-Free

Sensors ◽

10.3390/s18082692 ◽

2018 ◽

Vol 18 (8) ◽

pp. 2692 ◽

Cited By ~ 17

Author(s):

Yujin Chen ◽

Ruizhi Chen ◽

Mengyun Liu ◽

Aoran Xiao ◽

Dewen Wu ◽

...

Keyword(s):

Image Retrieval ◽

Pose Estimation ◽

Indoor Localization ◽

Location Based Services ◽

Visual Localization ◽

Indoor Environments ◽

Deep Convolutional Neural Networks ◽

Accurate Localization ◽

Visual Odometer ◽

A Robust Indoor Localization System Integrating Visual Localization Aided by CNN-Based Image Retrieval with Monte Carlo Localization

Sensors ◽

10.3390/s19020249 ◽

2019 ◽

Vol 19 (2) ◽

pp. 249 ◽

Cited By ~ 17

Author(s):

Song Xu ◽

Wusheng Chou ◽

Hongyi Dong

Keyword(s):

Monte Carlo ◽

Image Retrieval ◽

Closed Loop ◽

Short Term Memory ◽

Visual Localization ◽

Place Recognition ◽

Global Localization ◽

Localization System ◽

Monte Carlo Localization ◽

Fine Localization

This paper proposes a novel multi-sensor-based indoor global localization system integrating visual localization aided by CNN-based image retrieval with a probabilistic localization approach. The global localization system consists of three parts: coarse place recognition, fine localization and re-localization from kidnapping. Coarse place recognition exploits a monocular camera to realize the initial localization based on image retrieval, in which off-the-shelf features extracted from a pre-trained Convolutional Neural Network (CNN) are adopted to determine the candidate locations of the robot. In the fine localization, a laser range finder is equipped to estimate the accurate pose of a mobile robot by means of an adaptive Monte Carlo localization, in which the candidate locations obtained by image retrieval are considered as seeds for initial random sampling. Additionally, to address the problem of robot kidnapping, we present a closed-loop localization mechanism to monitor the state of the robot in real time and make adaptive adjustments when the robot is kidnapped. The closed-loop mechanism effectively exploits the correlation of image sequences to realize the re-localization based on Long-Short Term Memory (LSTM) network. Extensive experiments were conducted and the results indicate that the proposed method not only exhibits great improvement on accuracy and speed, but also can recover from localization failures compared to two conventional localization methods.

Download Full-text

ERF-IMCS: An Efficient and Robust Framework with Image-Based Monte Carlo Scheme for Indoor Topological Navigation

Applied Sciences ◽

10.3390/app10196829 ◽

2020 ◽

Vol 10 (19) ◽

pp. 6829

Author(s):

Song Xu ◽

Huaidong Zhou ◽

Wusheng Chou

Keyword(s):

Monte Carlo ◽

Image Retrieval ◽

Large Scale ◽

Visual Localization ◽

Global Localization ◽

Localization Method ◽

Perceptual Aliasing ◽

Topological Localization ◽

Global And Local ◽

Topological Navigation

Conventional approaches to global localization and navigation mainly rely on metric maps to provide precise geometric coordinates, which may cause the problem of large-scale structural ambiguity and lack semantic information of the environment. This paper presents a scalable vision-based topological mapping and navigation method for a mobile robot to work robustly and flexibly in large-scale environment. In the vision-based topological navigation, an image-based Monte Carlo localization method is presented to realize global topological localization based on image retrieval, in which fine-tuned local region features from an object detection convolutional neural network (CNN) are adopted to perform image matching. The combination of image retrieval and Monte Carlo provide the robot with the ability to effectively avoid perceptual aliasing. Additionally, we propose an effective visual localization method, simultaneously employing the global and local CNN features of images to construct discriminative representation for environment, which makes the navigation system more robust to the interference of occlusion, translation, and illumination. Extensive experimental results demonstrate that ERF-IMCS exhibits great performance in the robustness and efficiency of navigation.

Download Full-text

A High-Accuracy Indoor-Positioning Method with Automated RGB-D Image Database Construction

Remote Sensing ◽

10.3390/rs11212572 ◽

2019 ◽

Vol 11 (21) ◽

pp. 2572 ◽

Cited By ~ 1

Author(s):

Runzhi Wang ◽

Wenhui Wan ◽

Kaichang Di ◽

Ruilin Chen ◽

Xiaoxue Feng

Keyword(s):

Image Retrieval ◽

Image Database ◽

Indoor Positioning ◽

High Accuracy ◽

The Public ◽

Visual Positioning ◽

Database Construction ◽

Positioning Method ◽

Indoor Scenes ◽

Automated Database

High-accuracy indoor positioning is a prerequisite to satisfy the increasing demands of position-based services in complex indoor scenes. Current indoor visual-positioning methods mainly include image retrieval-based methods, visual landmarks-based methods, and learning-based methods. To better overcome the limitations of traditional methods such as them being labor-intensive, of poor accuracy, and time-consuming, this paper proposes a novel indoor-positioning method with automated red, green, blue and depth (RGB-D) image database construction. First, strategies for automated database construction are developed to reduce the workload of manually selecting database images and ensure the requirements of high-accuracy indoor positioning. The database is automatically constructed according to the rules, which is more objective and improves the efficiency of the image-retrieval process. Second, by combining the automated database construction module, convolutional neural network (CNN)-based image-retrieval module, and strict geometric relations-based pose estimation module, we obtain a high-accuracy indoor-positioning system. Furthermore, in order to verify the proposed method, we conducted extensive experiments on the public indoor environment dataset. The detailed experimental results demonstrated the effectiveness and efficiency of our indoor-positioning method.

Download Full-text

Improvements of Image Retrieval-based Visual Localization Using Structured Database

2019 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops) ◽

10.1109/percomw.2019.8730768 ◽

2019 ◽

Author(s):

Ayari Akada ◽

Junji Takahashi ◽

Yu Yong

Keyword(s):

Image Retrieval ◽

Visual Localization

Download Full-text

IMAGE-TO-IMAGE TRANSLATION FOR ENHANCED FEATURE MATCHING, IMAGE RETRIEVAL AND VISUAL LOCALIZATION

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-2-w7-111-2019 ◽

2019 ◽

Vol IV-2/W7 ◽

pp. 111-119

Author(s):

M. S. Mueller ◽

T. Sattler ◽

M. Pollefeys ◽

B. Jutzi

Keyword(s):

Image Analysis ◽

Image Retrieval ◽

Data Augmentation ◽

Feature Matching ◽

3D Models ◽

Training Data ◽

Visual Localization ◽

Training Images ◽

Image Translation

Abstract. The performance of machine learning and deep learning algorithms for image analysis depends significantly on the quantity and quality of the training data. The generation of annotated training data is often costly, time-consuming and laborious. Data augmentation is a powerful option to overcome these drawbacks. Therefore, we augment training data by rendering images with arbitrary poses from 3D models to increase the quantity of training images. These training images usually show artifacts and are of limited use for advanced image analysis. Therefore, we propose to use image-to-image translation to transform images from a rendered domain to a captured domain. We show that translated images in the captured domain are of higher quality than the rendered images. Moreover, we demonstrate that image-to-image translation based on rendered 3D models enhances the performance of common computer vision tasks, namely feature matching, image retrieval and visual localization. The experimental results clearly show the enhancement on translated images over rendered images for all investigated tasks. In addition to this, we present the advantages utilizing translated images over exclusively captured images for visual localization.

Download Full-text