Real-Time Visual Place Recognition Based on Analyzing Distribution of Multi-scale CNN Landmarks

2018 ◽  
Vol 94 (3-4) ◽  
pp. 777-792 ◽  
Author(s):  
Zhe Xin ◽  
Xiaoguang Cui ◽  
Jixiang Zhang ◽  
Yiping Yang ◽  
Yanqing Wang
2017 ◽  
Vol 97 (1) ◽  
pp. 213-244 ◽  
Author(s):  
Michał R. Nowicki ◽  
Jan Wietrzykowski ◽  
Piotr Skrzypczyński

2017 ◽  
Vol 14 (1) ◽  
pp. 172988141668695 ◽  
Author(s):  
Yi Hou ◽  
Hong Zhang ◽  
Shilin Zhou

Recent impressive studies on using ConvNet landmarks for visual place recognition take an approach that involves three steps: (a) detection of landmarks, (b) description of the landmarks by ConvNet features using a convolutional neural network, and (c) matching of the landmarks in the current view with those in the database views. Such an approach has been shown to achieve the state-of-the-art accuracy even under significant viewpoint and environmental changes. However, the computational burden in step (c) significantly prevents this approach from being applied in practice, due to the complexity of linear search in high-dimensional space of the ConvNet features. In this article, we propose two simple and efficient search methods to tackle this issue. Both methods are built upon tree-based indexing. Given a set of ConvNet features of a query image, the first method directly searches the features’ approximate nearest neighbors in a tree structure that is constructed from ConvNet features of database images. The database images are voted on by features in the query image, according to a lookup table which maps each ConvNet feature to its corresponding database image. The database image with the highest vote is considered the solution. Our second method uses a coarse-to-fine procedure: the coarse step uses the first method to coarsely find the top- N database images, and the fine step performs a linear search in Hamming space of the hash codes of the ConvNet features to determine the best match. Experimental results demonstrate that our methods achieve real-time search performance on five data sets with different sizes and various conditions. Most notably, by achieving an average search time of 0.035 seconds/query, our second method improves the matching efficiency by the three orders of magnitude over a linear search baseline on a database with 20,688 images, with negligible loss in place recognition accuracy.


2020 ◽  
Vol 2020 (10) ◽  
pp. 313-1-313-7
Author(s):  
Raffaele Imbriaco ◽  
Egor Bondarev ◽  
Peter H.N. de With

Visual place recognition using query and database images from different sources remains a challenging task in computer vision. Our method exploits global descriptors for efficient image matching and local descriptors for geometric verification. We present a novel, multi-scale aggregation method for local convolutional descriptors, using memory vector construction for efficient aggregation. The method enables to find preliminary set of image candidate matches and remove visually similar but erroneous candidates. We deploy the multi-scale aggregation for visual place recognition on 3 large-scale datasets. We obtain a Recall@10 larger than 94% for the Pittsburgh dataset, outperforming other popular convolutional descriptors used in image retrieval and place recognition. Additionally, we provide a comparison for these descriptors on a more challenging dataset containing query and database images obtained from different sources, achieving over 77% Recall@10.


2021 ◽  
pp. 1039-1049
Author(s):  
Chen Fan ◽  
Adam Jacobson ◽  
Zetao Chen ◽  
Xiaofeng He ◽  
Lilian Zhang ◽  
...  

2018 ◽  
Author(s):  
Λουκάς Μπάμπης

Η παρούσα διδακτορική διατριβή αναφέρεται σε μεθόδους αναγνώρισης περιοχών για ρομποτικές εφαρμογές πραγματικού χρόνου. Συγκεκριμένα, περιγράφει νέες τεχνικές κατά τις οποίες η αναγνώριση γνωστών περιοχών επιτυγχάνεται μέσω πολλαπλών εικόνων και συνδυασμό σημείων ενδιαφέροντος για βελτίωση των αποτελεσμάτων μηχανισμών χαρτογράφησης.


2019 ◽  
Vol 9 (15) ◽  
pp. 3146 ◽  
Author(s):  
Bo Yang ◽  
Xiaosu Xu ◽  
Jun Li ◽  
Hong Zhang

Landmark generation is an essential component in landmark-based visual place recognition. In this paper, we present a simple yet effective method, called multi-scale sliding window (MSW), for landmark generation in order to improve the performance of place recognition. In our method, we generate landmarks that form a uniform distribution in multiple landmark scales (sizes) within an appropriate range by a process that samples an image with a sliding window. This is in contrast to conventional methods of landmark generation that typically depend on detecting objects whose size distributions are uneven and, as a result, may not be effective in achieving shift invariance and viewpoint invariance, two important properties in visual place recognition. We conducted experiments on four challenging datasets to demonstrate that the recognition performance can be significantly improved by our method in a standard landmark-based visual place recognition system. Our method is simple with a single input parameter, the scales of landmarks required, and it is efficient as it does not involve detecting objects.


2015 ◽  
Vol 35 (4) ◽  
pp. 334-356 ◽  
Author(s):  
Elena S. Stumm ◽  
Christopher Mei ◽  
Simon Lacroix

Sign in / Sign up

Export Citation Format

Share Document