Image-Based Localization Using Context

Image-based localization problem consists of estimating the 6 DoF camera pose by matching the image to a 3D point cloud (or equivalent) representing a 3D environment. The robustness and accuracy of current solutions is not objective and quantifiable. We have completed a comparative analysis of the main state of the art approaches, namely Brute Force Matching, Approximate Nearest Neighbour Matching, Embedded Ferns Classification, ACG Localizer( Using Visual Vocabulary) and Keyframe Matching Approach. The results of the study revealed major deficiencies in each approach mainly in search space reduction, clustering, feature matching and sensitivity to where the query image was taken. Then, we choose to focus on one common major problem that is reducing the search space. We propose to create a new image-based localization approach based on reducing the search space by using global descriptors to find candidate keyframes in the database then search against the 3D points that are only seen from these candidates using local descriptors stored in a 3D cloud map.

Download Full-text

Pointwise Rotation-Invariant Network with Adaptive Sampling and 3D Spherical Voxel Convolution

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6965 ◽

2020 ◽

Vol 34 (07) ◽

pp. 12717-12724

Author(s):

Yang You ◽

Yujing Lou ◽

Qi Liu ◽

Yu-Wing Tai ◽

Lizhuang Ma ◽

...

Keyword(s):

Adaptive Sampling ◽

Point Cloud ◽

Data Augmentation ◽

Feature Matching ◽

State Of The Art ◽

Point Clouds ◽

Rotation Invariant ◽

Learning Framework ◽

Point Set ◽

Part Segmentation

Point cloud analysis without pose priors is very challenging in real applications, as the orientations of point clouds are often unknown. In this paper, we propose a brand new point-set learning framework PRIN, namely, Pointwise Rotation-Invariant Network, focusing on rotation-invariant feature extraction in point clouds analysis. We construct spherical signals by Density Aware Adaptive Sampling to deal with distorted point distributions in spherical space. In addition, we propose Spherical Voxel Convolution and Point Re-sampling to extract rotation-invariant features for each point. Our network can be applied to tasks ranging from object classification, part segmentation, to 3D feature matching and label alignment. We show that, on the dataset with randomly rotated point clouds, PRIN demonstrates better performance than state-of-the-art methods without any data augmentation. We also provide theoretical analysis for the rotation-invariance achieved by our methods.

Download Full-text

An Image-Based Class Retrieval System for Roman Republican Coins

Entropy ◽

10.3390/e22080799 ◽

2020 ◽

Vol 22 (8) ◽

pp. 799

Author(s):

Hafeez Anwar ◽

Serwah Sabetghadam ◽

Peter Bell

Keyword(s):

Classification Accuracy ◽

Retrieval System ◽

Online Auctions ◽

Search Space ◽

Reference Book ◽

Query Image ◽

Force Matching ◽

User Friendly ◽

Friendly Graphical User Interface

We propose an image-based class retrieval system for ancient Roman Republican coins that can be instrumental in various archaeological applications such as museums, Numismatics study, and even online auctions websites. For such applications, the aim is not only classification of a given coin, but also the retrieval of its information from standard reference book. Such classification and information retrieval is performed by our proposed system via a user friendly graphical user interface (GUI). The query coin image gets matched with exemplar images of each coin class stored in the database. The retrieved coin classes are then displayed in the GUI along with their descriptions from a reference book. However, it is highly impractical to match a query image with each of the class exemplar images as there are 10 exemplar images for each of the 60 coin classes. Similarly, displaying all the retrieved coin classes and their respective information in the GUI will cause user inconvenience. Consequently, to avoid such brute-force matching, we incrementally vary the number of matches per class to find the least matches attaining the maximum classification accuracy. In a similar manner, we also extend the search space for coin class to find the minimal number of retrieved classes that achieve maximum classification accuracy. On the current dataset, our system successfully attains a classification accuracy of 99% for five matches per class such that the top ten retrieved classes are considered. As a result, the computational complexity is reduced by matching the query image with only half of the exemplar images per class. In addition, displaying the top 10 retrieved classes is far more convenient than displaying all 60 classes.

Download Full-text

Binarized Neural Architecture Search

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6624 ◽

2020 ◽

Vol 34 (07) ◽

pp. 10526-10533 ◽

Cited By ~ 1

Author(s):

Hanlin Chen ◽

Li'an Zhuo ◽

Baochang Zhang ◽

Xiawu Zheng ◽

Jianzhuang Liu ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

State Of The Art ◽

Optimization Methods ◽

Search Space ◽

Network Architectures ◽

Neural Architecture ◽

Space Reduction ◽

The Cost ◽

A Performance

Neural architecture search (NAS) can have a significant impact in computer vision by automatically designing optimal neural network architectures for various tasks. A variant, binarized neural architecture search (BNAS), with a search space of binarized convolutions, can produce extremely compressed models. Unfortunately, this area remains largely unexplored. BNAS is more challenging than NAS due to the learning inefficiency caused by optimization requirements and the huge architecture space. To address these issues, we introduce channel sampling and operation space reduction into a differentiable NAS to significantly reduce the cost of searching. This is accomplished through a performance-based strategy used to abandon less potential operations. Two optimization methods for binarized neural networks are used to validate the effectiveness of our BNAS. Extensive experiments demonstrate that the proposed BNAS achieves a performance comparable to NAS on both CIFAR and ImageNet databases. An accuracy of 96.53% vs. 97.22% is achieved on the CIFAR-10 dataset, but with a significantly compressed model, and a 40% faster search than the state-of-the-art PC-DARTS.

Download Full-text

An Efficient and General Framework for Aerial Point Cloud Classification in Urban Scenarios

Remote Sensing ◽

10.3390/rs13101985 ◽

2021 ◽

Vol 13 (10) ◽

pp. 1985

Author(s):

Emre Özdemir ◽

Fabio Remondino ◽

Alessandro Golkar

Keyword(s):

Point Cloud ◽

State Of The Art ◽

Computational Power ◽

Specific Data ◽

Cloud Processing ◽

Point Cloud Processing ◽

Current State ◽

Data Source ◽

Point Cloud Classification ◽

And Training

With recent advances in technologies, deep learning is being applied more and more to different tasks. In particular, point cloud processing and classification have been studied for a while now, with various methods developed. Some of the available classification approaches are based on specific data source, like LiDAR, while others are focused on specific scenarios, like indoor. A general major issue is the computational efficiency (in terms of power consumption, memory requirement, and training/inference time). In this study, we propose an efficient framework (named TONIC) that can work with any kind of aerial data source (LiDAR or photogrammetry) and does not require high computational power while achieving accuracy on par with the current state of the art methods. We also test our framework for its generalization ability, showing capabilities to learn from one dataset and predict on unseen aerial scenarios.

Download Full-text

Implementation of Verification and Matching E-KTP with Faster R-CNN and ORB

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v5i4.3175 ◽

2021 ◽

Vol 5 (4) ◽

pp. 783-793

Author(s):

Muhammad Muttabi Hudaya ◽

Siti Saadah ◽

Hendy Irawan

Keyword(s):

Image Matching ◽

Nearest Neighbor ◽

Feature Matching ◽

Image Feature ◽

Brute Force ◽

K Nearest Neighbor ◽

Matching Method ◽

Average Precision ◽

Detection Model ◽

Matching Process

needs a solid validation that has verification and matching uploaded images. To solve this problem, this paper implementing a detection model using Faster R-CNN and a matching method using ORB (Oriented FAST and Rotated BRIEF) and KNN-BFM (K-Nearest Neighbor Brute Force Matcher). The goal of the implementations is to reach both an 80% mark of accuracy and prove matching using ORB only can be a replaced OCR technique. The implementation accuracy results in the detection model reach mAP (Mean Average Precision) of 94%. But, the matching process only achieves an accuracy of 43,46%. The matching process using only image feature matching underperforms the previous OCR technique but improves processing time from 4510ms to 60m). Image matching accuracy has proven to increase by using a high-quality dan high quantity dataset, extracting features on the important area of EKTP card images.

Download Full-text

An optimization method for preventive control using differential evolution with consecutive search space reduction

2016 IEEE PES Innovative Smart Grid Technologies Conference Europe (ISGT-Europe) ◽

10.1109/isgteurope.2016.7856215 ◽

2016 ◽

Cited By ~ 2

Author(s):

C. Fatih Kucuktezcan ◽

V. M. Istemihan Genc ◽

Osman Kaan Erol

Keyword(s):

Differential Evolution ◽

Search Space ◽

Optimization Method ◽

Preventive Control ◽

Space Reduction ◽

Search Space Reduction

Download Full-text

Search Space Reduction for MRF Stereo

Lecture Notes in Computer Science - Computer Vision – ECCV 2008 ◽

10.1007/978-3-540-88682-2_44 ◽

2008 ◽

pp. 576-588 ◽

Cited By ~ 8

Author(s):

Liang Wang ◽

Hailin Jin ◽

Ruigang Yang

Keyword(s):

Search Space ◽

Space Reduction ◽

Search Space Reduction

Download Full-text

Online Point Cloud Object Recognition System using Local Descriptors for Real-time Applications

Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications ◽

10.5220/0010198703010308 ◽

2021 ◽

Author(s):

Yacine Yaddaden ◽

Sylvie Daniel ◽

Denis Laurendeau

Keyword(s):

Object Recognition ◽

Real Time ◽

Point Cloud ◽

Recognition System ◽

Local Descriptors ◽

Real Time Applications

Download Full-text

Contact Lens Classification by Using Segmented Lens Boundary Features

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v11.i3.pp1129-1135 ◽

2018 ◽

Vol 11 (3) ◽

pp. 1129

Author(s):

Nur Ariffin Mohd Zin ◽

Hishammuddin Asmuni ◽

Haza Nuzly Abdul Hamed ◽

Razib M. Othman ◽

Shahreen Kasim ◽

...

Keyword(s):

Support Vector Machines ◽

Contact Lens ◽

State Of The Art ◽

Classification Method ◽

Support Vector ◽

Local Descriptors ◽

Iris Image ◽

Vector Machines ◽

False Reject Rate ◽

Better Than

Recent studies have shown that the wearing of soft lens may lead to performance degradation with the increase of false reject rate. However, detecting the presence of soft lens is a non-trivial task as its texture that almost indiscernible. In this work, we proposed a classification method to identify the existence of soft lens in iris image. Our proposed method starts with segmenting the lens boundary on top of the sclera region. Then, the segmented boundary is used as features and extracted by local descriptors. These features are then trained and classified using Support Vector Machines. This method was tested on Notre Dame Cosmetic Contact Lens 2013 database. Experiment showed that the proposed method performed better than state of the art methods.

Download Full-text

SMT-Based Contention-Free Task Mapping and Scheduling on 2D/3D SMART NoC with Mixed Dimension-Order Routing

ACM Transactions on Architecture and Code Optimization ◽

10.1145/3487018 ◽

2022 ◽

Vol 19 (1) ◽

pp. 1-21

Author(s):

Daeyeal Lee ◽

Bill Lin ◽

Chung-Kuan Cheng

Keyword(s):

System Performance ◽

Search Space ◽

Satisfiability Modulo Theories ◽

Low Latency ◽

Task Mapping ◽

Single Cycle ◽

Space Reduction ◽

Reduction Techniques ◽

2D And 3D ◽

Mixed Dimension

SMART NoCs achieve ultra-low latency by enabling single-cycle multiple-hop transmission via bypass channels. However, contention along bypass channels can seriously degrade the performance of SMART NoCs by breaking the bypass paths. Therefore, contention-free task mapping and scheduling are essential for optimal system performance. In this article, we propose an SMT (Satisfiability Modulo Theories)-based framework to find optimal contention-free task mappings with minimum application schedule lengths on 2D/3D SMART NoCs with mixed dimension-order routing. On top of SMT’s fast reasoning capability for conditional constraints, we develop efficient search-space reduction techniques to achieve practical scalability. Experiments demonstrate that our SMT framework achieves 10× higher scalability than ILP (Integer Linear Programming) with 931.1× (ranges from 2.2× to 1532.1×) and 1237.1× (ranges from 4× to 4373.8×) faster average runtimes for finding optimum solutions on 2D and 3D SMART NoCs and our 2D and 3D extensions of the SMT framework with mixed dimension-order routing also maintain the improved scalability with the extended and diversified routing paths, resulting in reduced application schedule lengths throughout various application benchmarks.

Download Full-text