Aerial-Satellite Image Matching Framework for UAV Absolute Visual Localization using Contrastive Learning

Author(s):  
Seongha Ahn ◽  
Hosun Kang ◽  
Jangmyung Lee

Automatic image registration (IR) is very challenging and very important in the field of hyperspectral remote sensing data. Efficient autonomous IR method is needed with high precision, fast, and robust. A key operation of IR is to align the multiple images in single co-ordinate system for extracting and identifying variation between images considered. In this paper, presented a feature descriptor by combining features from both Feature from Accelerated Segment Test (FAST) and Binary Robust Invariant Scalable Key point (BRISK). The proposed hybrid invariant local features (HILF) descriptor extract useful and similar feature sets from reference and source images. The feature matching method allows finding precise relationship or matching among two feature sets. An experimental analysis described the outcome BRISK, FASK and proposed HILF in terms of inliers ratio and repeatability evaluation metrics.


Author(s):  
Emanuele Frontoni ◽  
Adriano Mancini ◽  
Primo Zingaretti

In this paper a mixed vision-range based approach, based on Kinect technology, for safe landing of an Unmanned Aerial Vehicle (UAV) is proposed. The guidance system allows a remote user to define target areas from an high resolution aerial or satellite image to determine the waypoints of the navigation trajectory or the landing area. The system is based on our previous work on UAV navigation and landing: a feature-based image matching algorithms finds the natural landmarks and gives feedbacks to the control system for autonomous navigation and landing. An algorithm for safe landing areas detection is proposed, based on the use of 4D RGBD (Red, Green, Blue, Distance) image analysis. The helicopter is required to navigate from an initial to a final position in a partially known environment, to locate a landing area and to land on it. Results show the appropriateness of the vision-based approach that does not require any artificial landmark (e.g., helipad) and is quite robust to occlusions, light variations and high vibrations.


Author(s):  
Mohamed Tahoun ◽  
Abd El Rahman Shabayek ◽  
Hamed Nassar ◽  
Marcello M. Giovenco ◽  
Ralf Reulke ◽  
...  

2021 ◽  
Vol 13 (10) ◽  
pp. 2017
Author(s):  
Anbang Liang ◽  
Qingquan Li ◽  
Zhipeng Chen ◽  
Dejin Zhang ◽  
Jiasong Zhu ◽  
...  

Fisheye cameras are widely used in visual localization due to the advantage of the wide field of view. However, the severe distortion in fisheye images lead to feature matching difficulties. This paper proposes an IMU-assisted fisheye image matching method called spherically optimized random sample consensus (So-RANSAC). We converted the putative correspondences into fisheye spherical coordinates and then used an inertial measurement unit (IMU) to provide relative rotation angles to assist fisheye image epipolar constraints and improve the accuracy of pose estimation and mismatch removal. To verify the performance of So-RANSAC, experiments were performed on fisheye images of urban drainage pipes and public data sets. The experimental results showed that So-RANSAC can effectively improve the mismatch removal accuracy, and its performance was superior to the commonly used fisheye image matching methods in various experimental scenarios.


Sensors ◽  
2018 ◽  
Vol 18 (8) ◽  
pp. 2476 ◽  
Author(s):  
Shufei Lin ◽  
Ruiqi Cheng ◽  
Kaiwei Wang ◽  
Kailun Yang

Localization systems play an important role in assisted navigation. Precise localization renders visually impaired people aware of ambient environments and prevents them from coming across potential hazards. The majority of visual localization algorithms, which are applied to autonomous vehicles, are not adaptable completely to the scenarios of assisted navigation. Those vehicle-based approaches are vulnerable to viewpoint, appearance and route changes (between database and query images) caused by wearable cameras of assistive devices. Facing these practical challenges, we propose Visual Localizer, which is composed of ConvNet descriptor and global optimization, to achieve robust visual localization for assisted navigation. The performance of five prevailing ConvNets are comprehensively compared, and GoogLeNet is found to feature the best performance on environmental invariance. By concatenating two compressed convolutional layers of GoogLeNet, we use only thousands of bytes to represent image efficiently. To further improve the robustness of image matching, we utilize the network flow model as a global optimization of image matching. The extensive experiments using images captured by visually impaired volunteers illustrate that the system performs well in the context of assisted navigation.


2009 ◽  
Vol 36 (7) ◽  
pp. n/a-n/a ◽  
Author(s):  
Ivana Barisin ◽  
Sebastien Leprince ◽  
Barry Parsons ◽  
Tim Wright

2012 ◽  
Vol 22 (1) ◽  
pp. 236-243 ◽  
Author(s):  
I. N. Kaveev ◽  
A. G. Tashlinskii ◽  
R. M. Kurbanaliev

2017 ◽  
Vol 8 (12) ◽  
pp. 1180-1189 ◽  
Author(s):  
Siliang Du ◽  
Mi Wang ◽  
Shenghui Fang

Author(s):  
Y. Wang ◽  
D. Gong ◽  
H. Hu ◽  
S. Wang ◽  
Y. Han ◽  
...  

Abstract. Large-scale Digital Surface Model (DSM) generated with high-resolution satellite images (HRSI) are comparable, cheaper, and more accessible when comparing to Light Detection and Ranging (LiDAR) data and aerial remotely sensed images. Several photogrammetric commercial/open-source software packages are being developed for satellite image-based 3D reconstruction, in which, most of them adopt a modified version of Semi-Global Matching (SGM) algorithm for dense image matching. With the continuous development of matching cost computation methods, the existing methods can be divided into classical (low-level) and learning-based algorithms (non-end-to-end learning and end-to-end learning methods). On Middlebury and KITTI datasets, learning-based algorithms has shown their superiority compared to SGM derived methods. In this context, we assume that matching cost is the key factor of DIM. This paper reviews and evaluates Census Transform, and MC-CNN on a WorldView-3 typical city scene satellite stereo images on the premise that the overall SGM framework remains unchanged, providing a preliminary comparison for academic and industrial. We first compute the cost valume of these two methods, obtains the final DSM after semi-global optimization, and compares their gemetric accuracy with the corresponding LiDAR derived ground truth. We presented our comparison and findings in the experimental section.


Sign in / Sign up

Export Citation Format

Share Document