Feature-based Monocular Visual Odometry for Autonomous Driving and Hyperparameter Tuning to Improve Trajectory Estimation

2020 ◽  
Vol 1453 ◽  
pp. 012067
Author(s):  
Ziyang Zheng
2012 ◽  
Vol 2012 ◽  
pp. 1-13 ◽  
Author(s):  
David Valiente García ◽  
Lorenzo Fernández Rojo ◽  
Arturo Gil Aparicio ◽  
Luis Payá Castelló ◽  
Oscar Reinoso García

In the field of mobile autonomous robots, visual odometry entails recovering the motion transformation between two consecutive poses of the robot using only a camera sensor. Visual odometry provides essential information for trajectory estimation in problems such as localization and SLAM (Simultaneous Localization and Mapping). In this work we present a motion estimation method based on a single omnidirectional camera. We exploit the maximized horizontal field of view provided by this camera, which allows us to encode a large amount of scene information into a single image. The motion transformation between two poses is computed incrementally, since only two consecutive omnidirectional images need to be processed. In particular, we exploit the versatility of the information gathered by omnidirectional images to obtain visual odometry results with both an appearance-based and a feature-based method. We carried out a set of experiments in real indoor environments to test the validity and suitability of both methods. The experimental data consist of large sets of omnidirectional images captured along the robot's trajectory in three different real scenarios. Experimental results demonstrate the accuracy of the estimates and the capability of both methods to work in real time.
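The feature-based variant described above reduces, at each step, to estimating the relative pose between two consecutive images from matched keypoints. The following is a minimal sketch of that incremental step using standard OpenCV tools and a pinhole camera model; the paper's omnidirectional projection and its appearance-based counterpart are not reproduced here, and all function and variable names are illustrative.

```python
# Minimal sketch: relative pose between two consecutive frames from matched
# features. Assumes a calibrated pinhole camera (intrinsics K); monocular
# translation is recovered only up to scale.
import cv2
import numpy as np

def relative_motion(img_prev, img_curr, K):
    orb = cv2.ORB_create(2000)
    kp1, des1 = orb.detectAndCompute(img_prev, None)
    kp2, des2 = orb.detectAndCompute(img_curr, None)

    matcher = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)
    matches = sorted(matcher.match(des1, des2), key=lambda m: m.distance)

    pts1 = np.float32([kp1[m.queryIdx].pt for m in matches])
    pts2 = np.float32([kp2[m.trainIdx].pt for m in matches])

    # RANSAC on the essential matrix rejects outlier correspondences.
    E, mask = cv2.findEssentialMat(pts1, pts2, K, method=cv2.RANSAC, threshold=1.0)
    _, R, t, _ = cv2.recoverPose(E, pts1, pts2, K, mask=mask)
    return R, t
```

Chaining such pairwise estimates frame by frame yields the incremental trajectory, which is why only two consecutive images ever need to be held in memory.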


2017 ◽  
Vol 14 (5) ◽  
pp. 172988141773566 ◽  
Author(s):  
Lifeng An ◽  
Xinyu Zhang ◽  
Hongbo Gao ◽  
Yuchao Liu

Visual odometry plays an important role in urban autonomous driving. Feature-based visual odometry methods sample candidates randomly from all available feature points, while alignment-based methods take all pixels into account. Both rest on the assumption that the majority of candidate visual cues reflect the true camera motion. In real urban traffic scenes, however, this assumption can be broken by the many dynamic traffic participants: large trucks or buses may occupy most of the image from a front-view monocular camera and lead to wrong visual odometry estimates. Finding visual cues that actually represent the ego-motion is the most important and hardest step for visual odometry in dynamic environments, and the semantic attributes of pixels are a more reasonable criterion for candidate selection in that case. This article analyzes the availability of visual cues with the help of pixel-level semantic information and proposes a new visual odometry method that combines feature-based and alignment-based approaches in a single optimization pipeline. The proposed method was compared with three open-source visual odometry algorithms on the KITTI benchmark data sets and on our own data set. Experimental results confirm that the new approach improves both accuracy and robustness in complex dynamic scenes.
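A hedged sketch of the candidate-selection idea follows: keypoints that land on pixels labelled as dynamic traffic participants are discarded before pose estimation, so that the remaining cues are more likely to reflect the true ego-motion. The class ids below follow the Cityscapes train-id convention and are an assumption; the paper's own label set and selection rule may differ.

```python
# Drop feature candidates that lie on dynamic objects according to a
# pixel-level semantic segmentation map.
import numpy as np

# Cityscapes train ids: person, rider, car, truck, bus, train, motorcycle, bicycle
DYNAMIC_IDS = np.array([11, 12, 13, 14, 15, 16, 17, 18])

def filter_static_keypoints(keypoints, semantic_map):
    """keypoints: (N, 2) array of (x, y) pixel coordinates; semantic_map: (H, W) class ids."""
    xs = keypoints[:, 0].astype(int)
    ys = keypoints[:, 1].astype(int)
    labels = semantic_map[ys, xs]
    return keypoints[~np.isin(labels, DYNAMIC_IDS)]
```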


Author(s):  
Jianke Zhu

Visual odometry is an important research problem in computer vision and robotics. In general, feature-based visual odometry methods rely heavily on accurate correspondences between local salient points, while direct approaches can make full use of the whole image and perform dense 3D reconstruction simultaneously. However, direct visual odometry usually suffers from getting stuck in local optima, especially under large displacement, which may lead to inferior results. To tackle this critical problem, we propose in this paper a novel scheme for stereo odometry that improves convergence and yields more accurate poses. The key to our approach is a dual Jacobian optimization fused into a multi-scale pyramid scheme. Moreover, we introduce a gradient-based feature representation, which has the merit of being robust to illumination changes. Furthermore, a joint direct odometry approach is proposed to incorporate information from the last frame and previous keyframes. We conducted an experimental evaluation on the challenging KITTI odometry benchmark, where the promising results show that the proposed algorithm is highly effective for stereo visual odometry.
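Two of the ingredients named above, the gradient-based representation and the multi-scale pyramid, can be sketched briefly. The snippet below is illustrative only: refine_pose stands in for the paper's dual-Jacobian update, which is not reproduced here.

```python
# Gradient-magnitude representation (less sensitive to illumination change than
# raw intensity) and a coarse-to-fine pyramid loop around a pose-refinement step.
import cv2

def gradient_representation(img):
    gx = cv2.Sobel(img, cv2.CV_32F, 1, 0, ksize=3)
    gy = cv2.Sobel(img, cv2.CV_32F, 0, 1, ksize=3)
    return cv2.magnitude(gx, gy)

def coarse_to_fine(img_ref, img_cur, init_pose, refine_pose, levels=4):
    pyr_ref = [gradient_representation(img_ref)]
    pyr_cur = [gradient_representation(img_cur)]
    for _ in range(levels - 1):
        pyr_ref.append(cv2.pyrDown(pyr_ref[-1]))
        pyr_cur.append(cv2.pyrDown(pyr_cur[-1]))

    pose = init_pose
    # Start at the coarsest level so large displacements are absorbed early,
    # then hand the refined pose down to finer levels.
    for ref, cur in zip(reversed(pyr_ref), reversed(pyr_cur)):
        pose = refine_pose(ref, cur, pose)
    return pose
```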


Author(s):  
Haimei Zhao ◽  
Wei Bian ◽  
Bo Yuan ◽  
Dacheng Tao

Scene perception and understanding tasks, including depth estimation, visual odometry (VO) and camera relocalization, are fundamental for applications such as autonomous driving, robots and drones. Driven by the power of deep learning, significant progress has been achieved on the individual tasks, but the rich correlations among the three tasks are largely neglected. In previous studies, VO is generally accurate locally yet suffers from drift over long distances. By contrast, camera relocalization performs well in the global sense but lacks local precision. We argue that these two tasks should be strategically combined to leverage their complementary advantages, and be further improved by exploiting the 3D geometric information in depth data, which in turn also benefits depth estimation. Therefore, we present a collaborative learning framework, consisting of DepthNet, LocalPoseNet and GlobalPoseNet with a joint optimization loss, to estimate depth, VO and camera localization jointly. Moreover, the Geometric Attention Guidance Model is introduced to exploit the geometric relevance among the three branches during learning. Extensive experiments demonstrate that the joint learning scheme is useful for all tasks and that our method outperforms current state-of-the-art techniques in depth estimation and camera relocalization, with highly competitive performance in VO.
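The joint optimization can be pictured as a weighted sum of per-branch losses over DepthNet, LocalPoseNet and GlobalPoseNet. The PyTorch-style sketch below is an assumption made for illustration: the individual loss terms and weights are placeholders, not the paper's actual formulation or its geometric attention mechanism.

```python
import torch.nn as nn
import torch.nn.functional as F

class JointLoss(nn.Module):
    """Weighted sum of depth, local VO and global relocalization losses."""
    def __init__(self, w_depth=1.0, w_vo=1.0, w_reloc=1.0):
        super().__init__()
        self.w_depth, self.w_vo, self.w_reloc = w_depth, w_vo, w_reloc

    def forward(self, depth_pred, depth_gt, vo_pred, vo_gt, pose_pred, pose_gt):
        l_depth = F.l1_loss(depth_pred, depth_gt)    # DepthNet branch
        l_vo = F.mse_loss(vo_pred, vo_gt)            # LocalPoseNet branch
        l_reloc = F.mse_loss(pose_pred, pose_gt)     # GlobalPoseNet branch
        return self.w_depth * l_depth + self.w_vo * l_vo + self.w_reloc * l_reloc
```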


2021 ◽  
Author(s):  
William Gates ◽  
Grafika Jati ◽  
Riskyana Dewi Intan P ◽  
Mahardhika Pratama ◽  
Wisnu Jatmiko

2019 ◽  
Vol 11 (18) ◽  
pp. 2139
Author(s):  
Ke Wang ◽  
Xin Huang ◽  
JunLan Chen ◽  
Chuan Cao ◽  
Zhoubing Xiong ◽  
...  

We present a novel low-cost visual odometry method for estimating the ego-motion (self-motion) of ground vehicles by detecting the changes that motion induces in the images. Unlike traditional localization methods that use a differential global positioning system (GPS), a precise inertial measurement unit (IMU) or 3D Lidar, the proposed method only leverages data from inexpensive forward and backward onboard cameras. Starting with spatial-temporal synchronization, the scale factor of the backward monocular visual odometry is estimated with an MSE optimization over a sliding window. Then, for trajectory estimation, an improved two-layer Kalman filter is proposed, comprising an orientation fusion step and a position fusion step; in the orientation fusion step, the trajectory error, represented by a unit quaternion, is used as the filter state. The resulting system enables high-accuracy, low-cost ego-pose estimation and is robust to camera module degradation, automatically reducing the confidence of a failed sensor in the fusion pipeline. It can therefore operate in the presence of complex and highly dynamic motion, such as entering and exiting tunnels, texture-less scenes, illumination changes, bumpy roads, and even the failure of one of the cameras. The experiments carried out in this paper show that our algorithm achieves the best performance on the evaluation indexes of average error in distance (AED), average error in the X direction (AEX), average error in the Y direction (AEY), and root mean square error (RMSE) compared with other state-of-the-art algorithms, indicating that the results of our approach are superior to those of other methods.
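One step of the pipeline, the sliding-window MSE estimate of the backward camera's scale factor, admits a closed-form least-squares solution. The sketch below assumes the metric reference comes from the forward camera's per-frame translation lengths; that pairing, like the names used, is an illustrative assumption rather than the paper's exact formulation.

```python
# Scale s minimizing sum((s * a_i - b_i)^2) over a sliding window, where a_i are
# unscaled backward-odometry translation norms and b_i the metric reference.
from collections import deque
import numpy as np

class SlidingWindowScale:
    def __init__(self, window=50):
        self.mono = deque(maxlen=window)  # unscaled translation norms
        self.ref = deque(maxlen=window)   # metric reference norms

    def update(self, mono_step, ref_step):
        self.mono.append(mono_step)
        self.ref.append(ref_step)
        a = np.asarray(self.mono)
        b = np.asarray(self.ref)
        # Closed-form least-squares minimizer of the windowed MSE.
        return float(a @ b) / max(float(a @ a), 1e-9)
```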

