scholarly journals Global pose estimation with limited GPS and long range visual odometry

Author(s):  
Joern Rehder ◽  
Kamal Gupta ◽  
Stephen Nuske ◽  
Sanjiv Singh
2020 ◽  
Vol 17 (11) ◽  
pp. 634-646
Author(s):  
Andrew Lee ◽  
Will Dallmann ◽  
Scott Nykl ◽  
Clark Taylor ◽  
Brett Borghetti

IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 169908-169919 ◽  
Author(s):  
Francisco J. Romero-Ramire ◽  
Rafael Munoz-Salinas ◽  
R. Medina-Carnicer

Author(s):  
Rohan More ◽  
Rahul Kottath ◽  
R. Jegadeeshwaran ◽  
Vipan Kumar ◽  
Vinod Karar ◽  
...  

Author(s):  
Pasquale Ferrara ◽  
Alessandro Piva ◽  
Fabrizio Argenti ◽  
Junya Kusuno ◽  
Marta Niccolini ◽  
...  

Sensors ◽  
2019 ◽  
Vol 19 (4) ◽  
pp. 953 ◽  
Author(s):  
Nicolas Antigny ◽  
Hideaki Uchiyama ◽  
Myriam Servières ◽  
Valérie Renaudin ◽  
Diego Thomas ◽  
...  

The urban environments represent challenging areas for handheld device pose estimation (i.e., 3D position and 3D orientation) in large displacements. It is even more challenging with low-cost sensors and computational resources that are available in pedestrian mobile devices (i.e., monocular camera and Inertial Measurement Unit). To address these challenges, we propose a continuous pose estimation based on monocular Visual Odometry. To solve the scale ambiguity and suppress the scale drift, an adaptive pedestrian step lengths estimation is used for the displacements on the horizontal plane. To complete the estimation, a handheld equipment height model, with respect to the Digital Terrain Model contained in Geographical Information Systems, is used for the displacement on the vertical axis. In addition, an accurate pose estimation based on the recognition of known objects is punctually used to correct the pose estimate and reset the monocular Visual Odometry. To validate the benefit of our framework, experimental data have been collected on a 0.7 km pedestrian path in an urban environment for various people. Thus, the proposed solution allows to achieve a positioning error of 1.6–7.5% of the walked distance, and confirms the benefit of the use of an adaptive step length compared to the use of a fixed-step length.


2019 ◽  
Vol 11 (1) ◽  
pp. 67 ◽  
Author(s):  
Sung-Joo Yoon ◽  
Taejung Kim

One of the important image processing technologies is visual odometry (VO) technology. VO estimates platform motion through a sequence of images. VO is of interest in the virtual reality (VR) industry as well as the automobile industry because the construction cost is low. In this study, we developed stereo visual odometry (SVO) based on photogrammetric geometric interpretation. The proposed method performed feature optimization and pose estimation through photogrammetric bundle adjustment. After corresponding the point extraction step, the feature optimization was carried out with photogrammetry-based and vision-based optimization. Then, absolute orientation was performed for pose estimation through bundle adjustment. We used ten sequences provided by the Karlsruhe institute of technology and Toyota technological institute (KITTI) community. Through a two-step optimization process, we confirmed that the outliers, which were not removed by conventional outlier filters, were removed. We also were able to confirm the applicability of photogrammetric techniques to stereo visual odometry technology.


Author(s):  
Yiran Zhu ◽  
Xing Xu ◽  
Fumin Shen ◽  
Yanli Ji ◽  
Lianli Gao ◽  
...  

Graph neural networks (GNNs) have been widely used in the 3D human pose estimation task, since the pose representation of a human body can be naturally modeled by the graph structure. Generally, most of the existing GNN-based models utilize the restricted receptive fields of filters and single-scale information, while neglecting the valuable multi-scale contextual information. To tackle this issue, we propose a novel Graph Transformer Encoder-Decoder with Atrous Convolution, named PoseGTAC, to effectively extract multi-scale context and long-range information. In our proposed PoseGTAC model, Graph Atrous Convolution (GAC) and Graph Transformer Layer (GTL), respectively for the extraction of local multi-scale and global long-range information, are combined and stacked in an encoder-decoder structure, where graph pooling and unpooling are adopted for the interaction of multi-scale information from local to global (e.g., part-scale and body-scale). Extensive experiments on the Human3.6M and MPI-INF-3DHP datasets demonstrate that the proposed PoseGTAC model exceeds all previous methods and achieves state-of-the-art performance.


Sign in / Sign up

Export Citation Format

Share Document