Robust optical-flow based self-motion estimation for a quadrotor UAV

Author(s):  
Volker Grabe ◽  
Heinrich H. Bülthoff ◽  
Paolo Robuffo Giordano

2015 ◽  
Vol 34 (8) ◽  
pp. 1114-1135 ◽  
Author(s):  
Volker Grabe ◽  
Heinrich H. Bülthoff ◽  
Davide Scaramuzza ◽  
Paolo Robuffo Giordano

Electronics ◽  
2021 ◽  
Vol 10 (3) ◽  
pp. 222
Author(s):  
Baigan Zhao ◽  
Yingping Huang ◽  
Hongjian Wei ◽  
Xing Hu

Visual odometry (VO) refers to the incremental estimation of the motion state of an agent (e.g., a vehicle or robot) from image information, and is a key component of modern localization and navigation systems. Addressing the monocular VO problem, this paper presents a novel end-to-end network for estimating camera ego-motion. The network learns the latent subspace of optical flow (OF) and models sequential dynamics so that the motion estimation is constrained by the relations between sequential images. We compute the OF field of consecutive images and extract the latent OF representation in a self-encoding manner. A recurrent neural network then examines the OF changes, i.e., conducts sequential learning. The extracted sequential OF subspace is used to regress the 6-dimensional pose vector. We derive three models with different network structures and training schemes: LS-CNN-VO, LS-AE-VO, and LS-RCNN-VO. In particular, we train the encoder separately in an unsupervised manner; this avoids non-convergence when training the whole network and yields a more generalized and effective feature representation. Extensive experiments on the KITTI and Malaga datasets demonstrate that LS-RCNN-VO outperforms existing learning-based VO approaches.
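To make the pipeline concrete, here is a minimal PyTorch sketch of the three-stage idea described in the abstract: a latent-subspace encoder for the optical-flow field, a recurrent layer for sequential learning, and a head regressing the 6-dimensional pose. All layer sizes, names, and hyperparameters are illustrative assumptions, not the authors' LS-RCNN-VO implementation.

```python
import torch
import torch.nn as nn

class LatentFlowVO(nn.Module):
    """Latent-subspace + recurrent VO sketch (hypothetical sizes)."""
    def __init__(self, latent_dim=256, hidden_dim=512):
        super().__init__()
        # Encoder: compresses a 2-channel optical-flow field into a latent
        # vector. In the paper, the encoder is pretrained separately in an
        # unsupervised, self-encoding (autoencoder) manner.
        self.encoder = nn.Sequential(
            nn.Conv2d(2, 32, 7, stride=2, padding=3), nn.ReLU(),
            nn.Conv2d(32, 64, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(128, latent_dim),
        )
        # Recurrent part: models how the latent OF representation evolves
        # across consecutive frames (the sequential-learning step).
        self.rnn = nn.LSTM(latent_dim, hidden_dim, batch_first=True)
        # Regression head: 3 translation + 3 rotation parameters.
        self.pose_head = nn.Linear(hidden_dim, 6)

    def forward(self, flow_seq):
        # flow_seq: (batch, time, 2, H, W) stack of optical-flow fields.
        b, t = flow_seq.shape[:2]
        z = self.encoder(flow_seq.flatten(0, 1)).view(b, t, -1)
        h, _ = self.rnn(z)
        return self.pose_head(h)  # (batch, time, 6) relative poses

# Example: a batch of 4 sequences of 5 flow fields at 64x64 resolution.
poses = LatentFlowVO()(torch.randn(4, 5, 2, 64, 64))
print(poses.shape)  # torch.Size([4, 5, 6])
```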


2010 ◽  
Vol 5 (8) ◽  
pp. 386-386
Author(s):  
W. B. Thompson ◽  
B. J. Mohler ◽  
S. H. Creem-Regehr

2021 ◽  
Vol 118 (32) ◽  
pp. e2106235118
Author(s):  
Reuben Rideaux ◽  
Katherine R. Storrs ◽  
Guido Maiello ◽  
Andrew E. Welchman

Sitting in a static railway carriage can produce illusory self-motion if the train on an adjoining track moves off. While our visual system registers motion, vestibular signals indicate that we are stationary. The brain is faced with a difficult challenge: is there a single cause of sensations (I am moving) or two causes (I am static, another train is moving)? If a single cause, integrating signals produces a more precise estimate of self-motion, but if not, one cue should be ignored. In many cases, this process of causal inference works without error, but how does the brain achieve it? Electrophysiological recordings show that the macaque medial superior temporal area contains many neurons that encode combinations of vestibular and visual motion cues. Some respond best to vestibular and visual motion in the same direction (“congruent” neurons), while others prefer opposing directions (“opposite” neurons). Congruent neurons could underlie cue integration, but the function of opposite neurons remains a puzzle. Here, we seek to explain this computational arrangement by training a neural network model to solve causal inference for motion estimation. Like biological systems, the model develops congruent and opposite units and recapitulates known behavioral and neurophysiological observations. We show that all units (both congruent and opposite) contribute to motion estimation. Importantly, however, it is the balance between their activity that distinguishes whether visual and vestibular cues should be integrated or separated. This explains the computational purpose of puzzling neural representations and shows how a relatively simple feedforward network can solve causal inference.
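The causal-inference computation the network is trained to approximate can also be written down directly for a Gaussian ideal observer. The NumPy sketch below shows the integrate-versus-separate decision on a single trial; the generative model and all noise and prior parameters are illustrative assumptions, not values from the paper.

```python
import numpy as np

def gauss(x, mu, var):
    """Gaussian density N(x; mu, var)."""
    return np.exp(-0.5 * (x - mu) ** 2 / var) / np.sqrt(2 * np.pi * var)

def estimate_self_motion(x_vis, x_vest, var_vis=1.0, var_vest=2.0,
                         var_prior=25.0, p_common=0.5):
    """Model-averaged self-motion estimate from visual and vestibular cues."""
    # Likelihood of both measurements under one shared cause (the latent
    # self-motion is integrated out analytically; zero-mean Gaussian prior).
    denom = var_vis * var_vest + var_vis * var_prior + var_vest * var_prior
    lik_c1 = (np.exp(-0.5 * ((x_vis - x_vest) ** 2 * var_prior
                             + x_vis ** 2 * var_vest
                             + x_vest ** 2 * var_vis) / denom)
              / (2 * np.pi * np.sqrt(denom)))
    # Likelihood under two independent causes (e.g., the other train moves).
    lik_c2 = (gauss(x_vis, 0.0, var_vis + var_prior)
              * gauss(x_vest, 0.0, var_vest + var_prior))
    # Posterior probability that both cues share a single cause.
    post_c1 = lik_c1 * p_common / (lik_c1 * p_common + lik_c2 * (1 - p_common))
    # Integrated (reliability-weighted) estimate if there is one cause...
    s_c1 = ((x_vis / var_vis + x_vest / var_vest)
            / (1 / var_vis + 1 / var_vest + 1 / var_prior))
    # ...and the vestibular-only estimate if vision has another cause.
    s_c2 = (x_vest / var_vest) / (1 / var_vest + 1 / var_prior)
    return post_c1 * s_c1 + (1 - post_c1) * s_c2, post_c1

# Agreeing cues are integrated; conflicting cues discount vision.
print(estimate_self_motion(5.0, 5.0))  # high P(common cause)
print(estimate_self_motion(5.0, 0.0))  # low P(common cause), estimate near 0
```

In the paper, a feedforward network trained on this kind of task develops congruent and opposite units whose balance of activity implements the same integrate-versus-separate decision.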

