Refinement of matching costs for stereo disparities using recurrent neural networks

AbstractDepth is essential information for autonomous robotics applications that need environmental depth values. The depth could be acquired by finding the matching pixels between stereo image pairs. Depth information is an inference from a matching cost volume that is composed of the distances between the possible pixel points on the pre-aligned horizontal axis of stereo images. Most approaches use matching costs to identify matches between stereo images and obtain depth information. Recently, researchers have been using convolutional neural network-based solutions to handle this matching problem. In this paper, a novel method has been proposed for the refinement of matching costs by using recurrent neural networks. Our motivation is to enhance the depth values obtained from matching costs. For this purpose, to attain an enhanced disparity map by utilizing the sequential information of matching costs in the horizontal space, recurrent neural networks are used. Exploiting this sequential information, we aimed to determine the position of the correct matching point by using recurrent neural networks, as in the case of speech processing problems. We used existing stereo algorithms to obtain the initial matching costs and then improved the results by utilizing recurrent neural networks. The results are evaluated on the KITTI 2012 and KITTI 2015 datasets. The results show that the matching cost three-pixel error is decreased by an average of 14.5% in both datasets.

Download Full-text

Disparity Map Generation from Illumination Variant Stereo Images Using Efficient Hierarchical Dynamic Programming

The Scientific World JOURNAL ◽

10.1155/2014/513417 ◽

2014 ◽

Vol 2014 ◽

pp. 1-12

Author(s):

Viral H. Borisagar ◽

Mukesh A. Zaveri

Keyword(s):

Dynamic Programming ◽

Stereo Matching ◽

Stereo Image ◽

Stereo Pair ◽

Image Pair ◽

Stereo Images ◽

Disparity Map ◽

Illumination Variation ◽

Matching Problem ◽

Wide Range

A novel hierarchical stereo matching algorithm is presented which gives disparity map as output from illumination variant stereo pair. Illumination difference between two stereo images can lead to undesirable output. Stereo image pair often experience illumination variations due to many factors like real and practical situation, spatially and temporally separated camera positions, environmental illumination fluctuation, and the change in the strength or position of the light sources. Window matching and dynamic programming techniques are employed for disparity map estimation. Good quality disparity map is obtained with the optimized path. Homomorphic filtering is used as a preprocessing step to lessen illumination variation between the stereo images. Anisotropic diffusion is used to refine disparity map to give high quality disparity map as a final output. The robust performance of the proposed approach is suitable for real life circumstances where there will be always illumination variation between the images. The matching is carried out in a sequence of images representing the same scene, however in different resolutions. The hierarchical approach adopted decreases the computation time of the stereo matching problem. This algorithm can be helpful in applications like robot navigation, extraction of information from aerial surveys, 3D scene reconstruction, and military and security applications. Similarity measure SAD is often sensitive to illumination variation. It produces unacceptable disparity map results for illumination variant left and right images. Experimental results show that our proposed algorithm produces quality disparity maps for both wide range of illumination variant and invariant stereo image pair.

Download Full-text

Spike timing-dependent plasticity in sparse recurrent neural networks

IEICE Proceeding Series ◽

10.15248/proc.1.485 ◽

2014 ◽

Vol 1 ◽

pp. 485-488

Author(s):

Hideyuki Kato ◽

Tohru Ikeguchi

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Spike Timing ◽

Spike Timing Dependent Plasticity ◽

Dependent Plasticity

Download Full-text

Direct Adaptive Control of Process Systems Using Recurrent Neural Networks

1992 American Control Conference ◽

10.23919/acc.1992.4792020 ◽

1992 ◽

Author(s):

Sanjay Parthasarathy ◽

Alexander G. Parlos ◽

Amir F. Atiya

Keyword(s):

Neural Networks ◽

Adaptive Control ◽

Recurrent Neural Networks ◽

Process Systems ◽

Direct Adaptive Control

Download Full-text

L2 approximation properties of recurrent neural networks

1997 European Control Conference (ECC) ◽

10.23919/ecc.1997.7082360 ◽

1997 ◽

Cited By ~ 1

Author(s):

A. Ruiz ◽

D.H. Owens ◽

S. Townley

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Approximation Properties

Download Full-text

Levenshtein Augmentation Improves Performance of SMILES Based Deep-Learning Synthesis Prediction

10.26434/chemrxiv.12562121 ◽

2020 ◽

Author(s):

Dean Sumner ◽

Jiazhen He ◽

Amol Thakkar ◽

Ola Engkvist ◽

Esben Jannik Bjerrum

Keyword(s):

Neural Networks ◽

Pattern Recognition ◽

Deep Learning ◽

Recurrent Neural Networks ◽

Data Augmentation ◽

State Of The Art ◽

Sequence Similarity ◽

Learning Models ◽

Underlying Network

<p>SMILES randomization, a form of data augmentation, has previously been shown to increase the performance of deep learning models compared to non-augmented baselines. Here, we propose a novel data augmentation method we call “Levenshtein augmentation” which considers local SMILES sub-sequence similarity between reactants and their respective products when creating training pairs. The performance of Levenshtein augmentation was tested using two state of the art models - transformer and sequence-to-sequence based recurrent neural networks with attention. Levenshtein augmentation demonstrated an increase performance over non-augmented, and conventionally SMILES randomization augmented data when used for training of baseline models. Furthermore, Levenshtein augmentation seemingly results in what we define as <i>attentional gain </i>– an enhancement in the pattern recognition capabilities of the underlying network to molecular motifs.</p>

Download Full-text