Angular Disparity Map: A Scalable Perceptual-Based Representation of Binocular Disparity

Author(s):  
Yu-Hsun Lin ◽  
Ja-Ling Wu


Author(s):  
Patrick Knöbelreiter ◽  
Thomas Pock

In this work, we propose a learning-based method to denoise and refine disparity maps. The proposed variational network arises naturally from unrolling the iterates of a proximal gradient method applied to a variational energy defined in a joint disparity, color, and confidence image space. Our method allows us to learn a robust collaborative regularizer that leverages the joint statistics of the color image, the confidence map, and the disparity map. Due to the variational structure of our method, the individual steps can be easily visualized, enabling interpretability. We can therefore provide interesting insights into how our method refines and denoises disparity maps. To this end, we visualize and interpret the learned filters and activation functions and demonstrate the increased reliability of the predicted pixel-wise confidence maps. Furthermore, the optimization-based structure of our refinement module allows us to compute eigen disparity maps, which reveal structural properties of the module. The efficiency of our method is demonstrated on the publicly available stereo benchmarks Middlebury 2014 and KITTI 2015.
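A minimal sketch of the idea of unrolling a proximal-gradient iteration over a joint disparity/color/confidence space, written in PyTorch. This is not the authors' code; the module name `RefinementStep`, the quadratic data term, and the filter/activation choices are illustrative assumptions.

```python
import torch
import torch.nn as nn

class RefinementStep(nn.Module):
    """One unrolled proximal-gradient step of a learned collaborative regularizer (sketch)."""
    def __init__(self, channels=5, filters=32, step_size=0.1):
        super().__init__()
        # learned analysis/synthesis filters acting on the joint (disparity, color, confidence) input;
        # Tanh stands in for the learned activation functions described in the abstract
        self.analysis = nn.Conv2d(channels, filters, 3, padding=1, bias=False)
        self.synthesis = nn.Conv2d(filters, 1, 3, padding=1, bias=False)
        self.act = nn.Tanh()
        self.step_size = step_size

    def forward(self, disparity, color, confidence, noisy_disparity):
        # gradient of a confidence-weighted quadratic data term ||d - d_noisy||^2 (assumed form)
        data_grad = confidence * (disparity - noisy_disparity)
        # gradient of the regularizer evaluated on the joint image space
        joint = torch.cat([disparity, color, confidence], dim=1)
        reg_grad = self.synthesis(self.act(self.analysis(joint)))
        # gradient-style update applied to the disparity only
        return disparity - self.step_size * (data_grad + reg_grad)

# usage: unroll a fixed number of steps; weights may be shared or untied per iteration
steps = nn.ModuleList([RefinementStep() for _ in range(7)])
```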


2021 ◽  
Vol 11 (12) ◽  
pp. 5383
Author(s):  
Huachen Gao ◽  
Xiaoyu Liu ◽  
Meixia Qu ◽  
Shijie Huang

In recent studies, self-supervised learning methods have been explored for monocular depth estimation. They minimize an image reconstruction loss rather than using depth as the supervised signal. However, existing methods usually assume that corresponding points in different views have the same color, which leads to unreliable unsupervised signals and ultimately degrades the reconstruction loss during training. Meanwhile, in low-texture regions the disparity of pixels cannot be predicted correctly because few features can be extracted. To address these issues, we propose a network, PDANet, that integrates perceptual consistency and data augmentation consistency, which are more reliable unsupervised signals, into a regular unsupervised depth estimation model. Specifically, we apply a reliable data augmentation mechanism that minimizes the difference between the disparity maps generated from the original image and the augmented image, which makes the prediction more robust to color fluctuations. At the same time, we aggregate the features of different layers extracted by a pre-trained VGG16 network to capture higher-level perceptual differences between the input image and the generated one. Ablation studies demonstrate the effectiveness of each component, and PDANet produces high-quality depth estimates on the KITTI benchmark, improving the absolute relative error of the state-of-the-art method from 0.114 to 0.084.
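A hedged sketch, not the published PDANet code, of the two extra self-supervised signals described above: a VGG16 perceptual loss aggregated over several feature layers and a data-augmentation consistency loss between disparities predicted from the original and the augmented image. The layer indices and loss form are illustrative assumptions.

```python
import torch
import torch.nn.functional as F
from torchvision.models import vgg16

_vgg = vgg16(weights="DEFAULT").features.eval()
for p in _vgg.parameters():
    p.requires_grad = False
_LAYER_IDS = {3, 8, 15, 22}  # relu1_2, relu2_2, relu3_3, relu4_3 (assumed choice of layers)

def perceptual_loss(img, recon):
    """Aggregate L1 distance between VGG16 features of the target and the reconstructed image."""
    loss, x, y = 0.0, img, recon
    for i, layer in enumerate(_vgg):
        x, y = layer(x), layer(y)
        if i in _LAYER_IDS:
            loss = loss + F.l1_loss(x, y)
    return loss

def augmentation_consistency_loss(disp_original, disp_augmented):
    """Penalize differences between disparities predicted from the original and the augmented view."""
    return F.l1_loss(disp_original, disp_augmented)
```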


Sensors ◽  
2021 ◽  
Vol 21 (4) ◽  
pp. 1430
Author(s):  
Xiaogang Jia ◽  
Wei Chen ◽  
Zhengfa Liang ◽  
Xin Luo ◽  
Mingfei Wu ◽  
...  

Stereo matching is an important research field of computer vision. Because of the dimensionality of cost aggregation, current neural-network-based stereo methods struggle to trade off speed against accuracy. To this end, we integrate fast 2D stereo methods with accurate 3D networks to improve performance and reduce running time. We leverage a 2D encoder-decoder network to generate a rough disparity map and construct a disparity range to guide the 3D aggregation network, which significantly improves accuracy and reduces computational cost. We use a stacked hourglass structure to refine the disparity from coarse to fine. We evaluated our method on three public datasets. According to the official KITTI leaderboard, our network generates an accurate result in 80 ms on a modern GPU. Compared to other 2D stereo networks (AANet, DeepPruner, FADNet, etc.), our network achieves a substantial improvement in accuracy. Meanwhile, it is significantly faster than other 3D stereo networks (5× faster than PSMNet, 7.5× faster than CSN, and 22.5× faster than GANet), demonstrating the effectiveness of our method.
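A minimal sketch, assuming PyTorch, of how a coarse disparity map from a fast 2D network can restrict the search range of a 3D cost-aggregation network. The function name and window radius are illustrative; the authors' released code may construct the guided volume differently.

```python
import torch

def guided_disparity_hypotheses(coarse_disp, radius=4):
    """Build per-pixel disparity hypotheses in [coarse - radius, coarse + radius].

    coarse_disp: (B, 1, H, W) disparity from the fast 2D encoder-decoder
    returns:     (B, 2*radius + 1, H, W) candidate disparities for the 3D aggregation network
    """
    offsets = torch.arange(-radius, radius + 1, device=coarse_disp.device,
                           dtype=coarse_disp.dtype).view(1, -1, 1, 1)
    return (coarse_disp + offsets).clamp(min=0)

# usage: the stacked-hourglass 3D network then aggregates matching costs over only
# 2*radius + 1 hypotheses per pixel instead of the full disparity range (e.g., 192 levels)
hyp = guided_disparity_hypotheses(torch.rand(1, 1, 64, 128) * 64, radius=4)
print(hyp.shape)  # torch.Size([1, 9, 64, 128])
```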


2015 ◽  
Author(s):  
Bo Yu ◽  
Hideki Kakeya

2021 ◽  
Author(s):  
Ryan Edward O'Donnell ◽  
Kyrie Murawski ◽  
Ella Herrmann ◽  
Jesse Wisch ◽  
Garrett D. Sullivan ◽  
...  

There have been conflicting findings on the degree to which exogenous/reflexive visual attention is selective for depth, and this issue has important implications for models of attention. Previous work has looked for depth-based cueing effects on such attention using reaction time measures for stimuli presented through stereo goggles on a display screen. Results from such approaches have been mixed, depending on whether target/distractor discrimination was required. To help clarify whether such depth effects exist, we developed a paradigm that measures accuracy rather than reaction time in an immersive virtual-reality environment, providing a more appropriate context for depth. Four modified Posner cueing paradigms were run to test for depth-specific attentional selectivity. Participants fixated a cross while attempting to identify a rapidly masked letter that was preceded by a cue that could be valid in depth and side, depth only, or side only. In Experiment 1, a potent cueing effect was found for side validity and a weak effect for depth. Experiment 2 controlled for differences in cue and target sizes when presented at different depths, which caused the depth validity effect to disappear entirely, even though participants were explicitly asked to report depth and the difference in virtual depth was extreme (20 vs. 300 meters). Experiments 3a and 3b brought the front depth plane even closer (1 m) to maximize the effects of binocular disparity, but no reliable depth cueing validity effect was observed. Thus, it seems that rapid/exogenous attention pancakes 3-dimensional space into a 2-dimensional reference frame.
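An illustrative sketch, not the authors' analysis scripts, of the accuracy-based measure: trials are grouped by whether the cue was valid on the side dimension, the depth dimension, or both, and identification accuracy is compared across conditions. The column names and data are assumptions for illustration only.

```python
import pandas as pd

# toy trial records: 1 = cue valid on that dimension, correct = masked letter identified
trials = pd.DataFrame({
    "cue_side_valid":  [1, 1, 0, 0, 1, 0],
    "cue_depth_valid": [1, 0, 1, 0, 1, 1],
    "correct":         [1, 1, 0, 0, 1, 1],
})

# accuracy per cue-validity condition; a depth-based cueing effect would appear as higher
# accuracy when cue_depth_valid == 1, holding side validity constant
accuracy = trials.groupby(["cue_side_valid", "cue_depth_valid"])["correct"].mean()
print(accuracy)
```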

