Area-based correlation and non-local attention network for stereo matching

Author(s):  
Xing Li ◽  
Yangyu Fan ◽  
Guoyun Lv ◽  
Haoyue Ma
Author(s):  
Zhibo Rao ◽  
Mingyi He ◽  
Yuchao Dai ◽  
Zhidong Zhu ◽  
Bo Li ◽  
...  

Accurate disparity prediction is a hot spot in computer vision, and how to efficiently exploit contextual information is the key to improve the performance. In this paper, we propose a simple yet effective non-local context attention network to exploit the global context information by using attention mechanisms and semantic information for stereo matching. First, we develop a 2D geometry feature learning module to get a more discriminative representation by taking advantage of multi-scale features and form them into the variance-based cost volume. Then, we construct a non-local attention matching module by using the non-local block and hierarchical 3D convolutions, which can effectively regularize the cost volume and capture the global contextual information. Finally, we adopt a geometry refinement module to refine the disparity map to further improve the performance. Moreover, we add the warping loss function to help the model learn the matching rule of the non-occluded region. Our experiments show that (1) our approach achieves competitive results on KITTI and SceneFlow datasets in the end-point error and the fraction of erroneous pixels $({D_1})$ ; (2) our proposed method particularly has superior performance in the reflective regions and occluded areas.


2020 ◽  
Vol 57 (10) ◽  
pp. 101020
Author(s):  
马晴晴 Ma Qingqing ◽  
王彩芳 Wang Caifang

IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 51681-51690 ◽  
Author(s):  
Guanghui Zhang ◽  
Dongchen Zhu ◽  
Wenjun Shi ◽  
Xiaoqing Ye ◽  
Jiamao Li ◽  
...  

2021 ◽  
Author(s):  
Qi Zhang ◽  
Xuesong Zhang ◽  
Baoping Li ◽  
Yuzhong Chen ◽  
Anlong Ming

Author(s):  
X. Huang ◽  
R. Qin ◽  
M. Chen

<p><strong>Abstract.</strong> Stereo dense matching has already been one of the dominant tools in 3D reconstruction of urban regions, due to its low cost and high flexibility in generating 3D points. However, the image-derived 3D points are often inaccurate around building edges, which limit its use in several vision tasks (e.g. building modelling). To generate 3D point clouds or digital surface models (DSM) with sharp boundaries, this paper integrates robustly matched lines for improving dense matching, and proposes a non-local disparity refinement of building edges through an iterative least squares plane adjustment approach. In our method, we first extract and match straight lines in images using epipolar constraints, then detect building edges from these straight lines by comparing matching results on both sides of straight lines, and finally we develop a non-local disparity refinement method through an iterative least squares plane adjustment constrained by matched straight lines to yield sharper and more accurate edges. Experiments conducted on both satellite and aerial data demonstrate that our proposed method is able to generate more accurate DSM with sharper object boundaries.</p>


Electronics ◽  
2020 ◽  
Vol 9 (11) ◽  
pp. 1881
Author(s):  
Yuhui Chang ◽  
Jiangtao Xu ◽  
Zhiyuan Gao

To improve the accuracy of stereo matching, the multi-scale dense attention network (MDA-Net) is proposed. The network introduces two novel modules in the feature extraction stage to achieve better exploit of context information: dual-path upsampling (DU) block and attention-guided context-aware pyramid feature extraction (ACPFE) block. The DU block is introduced to fuse different scale feature maps. It introduces sub-pixel convolution to compensate for the loss of information caused by the traditional interpolation upsampling method. The ACPFE block is proposed to extract multi-scale context information. Pyramid atrous convolution is adopted to exploit multi-scale features and the channel-attention is used to fuse the multi-scale features. The proposed network has been evaluated on several benchmark datasets. The three-pixel-error evaluated over all ground truth pixels is 2.10% on KITTI 2015 dataset. The experiment results prove that MDA-Net achieves state-of-the-art accuracy on KITTI 2012 and 2015 datasets.


Sign in / Sign up

Export Citation Format

Share Document