Area-based correlation and non-local attention network for stereo matching

NLCA-Net: a non-local context attention network for stereo matching

APSIPA Transactions on Signal and Information Processing ◽

10.1017/atsip.2020.16 ◽

2020 ◽

Vol 9 ◽

Cited By ~ 1

Author(s):

Zhibo Rao ◽

Mingyi He ◽

Yuchao Dai ◽

Zhidong Zhu ◽

Bo Li ◽

...

Keyword(s):

Stereo Matching ◽

Hot Spot ◽

Contextual Information ◽

Feature Learning ◽

Superior Performance ◽

Local Context ◽

Disparity Map ◽

Matching Rule ◽

Attention Network ◽

Non Local

Accurate disparity prediction is a hot spot in computer vision, and how to efficiently exploit contextual information is the key to improve the performance. In this paper, we propose a simple yet effective non-local context attention network to exploit the global context information by using attention mechanisms and semantic information for stereo matching. First, we develop a 2D geometry feature learning module to get a more discriminative representation by taking advantage of multi-scale features and form them into the variance-based cost volume. Then, we construct a non-local attention matching module by using the non-local block and hierarchical 3D convolutions, which can effectively regularize the cost volume and capture the global contextual information. Finally, we adopt a geometry refinement module to refine the disparity map to further improve the performance. Moreover, we add the warping loss function to help the model learn the matching rule of the non-occluded region. Our experiments show that (1) our approach achieves competitive results on KITTI and SceneFlow datasets in the end-point error and the fraction of erroneous pixels $({D_1})$ ; (2) our proposed method particularly has superior performance in the reflective regions and occluded areas.

Download Full-text

D-MONA: A dilated mixed-order non-local attention network for speaker and language recognition

Neural Networks ◽

10.1016/j.neunet.2021.03.014 ◽

2021 ◽

Author(s):

Xiaoxiao Miao ◽

Ian McLoughlin ◽

Wenchao Wang ◽

Pengyuan Zhang

Keyword(s):

Language Recognition ◽

Attention Network ◽

Mixed Order ◽

Non Local

Download Full-text

An Efficient Non-local Attention Network for Video-based Person Re-identification

Proceedings of the 2019 7th International Conference on Information Technology: IoT and Smart City ◽

10.1145/3377170.3377253 ◽

2019 ◽

Author(s):

Zhen Wang ◽

Shixian Luo ◽

He Sun ◽

Huadong Pan ◽

Jun Yin

Keyword(s):

Attention Network ◽

Non Local

Download Full-text

A fast non-local disparity refinement method for stereo matching

2014 IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip.2014.7025776 ◽

2014 ◽

Cited By ~ 3

Author(s):

Xiaoming Huang ◽

Guoqin Cui ◽

Yundong Zhang

Keyword(s):

Stereo Matching ◽

Refinement Method ◽

Non Local

Download Full-text

Non-Local Stereo Matching Algorithm Based on Color and Edge Information

Laser & Optoelectronics Progress ◽

10.3788/lop57.101020 ◽

2020 ◽

Vol 57 (10) ◽

pp. 101020

Author(s):

马晴晴 Ma Qingqing ◽

王彩芳 Wang Caifang

Keyword(s):

Stereo Matching ◽

Edge Information ◽

Matching Algorithm ◽

Non Local

Download Full-text

Multi-Dimensional Residual Dense Attention Network for Stereo Matching

IEEE Access ◽

10.1109/access.2019.2911618 ◽

2019 ◽

Vol 7 ◽

pp. 51681-51690 ◽

Cited By ~ 6

Author(s):

Guanghui Zhang ◽

Dongchen Zhu ◽

Wenjun Shi ◽

Xiaoqing Ye ◽

Jiamao Li ◽

...

Keyword(s):

Stereo Matching ◽

Attention Network

Download Full-text

HDA-Net: Horizontal Deformable Attention Network for Stereo Matching

10.1145/3474085.3475273 ◽

2021 ◽

Author(s):

Qi Zhang ◽

Xuesong Zhang ◽

Baoping Li ◽

Yuzhong Chen ◽

Anlong Ming

Keyword(s):

Stereo Matching ◽

Attention Network

Download Full-text

A fast non-local based stereo matching algorithm using graph cuts

2014 9th International Conference on Computer Engineering & Systems (ICCES) ◽

10.1109/icces.2014.7030943 ◽

2014 ◽

Cited By ~ 5

Author(s):

Doaa A. Altantawy ◽

Marwa Obbaya ◽

Sherif Kishk

Keyword(s):

Stereo Matching ◽

Graph Cuts ◽

Matching Algorithm ◽

Non Local

Download Full-text

DISPARITY REFINEMENT OF BUILDING EDGES USING ROBUSTLY MATCHED STRAIGHT LINES FOR STEREO MATCHING

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-1-77-2018 ◽

2018 ◽

Vol IV-1 ◽

pp. 77-84

Author(s):

X. Huang ◽

R. Qin ◽

M. Chen

Keyword(s):

Least Squares ◽

Stereo Matching ◽

Low Cost ◽

Point Clouds ◽

3D Point Clouds ◽

Dense Matching ◽

Non Local ◽

Epipolar Constraints ◽

High Flexibility ◽

Straight Lines

<p><strong>Abstract.</strong> Stereo dense matching has already been one of the dominant tools in 3D reconstruction of urban regions, due to its low cost and high flexibility in generating 3D points. However, the image-derived 3D points are often inaccurate around building edges, which limit its use in several vision tasks (e.g. building modelling). To generate 3D point clouds or digital surface models (DSM) with sharp boundaries, this paper integrates robustly matched lines for improving dense matching, and proposes a non-local disparity refinement of building edges through an iterative least squares plane adjustment approach. In our method, we first extract and match straight lines in images using epipolar constraints, then detect building edges from these straight lines by comparing matching results on both sides of straight lines, and finally we develop a non-local disparity refinement method through an iterative least squares plane adjustment constrained by matched straight lines to yield sharper and more accurate edges. Experiments conducted on both satellite and aerial data demonstrate that our proposed method is able to generate more accurate DSM with sharper object boundaries.</p>

Download Full-text

Multi-Scale Dense Attention Network for Stereo Matching

Electronics ◽

10.3390/electronics9111881 ◽

2020 ◽

Vol 9 (11) ◽

pp. 1881

Author(s):

Yuhui Chang ◽

Jiangtao Xu ◽

Zhiyuan Gao

Keyword(s):

Feature Extraction ◽

Stereo Matching ◽

State Of The Art ◽

Ground Truth ◽

Context Information ◽

Context Aware ◽

Feature Maps ◽

Attention Network ◽

Multi Scale ◽

Benchmark Datasets

To improve the accuracy of stereo matching, the multi-scale dense attention network (MDA-Net) is proposed. The network introduces two novel modules in the feature extraction stage to achieve better exploit of context information: dual-path upsampling (DU) block and attention-guided context-aware pyramid feature extraction (ACPFE) block. The DU block is introduced to fuse different scale feature maps. It introduces sub-pixel convolution to compensate for the loss of information caused by the traditional interpolation upsampling method. The ACPFE block is proposed to extract multi-scale context information. Pyramid atrous convolution is adopted to exploit multi-scale features and the channel-attention is used to fuse the multi-scale features. The proposed network has been evaluated on several benchmark datasets. The three-pixel-error evaluated over all ground truth pixels is 2.10% on KITTI 2015 dataset. The experiment results prove that MDA-Net achieves state-of-the-art accuracy on KITTI 2012 and 2015 datasets.

Download Full-text