scholarly journals Learning local feature aggregation functions with backpropagation

Author(s):  
Angelos Katharopoulos ◽  
Despoina Paschalidou ◽  
Christos Diou ◽  
Anastasios Delopoulos
2021 ◽  
Vol 16 (3) ◽  
Author(s):  
Hao Wang ◽  
Liyan Dong ◽  
Minghui Sun

2021 ◽  
Vol 94 ◽  
pp. 107337
Author(s):  
Guang Zeng ◽  
Haisheng Li ◽  
Xiaochuan Wang ◽  
Nan Li

2021 ◽  
Vol 13 (17) ◽  
pp. 3427
Author(s):  
Chunjiao Zhang ◽  
Shenghua Xu ◽  
Tao Jiang ◽  
Jiping Liu ◽  
Zhengjun Liu ◽  
...  

LiDAR point clouds are rich in spatial information and can effectively express the size, shape, position, and direction of objects; thus, they have the advantage of high spatial utilization. The point cloud focuses on describing the shape of the external surface of the object itself and will not store useless redundant information to describe the occupation. Therefore, point clouds have become the research focus of 3D data models and are widely used in large-scale scene reconstruction, virtual reality, digital elevation model production, and other fields. Since point clouds have various characteristics, such as disorder, density inconsistency, unstructuredness, and incomplete information, point cloud classification is still complex and challenging. To realize the semantic classification of LiDAR point clouds in complex scenarios, this paper proposes the integration of normal vector features into an atrous convolution residual network. Based on the RandLA-Net network structure, the proposed network integrates the atrous convolution into the residual module to extract global and local features of the point clouds. The atrous convolution can learn more valuable point cloud feature information by expanding the receptive field. Then, the point cloud normal vector is embedded in the local feature aggregation module of the RandLA-Net network to extract local semantic aggregation features. The improved local feature aggregation module can merge the deep features of the point cloud and mine the fine-grained information of the point cloud to improve the model’s segmentation ability in complex scenes. Finally, to resolve the imbalance of the distribution of the various categories of point clouds, the original loss function is optimized by adopting a reweighted method to prevent overfitting so that the network can focus on small target categories in the training process to effectively improve the classification performance. Through the experimental analysis of a Vaihingen (Germany) urban 3D semantic dataset from the ISPRS website, it is verified that the proposed algorithm has a strong generalization ability. The overall accuracy (OA) of the proposed algorithm on the Vaihingen urban 3D semantic dataset reached 97.9%, and the average reached 96.1%. Experiments show that the proposed algorithm fully exploits the semantic features of point clouds and effectively improves the accuracy of point cloud classification.


Author(s):  
Xingwei Li ◽  
Shaojie Guan ◽  
Xinlong Li ◽  
Jiating Jin ◽  
Jiazhe Zhang

2020 ◽  
Vol 381 ◽  
pp. 336-347 ◽  
Author(s):  
Xingxing Zhang ◽  
Zhenfeng Zhu ◽  
Yao Zhao ◽  
Yawei Zhao

2021 ◽  
Vol 13 (24) ◽  
pp. 5039
Author(s):  
Dong Chen ◽  
Guiqiu Xiang ◽  
Jiju Peethambaran ◽  
Liqiang Zhang ◽  
Jing Li ◽  
...  

In this paper, we propose a deep learning framework, namely AFGL-Net to achieve building façade parsing, i.e., obtaining the semantics of small components of building façade, such as windows and doors. To this end, we present an autoencoder embedding position and direction encoding for local feature encoding. The autoencoder enhances the local feature aggregation and augments the representation of skeleton features of windows and doors. We also integrate the Transformer into AFGL-Net to infer the geometric shapes and structural arrangements of façade components and capture the global contextual features. These global features can help recognize inapparent windows/doors from the façade points corrupted with noise, outliers, occlusions, and irregularities. The attention-based feature fusion mechanism is finally employed to obtain more informative features by simultaneously considering local geometric details and the global contexts. The proposed AFGL-Net is comprehensively evaluated on Dublin and RueMonge2014 benchmarks, achieving 67.02% and 59.80% mIoU, respectively. We also demonstrate the superiority of the proposed AFGL-Net by comparing with the state-of-the-art methods and various ablation studies.


Sign in / Sign up

Export Citation Format

Share Document