PCT: Point cloud transformer

Meng-Hao Guo; Jun-Xiong Cai; Zheng-Ning Liu; Tai-Jiang Mu; Ralph R. Martin; Shi-Min Hu

doi:10.1007/s41095-021-0229-5

PCT: Point cloud transformer

Computational Visual Media ◽

10.1007/s41095-021-0229-5 ◽

2021 ◽

Vol 7 (2) ◽

pp. 187-199

Author(s):

Meng-Hao Guo ◽

Jun-Xiong Cai ◽

Zheng-Ning Liu ◽

Tai-Jiang Mu ◽

Ralph R. Martin ◽

...

Keyword(s):

Language Processing ◽

Point Cloud ◽

Nearest Neighbor ◽

Semantic Segmentation ◽

Nearest Neighbor Search ◽

Local Context ◽

Irregular Domain ◽

Cloud Processing ◽

Neighbor Search ◽

Farthest Point

AbstractThe irregular domain and lack of ordering make it challenging to design deep neural networks for point cloud processing. This paper presents a novel framework named Point Cloud Transformer (PCT) for point cloud learning. PCT is based on Transformer, which achieves huge success in natural language processing and displays great potential in image processing. It is inherently permutation invariant for processing a sequence of points, making it well-suited for point cloud learning. To better capture local context within the point cloud, we enhance input embedding with the support of farthest point sampling and nearest neighbor search. Extensive experiments demonstrate that the PCT achieves the state-of-the-art performance on shape classification, part segmentation, semantic segmentation, and normal estimation tasks.

Download Full-text

A Hybrid Spatial Indexing Structure of Massive Point Cloud Based on Octree and 3D R*-Tree

Applied Sciences ◽

10.3390/app11209581 ◽

2021 ◽

Vol 11 (20) ◽

pp. 9581

Author(s):

Wei Wang ◽

Yi Zhang ◽

Genyu Ge ◽

Qin Jiang ◽

Yang Wang ◽

...

Keyword(s):

Point Cloud ◽

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Index Structure ◽

Spatial Indexing ◽

Cloud Data ◽

Neighbor Search ◽

Nearest Neighbor Searching ◽

Indexing Structure ◽

Balanced Tree

The spatial index structure is one of the most important research topics for organizing and managing massive 3D Point Cloud. As a point in Point Cloud consists of Cartesian coordinates (x,y,z), the common method to explore geometric information and features is nearest neighbor searching. An efficient spatial indexing structure directly affects the speed of the nearest neighbor search. Octree and kd-tree are the most used for Point Cloud data. However, Octree or KD-tree do not perform best in nearest neighbor searching. A highly balanced tree, 3D R*-tree is considered the most effective method so far. So, a hybrid spatial indexing structure is proposed based on Octree and 3D R*-tree. In this paper, we discussed how thresholds influence the performance of nearest neighbor searching and constructing the tree. Finally, an adaptive way method adopted to set thresholds. Furthermore, we obtained a better performance in tree construction and nearest neighbor searching than Octree and 3D R*-tree.

Download Full-text

The Earth Mover’s Distance as a Metric for the Space of Inorganic Compositions

10.26434/chemrxiv.12777566.v1 ◽

2020 ◽

Author(s):

Cameron Hargreaves ◽

Matthew Dyer ◽

Michael Gaultois ◽

Vitaliy Kurlin ◽

Matthew J Rosseinsky

Keyword(s):

Euclidean Distance ◽

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Inorganic Crystal Structure Database ◽

Earth Mover’S Distance ◽

Chemical Similarity ◽

Earth Mover's Distance ◽

Neighbor Search ◽

The Earth ◽

Binary Compounds

It is a core problem in any field to reliably tell how close two objects are to being the same, and once this relation has been established we can use this information to precisely quantify potential relationships, both analytically and with machine learning (ML). For inorganic solids, the chemical composition is a fundamental descriptor, which can be represented by assigning the ratio of each element in the material to a vector. These vectors are a convenient mathematical data structure for measuring similarity, but unfortunately, the standard metric (the Euclidean distance) gives little to no variance in the resultant distances between chemically dissimilar compositions. We present the Earth Mover’s Distance (EMD) for inorganic compositions, a well-defined metric which enables the measure of chemical similarity in an explainable fashion. We compute the EMD between two compositions from the ratio of each of the elements and the absolute distance between the elements on the modified Pettifor scale. This simple metric shows clear strength at distinguishing compounds and is efficient to compute in practice. The resultant distances have greater alignment with chemical understanding than the Euclidean distance, which is demonstrated on the binary compositions of the Inorganic Crystal Structure Database (ICSD). The EMD is a reliable numeric measure of chemical similarity that can be incorporated into automated workflows for a range of ML techniques. We have found that with no supervision the use of this metric gives a distinct partitioning of binary compounds into clear trends and families of chemical property, with future applications for nearest neighbor search queries in chemical database retrieval systems and supervised ML techniques.

Download Full-text

Adaptive bit allocation hashing for approximate nearest neighbor search

Neurocomputing ◽

10.1016/j.neucom.2014.10.042 ◽

2015 ◽

Vol 151 ◽

pp. 719-728 ◽

Cited By ~ 4

Author(s):

Qin-Zhen Guo ◽

Zhi Zeng ◽

Shuwu Zhang

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Bit Allocation ◽

Approximate Nearest Neighbor Search ◽

Approximate Nearest Neighbor ◽

Neighbor Search

Download Full-text

Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination

Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data ◽

10.1145/3318464.3380600 ◽

2020 ◽

Author(s):

Conglong Li ◽

Minjia Zhang ◽

David G. Andersen ◽

Yuxiong He

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Early Termination ◽

Approximate Nearest Neighbor Search ◽

Approximate Nearest Neighbor ◽

Neighbor Search

Download Full-text

A Fast k-Nearest Neighbor Search Using Query-Specific Signature Selection

Proceedings of the 24th ACM International on Conference on Information and Knowledge Management - CIKM '15 ◽

10.1145/2806416.2806632 ◽

2015 ◽

Cited By ~ 1

Author(s):

Youngki Park ◽

Heasoo Hwang ◽

Sang-goo Lee

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

K Nearest Neighbor ◽

Neighbor Search ◽

K Nearest Neighbor Search

Download Full-text

The role of local dimensionality measures in benchmarking nearest neighbor search

Information Systems ◽

10.1016/j.is.2021.101807 ◽

2021 ◽

pp. 101807

Author(s):

Martin Aumüller ◽

Matteo Ceccarello

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Neighbor Search

Download Full-text

Authenticated Multistep Nearest Neighbor Search

IEEE Transactions on Knowledge and Data Engineering ◽

10.1109/tkde.2010.157 ◽

2011 ◽

Vol 23 (5) ◽

pp. 641-654 ◽

Cited By ~ 7

Author(s):

Stavros Papadopoulos ◽

Lixing Wang ◽

Yin Yang ◽

Dimitris Papadias ◽

Panagiotis Karras

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Neighbor Search

Download Full-text

Road Short-Term Travel Time Prediction Method Based on Flow Spatial Distribution and the Relations

Mathematical Problems in Engineering ◽

10.1155/2016/7626875 ◽

2016 ◽

Vol 2016 ◽

pp. 1-14 ◽

Cited By ~ 1

Author(s):

Mingjun Deng ◽

Shiru Qu

Keyword(s):

Time Series ◽

Spatial Distribution ◽

Travel Time ◽

Nonparametric Regression ◽

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Short Term ◽

Combination Model ◽

The Road ◽

Neighbor Search

There are many short-term road travel time forecasting studies based on time series, but indeed, road travel time not only relies on the historical travel time series, but also depends on the road and its adjacent sections history flow. However, few studies have considered that. This paper is based on the correlation of flow spatial distribution and the road travel time series, applying nearest neighbor and nonparametric regression method to build a forecasting model. In aspect of spatial nearest neighbor search, three different space distances are defined. In addition, two forecasting functions are introduced: one combines the forecasting value by mean weight and the other uses the reciprocal of nearest neighbors distance as combined weight. Three different distances are applied in nearest neighbor search, which apply to the two forecasting functions. For travel time series, the nearest neighbor and nonparametric regression are applied too. Then minimizing forecast error variance is utilized as an objective to establish the combination model. The empirical results show that the combination model can improve the forecast performance obviously. Besides, the experimental results of the evaluation for the computational complexity show that the proposed method can satisfy the real-time requirement.

Download Full-text

KVGCN: A KNN Searching and VLAD Combined Graph Convolutional Network for Point Cloud Segmentation

Remote Sensing ◽

10.3390/rs13051003 ◽

2021 ◽

Vol 13 (5) ◽

pp. 1003

Author(s):

Nan Luo ◽

Hongquan Yu ◽

Zhenfeng Huo ◽

Jinhui Liu ◽

Quan Wang ◽

...

Keyword(s):

Point Cloud ◽

Nearest Neighbor ◽

Semantic Segmentation ◽

K Nearest Neighbor ◽

Topological Graph ◽

Convolutional Network ◽

Cloud Data ◽

Nearest Neighbor Searching ◽

Point Cloud Segmentation ◽

Local Feature Extraction

Semantic segmentation of the sensed point cloud data plays a significant role in scene understanding and reconstruction, robot navigation, etc. This work presents a Graph Convolutional Network integrating K-Nearest Neighbor searching (KNN) and Vector of Locally Aggregated Descriptors (VLAD). KNN searching is utilized to construct the topological graph of each point and its neighbors. Then, we perform convolution on the edges of constructed graph to extract representative local features by multiple Multilayer Perceptions (MLPs). Afterwards, a trainable VLAD layer, NetVLAD, is embedded in the feature encoder to aggregate the local and global contextual features. The designed feature encoder is repeated for multiple times, and the extracted features are concatenated in a jump-connection style to strengthen the distinctiveness of features and thereby improve the segmentation. Experimental results on two datasets show that the proposed work settles the shortcoming of insufficient local feature extraction and promotes the accuracy (mIoU 60.9% and oAcc 87.4% for S3DIS) of semantic segmentation comparing to existing models.

Download Full-text

Two-Pass K Nearest Neighbor Search for Feature Tracking

IEEE Access ◽

10.1109/access.2018.2879337 ◽

2018 ◽

Vol 6 ◽

pp. 72939-72951

Author(s):

Mingwei Cao ◽

Wei Jia ◽

Zhihan Lv ◽

Wenjun Xie ◽

Liping Zheng ◽

...

Keyword(s):

Feature Tracking ◽

Nearest Neighbor ◽

Nearest Neighbor Search ◽

K Nearest Neighbor ◽

Neighbor Search ◽

K Nearest Neighbor Search

Download Full-text