An Efficient Exact Nearest Neighbor Search by Compounded Embedding

Author(s):  
Mingjie Li ◽  
Ying Zhang ◽  
Yifang Sun ◽  
Wei Wang ◽  
Ivor W. Tsang ◽  
...  
Author(s):  
Bilegsaikhan Naidan ◽  
Magnus Lie Hetland

This article presents a new approximate index structure, the Bregman hyperplane tree, for indexing data under Bregman divergences, aiming to decrease the number of distance computations required at query time by sacrificing some accuracy in the result. Experimental results on various high-dimensional data sets demonstrate that the proposed index structure performs comparably to the state-of-the-art Bregman ball tree in terms of search performance and result quality, while being well over an order of magnitude faster to build. The authors also apply their space-partitioning principle to the Bregman ball tree and obtain a new index structure for exact nearest neighbor search that is faster to build and only slightly slower at query processing than the original.
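For readers unfamiliar with the distance family being indexed: a Bregman divergence is generated by a strictly convex function F, and both the squared Euclidean distance and the KL divergence arise as special cases. A minimal sketch (assuming NumPy; the function names are illustrative, not from the paper):

```python
import numpy as np

def bregman_divergence(F, grad_F, x, y):
    """Generic Bregman divergence D_F(x, y) = F(x) - F(y) - <grad F(y), x - y>."""
    return F(x) - F(y) - np.dot(grad_F(y), x - y)

# Squared Euclidean distance arises from F(x) = ||x||^2.
sq_norm = lambda v: np.dot(v, v)
grad_sq = lambda v: 2.0 * v

# KL divergence (on probability vectors) arises from negative entropy
# F(x) = sum_i x_i log x_i.
neg_entropy = lambda v: np.sum(v * np.log(v))
grad_neg_entropy = lambda v: np.log(v) + 1.0

x = np.array([0.2, 0.3, 0.5])
y = np.array([0.1, 0.4, 0.5])

d_sq = bregman_divergence(sq_norm, grad_sq, x, y)                 # equals ||x - y||^2
d_kl = bregman_divergence(neg_entropy, grad_neg_entropy, x, y)    # equals KL(x || y)
```

Note that Bregman divergences are generally asymmetric, D_F(x, y) ≠ D_F(y, x), which is what makes index structures such as the Bregman ball tree and the hyperplane tree nontrivial adaptations of their metric-space counterparts.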


2013 ◽  
Vol 13 (1) ◽  
pp. 195-206 ◽  
Author(s):  
Travis Mackoy ◽  
Robert C. Harris ◽  
Jesse Johnson ◽  
Michael Mascagni ◽  
Marcia O. Fenley

Abstract
Stochastic walk-on-spheres (WOS) algorithms for solving the linearized Poisson-Boltzmann equation (LPBE) provide several attractive features not available in traditional deterministic solvers: Gaussian error bars can be computed easily; the algorithm is readily parallelized and requires minimal memory; and multiple solvent environments can be accounted for by reweighting trajectories. However, previously reported computational times of these Monte Carlo methods were not competitive with existing deterministic numerical methods. The present paper demonstrates a series of numerical optimizations that collectively make the computational time of these Monte Carlo LPBE solvers competitive with deterministic methods. The optimization techniques are: ensuring that each atom's contribution to the variance of the electrostatic solvation free energy is the same; optimizing the bias-generating parameters in the algorithm; and using an epsilon-approximate rather than exact nearest-neighbor search when determining the size of the next step of the Brownian motion outside the molecule.
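The walk-on-spheres idea can be illustrated on the plain Laplace equation on the unit disk, a simplified stand-in for the LPBE setting described above (function names are illustrative, not from the paper). At each step the walker jumps to a random point on the largest sphere that fits inside the domain; finding that sphere's radius is exactly the nearest-boundary query that the paper accelerates with an epsilon-approximate nearest-neighbor search over the atoms:

```python
import math, random

def walk_on_spheres(x, y, boundary_value, eps=1e-3, rng=random):
    """One WOS trajectory for the Laplace equation on the unit disk.

    The walker repeatedly jumps to a uniformly random point on the largest
    circle centred at its position that stays inside the domain; the radius
    is the distance to the boundary (in a molecular setting, a nearest-
    neighbor query over the atoms). The walk stops within eps of the
    boundary and reads off the boundary condition there.
    """
    while True:
        d = 1.0 - math.hypot(x, y)        # distance to the unit-circle boundary
        if d < eps:                        # close enough: project and terminate
            r = math.hypot(x, y)
            return boundary_value(x / r, y / r)
        theta = rng.uniform(0.0, 2.0 * math.pi)
        x += d * math.cos(theta)
        y += d * math.sin(theta)

def estimate(x, y, boundary_value, n_walks=2000, seed=0):
    """Monte Carlo average over independent trajectories; error ~ 1/sqrt(n)."""
    rng = random.Random(seed)
    total = sum(walk_on_spheres(x, y, boundary_value, rng=rng) for _ in range(n_walks))
    return total / n_walks

# Test case: u(x, y) = x is harmonic, so the estimate at (0.5, 0) should be
# close to 0.5, with Gaussian error bars shrinking as 1/sqrt(n_walks).
u_hat = estimate(0.5, 0.0, lambda bx, by: bx)
```

Using a slightly smaller (epsilon-approximate) radius at each step, as the paper does, keeps the walker inside the domain and so preserves correctness while making each step cheaper.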


2020 ◽  
Author(s):  
Cameron Hargreaves ◽  
Matthew Dyer ◽  
Michael Gaultois ◽  
Vitaliy Kurlin ◽  
Matthew J Rosseinsky

It is a core problem in any field to reliably tell how close two objects are to being the same, and once this relation has been established we can use this information to precisely quantify potential relationships, both analytically and with machine learning (ML). For inorganic solids, the chemical composition is a fundamental descriptor, which can be represented by assigning the ratio of each element in the material to a vector. These vectors are a convenient mathematical data structure for measuring similarity, but unfortunately, the standard metric (the Euclidean distance) gives little to no variance in the resultant distances between chemically dissimilar compositions. We present the Earth Mover's Distance (EMD) for inorganic compositions, a well-defined metric which enables the measure of chemical similarity in an explainable fashion. We compute the EMD between two compositions from the ratio of each of the elements and the absolute distance between the elements on the modified Pettifor scale. This simple metric shows clear strength at distinguishing compounds and is efficient to compute in practice. The resultant distances have greater alignment with chemical understanding than the Euclidean distance, which is demonstrated on the binary compositions of the Inorganic Crystal Structure Database (ICSD). The EMD is a reliable numeric measure of chemical similarity that can be incorporated into automated workflows for a range of ML techniques. We have found that with no supervision the use of this metric gives a distinct partitioning of binary compounds into clear trends and families of chemical properties, with future applications for nearest neighbor search queries in chemical database retrieval systems and supervised ML techniques.
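On a one-dimensional scale such as the modified Pettifor scale, the EMD between two compositions reduces to the L1 distance between their cumulative distributions integrated along the scale. A sketch of that computation (the element ranks below are illustrative placeholders, not the published scale values):

```python
import numpy as np

# Illustrative ranks only; the real modified Pettifor scale assigns every
# element a position reflecting chemical similarity.
PETTIFOR = {"Na": 11, "K": 10, "Cl": 99, "Br": 98}

def composition_emd(comp_a, comp_b, scale=PETTIFOR):
    """1-D Earth Mover's Distance between two compositions.

    Each composition is a dict mapping element symbols to ratios; the EMD
    is the area between the two cumulative distributions along the scale.
    """
    positions = sorted({scale[e] for e in comp_a} | {scale[e] for e in comp_b})
    wa = np.array([sum(f for e, f in comp_a.items() if scale[e] == p) for p in positions], float)
    wb = np.array([sum(f for e, f in comp_b.items() if scale[e] == p) for p in positions], float)
    wa, wb = wa / wa.sum(), wb / wb.sum()      # normalize the element ratios
    cdf_diff = np.cumsum(wa - wb)              # difference of cumulative distributions
    gaps = np.diff(positions)                  # spacing between occupied ranks
    return float(np.sum(np.abs(cdf_diff[:-1]) * gaps))

# NaCl vs. KBr: half the mass moves Na->K (1 rank) and half Cl->Br (1 rank),
# so the distance is 0.5 + 0.5 = 1.0 under the placeholder ranks above.
d = composition_emd({"Na": 1, "Cl": 1}, {"K": 1, "Br": 1})
```

Because the scale is one-dimensional, no general-purpose optimal-transport solver is needed, which is why the metric is cheap enough for large-scale database queries.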


2021 ◽  
Vol 7 (2) ◽  
pp. 187-199
Author(s):  
Meng-Hao Guo ◽  
Jun-Xiong Cai ◽  
Zheng-Ning Liu ◽  
Tai-Jiang Mu ◽  
Ralph R. Martin ◽  
...  

Abstract
The irregular domain and lack of ordering make it challenging to design deep neural networks for point cloud processing. This paper presents a novel framework named Point Cloud Transformer (PCT) for point cloud learning. PCT is based on the Transformer, which has achieved huge success in natural language processing and shown great potential in image processing. It is inherently permutation invariant when processing a sequence of points, making it well-suited to point cloud learning. To better capture local context within the point cloud, we enhance input embedding with the support of farthest point sampling and nearest neighbor search. Extensive experiments demonstrate that PCT achieves state-of-the-art performance on shape classification, part segmentation, semantic segmentation, and normal estimation tasks.
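The two sampling operations mentioned for the input embedding can be sketched as follows (a brute-force NumPy sketch with illustrative names, not the paper's implementation): farthest point sampling greedily picks well-spread anchor points, and a k-nearest-neighbor search then gathers a local neighborhood around each anchor.

```python
import numpy as np

def farthest_point_sampling(points, n_samples, seed=0):
    """Greedy FPS: repeatedly pick the point farthest from all chosen so far."""
    rng = np.random.default_rng(seed)
    n = points.shape[0]
    chosen = [int(rng.integers(n))]                     # random starting point
    dist = np.linalg.norm(points - points[chosen[0]], axis=1)
    for _ in range(n_samples - 1):
        idx = int(np.argmax(dist))                      # farthest from chosen set
        chosen.append(idx)
        dist = np.minimum(dist, np.linalg.norm(points - points[idx], axis=1))
    return np.array(chosen)

def knn_indices(points, centers, k):
    """Brute-force k-nearest-neighbor search: indices of the k closest points
    to each center (a center drawn from `points` is its own nearest neighbor)."""
    d = np.linalg.norm(points[None, :, :] - centers[:, None, :], axis=-1)
    return np.argsort(d, axis=1)[:, :k]

# Usage: pick 16 anchors from a 128-point cloud, then group 8 neighbors each.
pts = np.random.default_rng(1).random((128, 3))
anchors = farthest_point_sampling(pts, 16)
groups = knn_indices(pts, pts[anchors], 8)              # shape (16, 8)
```

In practice point cloud pipelines replace the brute-force search with a spatial index for large clouds; the grouped neighborhoods are what the embedding layer aggregates to capture local context.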

