Fast nearest neighbor search in high-dimensional space

Nearest neighbor search in high-dimensional space is an important operation in many applications, such as data mining and multimedia databases. Evaluating similarity in high-dimensional space is computationally expensive, so index structures are frequently used to reduce the cost. Most of these index structures are built by partitioning the data set. However, partitioning approaches can fail to find the true nearest neighbor because of the partition boundaries. In this paper, we propose the Error Minimizing Partitioning (EMP) method, with a novel tree structure, which minimizes failures to find the nearest neighbor. EMP divides the data into subsets while taking the distribution of the data into account. To partition a data set, the proposed method first finds the line that minimizes the sum of distances to the data points. It then finds the median of the data set. Finally, it determines the partitioning hyperplane that passes through the median and is perpendicular to the line. We also compare the proposed method with existing methods to verify its effectiveness.
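The three partitioning steps described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the line is the least-squares line (minimizing the sum of squared perpendicular distances, i.e. the first principal axis through the centroid), whereas the abstract says only "distance". It also assumes the median is taken over the projections of the points onto that line. The names `dot`, `principal_direction`, and `split_at_median` are hypothetical.

```python
import math
import statistics


def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def principal_direction(points, iters=100):
    """Unit vector along the least-squares line through the centroid.

    Assumption: "minimizes the summation of distance" is read as squared
    perpendicular distance, so the direction is the top eigenvector of the
    scatter matrix, found here by power iteration.
    """
    n, d = len(points), len(points[0])
    mean = [sum(p[j] for p in points) / n for j in range(d)]
    centered = [[p[j] - mean[j] for j in range(d)] for p in points]
    # Fixed, non-uniform start vector; power iteration can stall if the start
    # happens to be orthogonal to the dominant direction.
    v = [float(j + 1) for j in range(d)]
    norm = math.sqrt(dot(v, v))
    v = [x / norm for x in v]
    for _ in range(iters):
        scores = [dot(c, v) for c in centered]
        # w = X^T (X v), where X is the centered data matrix
        w = [sum(s * c[j] for s, c in zip(scores, centered)) for j in range(d)]
        norm = math.sqrt(dot(w, w))
        if norm == 0.0:
            break
        v = [x / norm for x in w]
    return v


def split_at_median(points):
    """Split points by the hyperplane {x : v . x = t}.

    v is the line direction and t is the median projection (an assumed
    reading of "the median of the data set").
    """
    v = principal_direction(points)
    scores = [dot(p, v) for p in points]
    t = statistics.median(scores)
    left = [p for p, s in zip(points, scores) if s <= t]
    right = [p for p, s in zip(points, scores) if s > t]
    return v, t, left, right
```

Applying `split_at_median` recursively to `left` and `right` would give a binary partition tree. How EMP actually builds and searches its tree is not stated in the abstract.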

Near-Optimal Partial Linear Scan for Nearest Neighbor Search in High-Dimensional Space

Database Systems for Advanced Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-642-37487-6_10 ◽

2013 ◽

pp. 101-115 ◽

Cited By ~ 1

Author(s):

Jiangtao Cui ◽

Zi Huang ◽

Bo Wang ◽

Yingfan Liu

Keyword(s):

Nearest Neighbor ◽

Dimensional Space ◽

Nearest Neighbor Search ◽

High Dimensional ◽

High Dimensional Space ◽

Neighbor Search ◽

Partial Linear

Grassmann Hashing for approximate nearest neighbor search in high dimensional space

2011 IEEE International Conference on Multimedia and Expo ◽

10.1109/icme.2011.6012027 ◽

2011 ◽

Author(s):

Xinchao Wang ◽

Zhu Li ◽

Lei Zhang ◽

Junsong Yuan

Keyword(s):

Nearest Neighbor ◽

Dimensional Space ◽

Nearest Neighbor Search ◽

High Dimensional ◽

High Dimensional Space ◽

Approximate Nearest Neighbor Search ◽

Approximate Nearest Neighbor ◽

Neighbor Search

Distance Encoded Product Quantization for Approximate K-Nearest Neighbor Search in High-Dimensional Space

IEEE Transactions on Pattern Analysis and Machine Intelligence ◽

10.1109/tpami.2018.2853161 ◽

2019 ◽

Vol 41 (9) ◽

pp. 2084-2097 ◽

Cited By ~ 4

Author(s):

Jae-Pil Heo ◽

Zhe Lin ◽

Sung-Eui Yoon

Keyword(s):

Nearest Neighbor ◽

Dimensional Space ◽

Nearest Neighbor Search ◽

High Dimensional ◽

K Nearest Neighbor ◽

High Dimensional Space ◽

Product Quantization ◽

Neighbor Search ◽

K Nearest Neighbor Search

Effective optimizations of cluster-based nearest neighbor search in high-dimensional space

Multimedia Systems ◽

10.1007/s00530-014-0444-3 ◽

2014 ◽

Vol 23 (1) ◽

pp. 139-153 ◽

Cited By ~ 2

Author(s):

Xiaokang Feng ◽

Jiangtao Cui ◽

Yingfan Liu ◽

Hui Li

Keyword(s):

Nearest Neighbor ◽

Dimensional Space ◽

Nearest Neighbor Search ◽

High Dimensional ◽

High Dimensional Space ◽

Neighbor Search

I-LSH: I/O Efficient c-Approximate Nearest Neighbor Search in High-Dimensional Space

2019 IEEE 35th International Conference on Data Engineering (ICDE) ◽

10.1109/icde.2019.00169 ◽

2019 ◽

Cited By ~ 10

Author(s):

Wanqi Liu ◽

Hanchen Wang ◽

Ying Zhang ◽

Wei Wang ◽

Lu Qin

Keyword(s):

Nearest Neighbor ◽

Dimensional Space ◽

Nearest Neighbor Search ◽

High Dimensional ◽

High Dimensional Space ◽

Approximate Nearest Neighbor Search ◽

Approximate Nearest Neighbor ◽

Neighbor Search

Novel approach for nearest neighbor search in high dimensional space

2008 4th International IEEE Conference Intelligent Systems ◽

10.1109/is.2008.4670504 ◽

2008 ◽

Cited By ~ 1

Author(s):

Ming Zhang ◽

Reda Alhajj

Keyword(s):

Nearest Neighbor ◽

Dimensional Space ◽

Nearest Neighbor Search ◽

High Dimensional ◽

High Dimensional Space ◽

Neighbor Search ◽

Novel Approach

The Automated Risk Estimation for the Navigation of Autonomous Ships by Learning with Navigation Feature

International Journal of Computational Methods ◽

10.1142/s0219876220410030 ◽

2020 ◽

pp. 2041003

Author(s):

Wei Chian Tan ◽

Kie Hian Chua ◽

Yanling Wu

Keyword(s):

Nearest Neighbor ◽

Dimensional Space ◽

Risk Estimation ◽

Nearest Neighbor Search ◽

Mathematical Representation ◽

Automatic Identification ◽

Identification System ◽

North West ◽

The North ◽

Neighbor Search

This work presents a data-driven approach for automated risk estimation of the voyage of a vessel or ship. While the industry is moving from a compliance-based framework with existing rules to a risk-based one, there is also a need to monitor the risk of a vessel from the perspective of navigation. This is even more important for autonomous ships. Using a state-of-the-art mathematical representation, the navigation feature, each existing voyage is transformed into a corresponding series of points in [Formula: see text]-dimensional space. During pre-processing, given a set of historical Automatic Identification System (AIS) data, the records that belong to the same vessel within a certain period of time are taken as a voyage and mapped to the corresponding space of the navigation feature. After pre-processing, during online monitoring, the current trajectory of the vessel is transformed into the corresponding representation in the same way. Using a nearest-neighbor search scheme, the distance to the nearest neighbor is taken as the risk of the current voyage. In other words, the deviation from the closest route in the historical data is taken as the risk. The developed method has demonstrated encouraging performance on a set of challenging historical AIS data from the Australian Maritime Safety Authority, covering three regions in the Australian territory, namely the Bass Strait, the Great Australian Bight and the North West.
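The scoring rule in this abstract (risk equals the distance to the nearest historical neighbor) can be sketched as follows. This is a minimal illustration, not the paper's method. It assumes each voyage has already been reduced to a single fixed-length feature vector and that distance is Euclidean. The navigation feature itself is not reproduced, and the function name `nearest_neighbor_risk` is hypothetical.

```python
import math


def nearest_neighbor_risk(current, historical):
    """Return the distance from `current` to its closest historical vector.

    Assumptions: vectors share one dimensionality, distance is Euclidean,
    and a brute-force scan stands in for whatever search scheme the
    paper uses.
    """
    if not historical:
        raise ValueError("historical set must not be empty")
    return min(math.dist(current, h) for h in historical)
```

A voyage that coincides with a historical route scores 0, and the score grows as the current trajectory deviates from every known route.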
