Fast and Exact Nearest Neighbor Search in Hamming Space on Full-Text Search Engines

Fast Nearest Neighbor Search in the Hamming Space

MultiMedia Modeling - Lecture Notes in Computer Science ◽

10.1007/978-3-319-27671-7_27 ◽

2016 ◽

pp. 325-336 ◽

Cited By ~ 7

Author(s):

Zhansheng Jiang ◽

Lingxi Xie ◽

Xiaotie Deng ◽

Weiwei Xu ◽

Jingdong Wang

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Neighbor Search ◽

Hamming Space

Download Full-text

A Fast Approximate Nearest Neighbor Search Algorithm in the Hamming Space

IEEE Transactions on Pattern Analysis and Machine Intelligence ◽

10.1109/tpami.2012.170 ◽

2012 ◽

Vol 34 (12) ◽

pp. 2481-2488 ◽

Cited By ~ 31

Author(s):

Mani Malek Esmaeili ◽

R. K. Ward ◽

M. Fatourechi

Keyword(s):

Nearest Neighbor ◽

Search Algorithm ◽

Nearest Neighbor Search ◽

Approximate Nearest Neighbor Search ◽

Approximate Nearest Neighbor ◽

Neighbor Search ◽

Hamming Space

Download Full-text

Optimized K-Means Hashing for Approximate Nearest Neighbor Search

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.651-653.2168 ◽

2014 ◽

Vol 651-653 ◽

pp. 2168-2171

Author(s):

Qin Zhen Guo ◽

Zhi Zeng ◽

Shu Wu Zhang ◽

Yuan Zhang ◽

Gui Xuan Zhang

Keyword(s):

High Efficiency ◽

Nearest Neighbor ◽

Quantization Error ◽

Nearest Neighbor Search ◽

Binary Codes ◽

Neighborhood Structure ◽

Approximate Nearest Neighbor Search ◽

Approximate Nearest Neighbor ◽

Neighbor Search ◽

Hamming Space

Hashing which maps data into binary codes in Hamming space has attracted more and more attentions for approximate nearest neighbor search due to its high efficiency and reduced storage cost. K-means hashing (KH) is a novel hashing method which firstly quantizes the data by codewords and then uses the indices of codewords to encode the data. However, in KH, only the codewords are updated to minimize the quantization error and affinity error while the indices of codewords remain the same after they are initialized. In this paper, we propose an optimized k-means hashing (OKH) method to encode data by binary codes. In our method, we simultaneously optimize the codewords and the indices of them to minimize the quantization error and the affinity error. Our OKH method can find both the optimal codewords and the optiaml indices, and the resulting binary codes in Hamming space can better preserve the original neighborhood structure of the data. Besides, OKH can further be generalized to a product space. Extensive experiments have verified the superiority of OKH over KH and other state-of-the-art hashing methods.

Download Full-text

Quantification of competitive value of documents

Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis ◽

10.11118/actaun200957050285 ◽

2009 ◽

Vol 57 (5) ◽

pp. 285-290

Author(s):

Pavel Šimek ◽

Jiří Vaněk ◽

Jan Jarolímek

Keyword(s):

Search Engine ◽

Market Share ◽

Full Text ◽

Search Engines ◽

Web Site ◽

Optimization Techniques ◽

Text Search ◽

Full Text Search ◽

Google Search ◽

The Web

The majority of Internet users use the global network to search for different information using fulltext search engines such as Google, Yahoo!, or Seznam. The web presentation operators are trying, with the help of different optimization techniques, to get to the top places in the results of fulltext search engines. Right there is a great importance of Search Engine Optimization and Search Engine Marketing, because normal users usually try links only on the first few pages of the fulltext search engines results on certain keywords and in catalogs they use primarily hierarchically higher placed links in each category. Key to success is the application of optimization methods which deal with the issue of keywords, structure and quality of content, domain names, individual sites and quantity and reliability of backward links. The process is demanding, long-lasting and without a guaranteed outcome. A website operator without advanced analytical tools do not identify the contribution of individual documents from which the entire web site consists. If the web presentation operators want to have an overview of their documents and web site in global, it is appropriate to quantify these positions in a specific way, depending on specific key words. For this purpose serves the quantification of competitive value of documents, which consequently sets global competitive value of a web site. Quantification of competitive values is performed on a specific full-text search engine. For each full-text search engine can be and often are, different results. According to published reports of ClickZ agency or Market Share is according to the number of searches by English-speaking users most widely used Google search engine, which has a market share of more than 80%. The whole procedure of quantification of competitive values is common, however, the initial step which is the analysis of keywords depends on a choice of the fulltext search engine.

Download Full-text

An Efficient Exact Nearest Neighbor Search by Compounded Embedding

Database Systems for Advanced Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-319-91452-7_3 ◽

2018 ◽

pp. 37-54

Author(s):

Mingjie Li ◽

Ying Zhang ◽

Yifang Sun ◽

Wei Wang ◽

Ivor W. Tsang ◽

...

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Neighbor Search ◽

Exact Nearest Neighbor

Download Full-text

Bregman Hyperplane Trees for Fast Approximate Nearest Neighbor Search

International Journal of Multimedia Data Engineering and Management ◽

10.4018/jmdem.2012100104 ◽

2012 ◽

Vol 3 (4) ◽

pp. 75-87

Author(s):

Bilegsaikhan Naidan ◽

Magnus Lie Hetland

Keyword(s):

Query Processing ◽

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Index Structure ◽

Data Sets ◽

Search Performance ◽

Space Partitioning ◽

Neighbor Search ◽

Order Of Magnitude ◽

Exact Nearest Neighbor

This article presents a new approximate index structure, the Bregman hyperplane tree, for indexing the Bregman divergence, aiming to decrease the number of distance computations required at query processing time, by sacrificing some accuracy in the result. The experimental results on various high-dimensional data sets demonstrate that the proposed index structure performs comparably to the state-of-the-art Bregman ball tree in terms of search performance and result quality. Moreover, this method results in a speedup of well over an order of magnitude for index construction. The authors also apply their space partitioning principle to the Bregman ball tree and obtain a new index structure for exact nearest neighbor search that is faster to build and a slightly slower at query processing than the original.

Download Full-text

Confirmation Sampling for Exact Nearest Neighbor Search

Similarity Search and Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-030-60936-8_8 ◽

2020 ◽

pp. 97-110

Author(s):

Tobias Christiani ◽

Rasmus Pagh ◽

Mikkel Thorup

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Neighbor Search ◽

Exact Nearest Neighbor

Download Full-text

Accelerating exact nearest neighbor search in high dimensional Euclidean space via block vectors

International Journal of Intelligent Systems ◽

10.1002/int.22692 ◽

2021 ◽

Author(s):

Haowen Zhang ◽

Yabo Dong ◽

Duanqing Xu

Keyword(s):

Euclidean Space ◽

Nearest Neighbor ◽

Nearest Neighbor Search ◽

High Dimensional ◽

Dimensional Euclidean Space ◽

Neighbor Search ◽

Exact Nearest Neighbor

Download Full-text

A novel supervised cluster adjustment method using a fast exact nearest neighbor search algorithm

Pattern Analysis and Applications ◽

10.1007/s10044-015-0527-6 ◽

2015 ◽

Vol 20 (3) ◽

pp. 701-715 ◽

Cited By ~ 1

Author(s):

Ali Zaghian ◽

Fakhroddin Noorbehbahani

Keyword(s):

Nearest Neighbor ◽

Search Algorithm ◽

Nearest Neighbor Search ◽

Adjustment Method ◽

Neighbor Search ◽

Exact Nearest Neighbor

Download Full-text

Efficient nearest neighbor search in high dimensional hamming space

Pattern Recognition ◽

10.1016/j.patcog.2019.107082 ◽

2020 ◽

Vol 99 ◽

pp. 107082 ◽

Cited By ~ 6

Author(s):

Bin Fan ◽

Qingqun Kong ◽

Baoqian Zhang ◽

Hongmin Liu ◽

Chunhong Pan ◽

...

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

High Dimensional ◽

Neighbor Search ◽

Hamming Space

Download Full-text