scholarly journals Multi-PQTable for Approximate Nearest-Neighbor Search

Information ◽  
2019 ◽  
Vol 10 (6) ◽  
pp. 190
Author(s):  
Xinpan Yuan ◽  
Qunfeng Liu ◽  
Jun Long ◽  
Lei Hu ◽  
Songlin Wang

Image retrieval or content-based image retrieval (CBIR) can be transformed into the calculation of the distance between image feature vectors. The closer the vectors are, the higher the image similarity will be. In the image retrieval system for large-scale dataset, the approximate nearest-neighbor (ANN) search can quickly obtain the top k images closest to the query image, which is the Top-k problem in the field of information retrieval. With the traditional ANN algorithms, such as KD-Tree, R-Tree, and M-Tree, when the dimension of the image feature vector increases, the computing time will increase exponentially due to the curse of dimensionality. In order to reduce the calculation time and improve the efficiency of image retrieval, we propose an ANN search algorithm based on the Product Quantization Table (PQTable). After quantizing and compressing the image feature vectors by the product quantization algorithm, we can construct the image index structure of the PQTable, which speeds up image retrieval. We also propose a multi-PQTable query strategy for ANN search. Besides, we generate several nearest-neighbor vectors for each sub-compressed vector of the query vector to reduce the failure rate and improve the recall in image retrieval. Through theoretical analysis and experimental verification, it is proved that the multi-PQTable query strategy and the generation of several nearest-neighbor vectors are greatly correct and efficient.

2019 ◽  
Vol 125 ◽  
pp. 187-194 ◽  
Author(s):  
Shan An ◽  
Zhibiao Huang ◽  
Shuang Bai ◽  
Guangfu Che ◽  
Xin Ma ◽  
...  

2018 ◽  
Author(s):  
Daisuke Komura ◽  
Keisuke Fukuta ◽  
Ken Tominaga ◽  
Akihiro Kawabe ◽  
Hirotomo Koda ◽  
...  

AbstractBackgroundAs a large number of digital histopathological images have been accumulated, there is a growing demand of content-based image retrieval (CBIR) in pathology for educational, diagnostic, or research purposes. However, no CBIR systems in digital pathology are publicly available.ResultsWe developed a web application, the Luigi system, which retrieves similar histopathological images from various cancer cases. Using deep texture representations computed with a pre-trained convolutional neural network as an image feature in conjunction with an approximate nearest neighbor search method, the Luigi system provides fast and accurate results for any type of tissue or cell without the need for further training. In addition, users can easily submit query images of an appropriate scale into the Luigi system and view the retrieved results using our smartphone application. The cases stored in the Luigi database are obtained from The Cancer Genome Atlas with rich clinical, pathological, and molecular information. We tested the Luigi system by querying typical cancerous regions from four cancer types, and confirmed successful retrieval of relevant images.ConclusionsThe Luigi system will help students, pathologists, and researchers easily retrieve histopathological images of various cancers similar to those of the query image.


Author(s):  
André Fernandes ◽  
George Teodoro

Nesse artigo é apresentada uma paralelização eficiente do algoritmo de busca por similaridade Product Quantization Approximate Nearest Neighbor Search (PQANNS). Esse método pode responder consultas com uma demanda reduzida de memória e, juntamente com a paralelização proposta, pode lidar de forma eficiente com grandes bases de dados. A execução utilizando 128 nós/3584 núcleos de CPU foi capaz de atingir uma eficiência do paralelismo de 0.97 em uma base de dados contendo 256 bilhões de descritores SIFT.


Sign in / Sign up

Export Citation Format

Share Document