A Combination of Spatial Pyramid and Inverted Index for Large-Scale Image Retrieval

Vinh-Tiep Nguyen; Thanh Duc Ngo; Minh-Triet Tran; Duy-Dinh Le; Duc Anh Duong

doi:10.4018/ijmdem.2015040103

A Combination of Spatial Pyramid and Inverted Index for Large-Scale Image Retrieval

International Journal of Multimedia Data Engineering and Management ◽

10.4018/ijmdem.2015040103 ◽

2015 ◽

Vol 6 (2) ◽

pp. 37-51 ◽

Cited By ~ 2

Author(s):

Vinh-Tiep Nguyen ◽

Thanh Duc Ngo ◽

Minh-Triet Tran ◽

Duy-Dinh Le ◽

Duc Anh Duong

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Spatial Information ◽

Real Life ◽

Inverted Index ◽

Bag Of Words ◽

Visual Words ◽

Benchmark Datasets ◽

Large Scale Image Retrieval ◽

Inverted Indexing

Large-scale image retrieval has been shown remarkable potential in real-life applications. The standard approach is based on Inverted Indexing, given images are represented using Bag-of-Words model. However, one major limitation of both Inverted Index and Bag-of-Words presentation is that they ignore spatial information of visual words in image presentation and comparison. As a result, retrieval accuracy is decreased. In this paper, the authors investigate an approach to integrate spatial information into Inverted Index to improve accuracy while maintaining short retrieval time. Experiments conducted on several benchmark datasets (Oxford Building 5K, Oxford Building 5K+100K and Paris 6K) demonstrate the effectiveness of our proposed approach.

Download Full-text

A Combination of Spatial Pyramid and Inverted Index for Large-Scale Image Retrieval

Computer Vision ◽

10.4018/978-1-5225-5204-8.ch054 ◽

2018 ◽

pp. 1307-1321

Author(s):

Vinh-Tiep Nguyen ◽

Thanh Duc Ngo ◽

Minh-Triet Tran ◽

Duy-Dinh Le ◽

Duc Anh Duong

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Spatial Information ◽

Real Life ◽

Inverted Index ◽

Bag Of Words ◽

Visual Words ◽

Benchmark Datasets ◽

Large Scale Image Retrieval ◽

Inverted Indexing

Download Full-text

Integrating Spatial Information into Inverted Index for Large-Scale Image Retrieval

2014 IEEE International Symposium on Multimedia ◽

10.1109/ism.2014.63 ◽

2014 ◽

Cited By ~ 4

Author(s):

Bien-Van Nguyen ◽

Duy Pham ◽

Thanh Duc Ngo ◽

Duy-Dinh Le ◽

Duc Anh Duong

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Spatial Information ◽

Inverted Index ◽

Large Scale Image Retrieval

Download Full-text

Simhash for Large Scale Image Retrieval

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.651-653.2197 ◽

2014 ◽

Vol 651-653 ◽

pp. 2197-2200

Author(s):

Qin Zhen Guo ◽

Zhi Zeng ◽

Shu Wu Zhang ◽

Xiao Feng ◽

Hu Guan

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Original Data ◽

Locality Sensitive Hashing ◽

Neighborhood Structure ◽

Retrieval Task ◽

Visual Words ◽

Feature Vectors ◽

Representation Method ◽

Large Scale Image Retrieval

Due to its fast query speed and reduced storage cost, hashing, which tries to learn binary code representation for data with the expectation of preserving the neighborhood structure in the original data space, has been widely used in a large variety of applications like image retrieval. For most existing image retrieval methods with hashing, there are two main steps: describe images with feature vectors, and then use hashing methods to encode the feature vectors. In this paper, we make two research contributions. First, we creatively propose to use simhash which can be intrinsically combined with the popular image representation method, Bag-of-visual-words (BoW) for image retrieval. Second, we novelly incorporate “locality-sensitive” hashing into simhash to take the correlation of the visual words of BoW into consideration to make similar visual words have similar fingerprint. Extensive experiments have verified the superiority of our method over some state-of-the-art methods for image retrieval task.

Download Full-text

Large-Scale Image Retrieval with Bag-of-Words and k-NN Re-Ranking

International Journal of Multimedia and Ubiquitous Engineering ◽

10.14257/ijmue.2015.10.6.26 ◽

2015 ◽

Vol 10 (6) ◽

pp. 265-276 ◽

Cited By ~ 1

Author(s):

Pang Haibo ◽

Chengming Liu ◽

Zhe Zhao ◽

Zhanbo Li

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Bag Of Words ◽

Large Scale Image Retrieval

Download Full-text

Large Scale Image Retrieval with Practical Spatial Weighting for Bag-of-Visual-Words

Lecture Notes in Computer Science - Advances in Multimedia Modeling ◽

10.1007/978-3-642-35725-1_47 ◽

2013 ◽

pp. 513-523 ◽

Cited By ~ 1

Author(s):

Fangyuan Wang ◽

Hai Wang ◽

Heping Li ◽

Shuwu Zhang

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Bag Of Visual Words ◽

Visual Words ◽

Large Scale Image Retrieval

Download Full-text

Neural methods for effective, efficient, and exposure-aware information retrieval

ACM SIGIR Forum ◽

10.1145/3476415.3476434 ◽

2021 ◽

Vol 55 (1) ◽

pp. 1-2

Author(s):

Bhaskar Mitra

Keyword(s):

Information Retrieval ◽

Language Processing ◽

Large Scale ◽

Web Search ◽

Real Life ◽

Inverted Index ◽

Information Need ◽

Product Model ◽

Performance Improvements ◽

Deep Model

Neural networks with deep architectures have demonstrated significant performance improvements in computer vision, speech recognition, and natural language processing. The challenges in information retrieval (IR), however, are different from these other application areas. A common form of IR involves ranking of documents---or short passages---in response to keyword-based queries. Effective IR systems must deal with query-document vocabulary mismatch problem, by modeling relationships between different query and document terms and how they indicate relevance. Models should also consider lexical matches when the query contains rare terms---such as a person's name or a product model number---not seen during training, and to avoid retrieving semantically related but irrelevant results. In many real-life IR tasks, the retrieval involves extremely large collections---such as the document index of a commercial Web search engine---containing billions of documents. Efficient IR methods should take advantage of specialized IR data structures, such as inverted index, to efficiently retrieve from large collections. Given an information need, the IR system also mediates how much exposure an information artifact receives by deciding whether it should be displayed, and where it should be positioned, among other results. Exposure-aware IR systems may optimize for additional objectives, besides relevance, such as parity of exposure for retrieved items and content publishers. In this thesis, we present novel neural architectures and methods motivated by the specific needs and challenges of IR tasks. We ground our contributions with a detailed survey of the growing body of neural IR literature [Mitra and Craswell, 2018]. Our key contribution towards improving the effectiveness of deep ranking models is developing the Duet principle [Mitra et al., 2017] which emphasizes the importance of incorporating evidence based on both patterns of exact term matches and similarities between learned latent representations of query and document. To efficiently retrieve from large collections, we develop a framework to incorporate query term independence [Mitra et al., 2019] into any arbitrary deep model that enables large-scale precomputation and the use of inverted index for fast retrieval. In the context of stochastic ranking, we further develop optimization strategies for exposure-based objectives [Diaz et al., 2020]. Finally, this dissertation also summarizes our contributions towards benchmarking neural IR models in the presence of large training datasets [Craswell et al., 2019] and explores the application of neural methods to other IR tasks, such as query auto-completion.

Download Full-text

Ratio Voting: A New Voting Strategy for Large-Scale Image Retrieval

2012 IEEE International Conference on Multimedia and Expo ◽

10.1109/icme.2012.156 ◽

2012 ◽

Cited By ~ 1

Author(s):

Yusuke Uchida ◽

Koichi Takagi ◽

Shigeyuki Sakazawa

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Large Scale Image Retrieval ◽

Voting Strategy

Download Full-text

An Enhanced Deep Hashing Method for Large-Scale Image Retrieval

Knowledge Science, Engineering and Management - Lecture Notes in Computer Science ◽

10.1007/978-3-030-29551-6_34 ◽

2019 ◽

pp. 382-393

Author(s):

Cong Chen ◽

Weiqin Tong ◽

Xuehai Ding ◽

Xiaoli Zhi

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Deep Hashing ◽

Large Scale Image Retrieval

Download Full-text

Cascaded Deep Hashing for Large-Scale Image Retrieval

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-030-04224-0_36 ◽

2018 ◽

pp. 419-429

Author(s):

Jun Lu ◽

Li Zhang

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Deep Hashing ◽

Large Scale Image Retrieval

Download Full-text

A novel method for content-based image retrieval to improve the effectiveness of the bag-of-words model using a support vector machine

Journal of Information Science ◽

10.1177/0165551518782825 ◽

2018 ◽

Vol 45 (1) ◽

pp. 117-135 ◽

Cited By ~ 15

Author(s):

Amna Sarwar ◽

Zahid Mehmood ◽

Tanzila Saba ◽

Khurram Ashfaq Qazi ◽

Ahmed Adnan ◽

...

Keyword(s):

Image Retrieval ◽

Image Databases ◽

Semantic Gap ◽

Content Based Image Retrieval ◽

Support Vector ◽

Bag Of Words ◽

Visual Words ◽

Visual Attributes ◽

Visual Vocabulary ◽

Novel Method

The advancements in the multimedia technologies result in the growth of the image databases. To retrieve images from such image databases using visual attributes of the images is a challenging task due to the close visual appearance among the visual attributes of these images, which also introduces the issue of the semantic gap. In this article, we recommend a novel method established on the bag-of-words (BoW) model, which perform visual words integration of the local intensity order pattern (LIOP) feature and local binary pattern variance (LBPV) feature to reduce the issue of the semantic gap and enhance the performance of the content-based image retrieval (CBIR). The recommended method uses LIOP and LBPV features to build two smaller size visual vocabularies (one from each feature), which are integrated together to build a larger size of the visual vocabulary, which also contains complementary features of both descriptors. Because for efficient CBIR, the smaller size of the visual vocabulary improves the recall, while the bigger size of the visual vocabulary improves the precision or accuracy of the CBIR. The comparative analysis of the recommended method is performed on three image databases, namely, WANG-1K, WANG-1.5K and Holidays. The experimental analysis of the recommended method on these image databases proves its robust performance as compared with the recent CBIR methods.

Download Full-text