Massive picture retrieval system based on big data image mining

Recently, the increasing use of mobile devices, such as cameras and smartphones, has resulted in a dramatic increase in the number of images collected every day. Retrieving and managing these large volumes of images has therefore become a major challenge in the field of computer vision. One solution for efficiently managing image databases is a Content-Based Image Retrieval (CBIR) system. In this chapter, we introduce some fundamental theories of content-based image retrieval for large-scale databases using parallel frameworks. Sections 2 and 3 present the basic methods of content-based image retrieval. Then, as the emphasis of this chapter, we introduce in Section 1.2 a content-based image retrieval system for large-scale image databases. After that, we briefly address Big Data and Big Data processing platforms for large-scale image retrieval in Sections 5, 6, 7, and 8. Finally, we draw a conclusion in Section 9.

Big data Curation: Enhanced Information Retrieval System

10.22161/ijaers/nctet.2017.21 ◽

2017 ◽

Author(s):

K. Naresh ◽

A. BasiReddy ◽

S. Swarnalatha

Keyword(s):

Big Data ◽

Information Retrieval ◽

Retrieval System ◽

Information Retrieval System ◽

Data Curation

Locality-Sensitive Hashing for Information Retrieval System on Multiple GPGPU Devices

Applied Sciences ◽

10.3390/app10072539 ◽

2020 ◽

Vol 10 (7) ◽

pp. 2539 ◽

Cited By ~ 1

Author(s):

Toan Nguyen Mau ◽

Yasushi Inoguchi

Keyword(s):

Big Data ◽

Information Retrieval ◽

Retrieval System ◽

Hash Table ◽

Information Retrieval System ◽

Main Memory ◽

Locality Sensitive Hashing ◽

Data Sets ◽

Similar Data ◽

Data Set

It is challenging to build a real-time information retrieval system, especially for systems with high-dimensional big data. To structure big data, many hashing algorithms have been proposed that map similar data items to the same bucket to speed up the search. Locality-Sensitive Hashing (LSH) is a common approach for reducing the number of dimensions of a data set by using a family of hash functions and a hash table. The LSH hash table is an additional component that supports the indexing of hash values (keys) for the corresponding data items. We previously proposed the Dynamic Locality-Sensitive Hashing (DLSH) algorithm with a dynamically structured hash table, optimized for storage in main memory and in General-Purpose computation on Graphics Processing Units (GPGPU) memory. This supports the handling of constantly updated data sets, such as song, image, or text databases. The DLSH algorithm works effectively with data sets that are updated with high frequency and is compatible with parallel processing. However, a single GPGPU device is inadequate for processing big data because of the small memory capacity of GPGPU devices. When using multiple GPGPU devices for searching, we need an effective search algorithm to balance the jobs. In this paper, we propose an extension of DLSH for big data sets using multiple GPGPUs, in order to increase the capacity and performance of the information retrieval system. Different search strategies on multiple DLSH clusters are also proposed to suit our parallelized system. With significant results in terms of performance and accuracy, we show that DLSH can be applied to real-life dynamic database systems.
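The bucket idea described in this abstract can be illustrated with a minimal sketch of plain random-hyperplane LSH on the CPU. This is not the authors' DLSH algorithm and involves no GPGPU code; the names `make_hyperplanes`, `lsh_key`, and `LSHIndex` are hypothetical and chosen only for illustration.

```python
import math
import random
from collections import defaultdict


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw num_bits random hyperplanes (assumed Gaussian) in dim dimensions."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def lsh_key(vector, hyperplanes):
    """Hash a vector to a tuple of bits: one sign test per hyperplane."""
    return tuple(
        1 if sum(v * h for v, h in zip(vector, plane)) >= 0.0 else 0
        for plane in hyperplanes
    )


def cosine(a, b):
    """Cosine similarity, used only to rank candidates inside a bucket."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
    return dot / norm if norm else 0.0


class LSHIndex:
    """Hash table mapping LSH keys to the items that share that key."""

    def __init__(self, num_bits, dim, seed=0):
        self.hyperplanes = make_hyperplanes(num_bits, dim, seed)
        self.table = defaultdict(list)

    def add(self, item_id, vector):
        # Items can be added at any time, so the table grows with the data set.
        self.table[lsh_key(vector, self.hyperplanes)].append((item_id, vector))

    def query(self, vector):
        # Only the matching bucket is scanned, not the whole data set.
        bucket = self.table.get(lsh_key(vector, self.hyperplanes), [])
        ranked = sorted(bucket, key=lambda entry: -cosine(vector, entry[1]))
        return [item_id for item_id, _ in ranked]


index = LSHIndex(num_bits=8, dim=4)
index.add("a", [1.0, 0.5, -0.2, 0.3])
index.add("b", [-1.0, 0.1, 0.9, -0.4])
print(index.query([2.0, 1.0, -0.4, 0.6]))  # a scaled copy of "a" lands in its bucket
```

Because each bit depends only on the sign of a dot product, vectors pointing in the same direction always share a key, while similar but not identical vectors share one with high probability. A single table can miss near neighbours that fall just across a hyperplane, which is why practical systems usually combine several tables.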

An Approach of Semantic Similarity Measure between Documents Based on Big Data

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v6i5.10853 ◽

2016 ◽

Vol 6 (5) ◽

pp. 2454 ◽

Cited By ~ 2

Author(s):

Mohammed Erritali ◽

Abderrahim Beni-Hssane ◽

Marouane Birjali ◽

Youness Madani

Keyword(s):

Big Data ◽

Semantic Similarity ◽

Retrieval System ◽

Programming Model ◽

State Of The Art ◽

Distributed Processing ◽

Similarity Measures ◽

The State ◽

Semantic Similarity Measure ◽

Document Similarity

Semantic indexing and document similarity are an important problem for information retrieval systems in Big Data, with broad applications. In this paper, we investigate the MapReduce programming model as a specific framework for managing distributed processing over a large number of documents. Then we study the state of the art of different approaches for computing the similarity of documents. Finally, we propose our approach to semantic similarity measures using WordNet as an external semantic network resource. For evaluation, we compare the proposed approach with previously presented approaches by using our new MapReduce algorithm. Experimental results reveal that our proposed approach outperforms the state-of-the-art ones in running time and improves the measurement of semantic similarity.
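To make the combination of a MapReduce-style pipeline and a semantic resource more concrete, the following is a minimal single-process sketch. It is not the authors' algorithm: the `CONCEPTS` dictionary is a tiny hypothetical stand-in for a WordNet lookup, the map, shuffle, and reduce steps are ordinary Python functions rather than a distributed job, and cosine similarity over concept counts is an assumed, simplified measure.

```python
import math
from collections import Counter, defaultdict

# Hypothetical stand-in for WordNet: maps a word to a shared concept label.
CONCEPTS = {
    "car": "vehicle",
    "automobile": "vehicle",
    "dog": "animal",
    "puppy": "animal",
}


def map_phase(doc_id, text):
    """Emit (concept, (doc_id, 1)) for every token of one document."""
    for token in text.lower().split():
        yield CONCEPTS.get(token, token), (doc_id, 1)


def shuffle(pairs):
    """Group emitted values by key, as a MapReduce framework would."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(concept, values):
    """Sum the counts of one concept per document."""
    counts = Counter()
    for doc_id, n in values:
        counts[doc_id] += n
    return concept, dict(counts)


def document_vectors(docs):
    """Run map, shuffle, and reduce, then pivot to one concept vector per document."""
    pairs = [p for doc_id, text in docs.items() for p in map_phase(doc_id, text)]
    vectors = defaultdict(dict)
    for concept, values in shuffle(pairs).items():
        _, per_doc = reduce_phase(concept, values)
        for doc_id, n in per_doc.items():
            vectors[doc_id][concept] = n
    return dict(vectors)


def cosine(u, v):
    """Cosine similarity between two sparse concept-count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm = math.sqrt(sum(x * x for x in u.values())) * math.sqrt(
        sum(x * x for x in v.values())
    )
    return dot / norm if norm else 0.0


docs = {
    "d1": "the car stopped",
    "d2": "the automobile stopped",
    "d3": "a puppy barked",
}
vectors = document_vectors(docs)
print(cosine(vectors["d1"], vectors["d2"]))  # synonyms collapse to one concept
print(cosine(vectors["d1"], vectors["d3"]))  # no shared concepts
```

Mapping words to concepts before counting is what lets "car" and "automobile" contribute to the same dimension. A purely lexical comparison would treat them as unrelated terms.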

The Research and Implementation of File Information Retrieval System Based on Big Data Semantic

Proceedings of the Advances in Materials, Machinery, Electrical Engineering (AMMEE 2017) ◽

10.2991/ammee-17.2017.103 ◽

2017 ◽

Author(s):

Zebo Zhu ◽

Baochuan Lin

Keyword(s):

Big Data ◽

Information Retrieval ◽

Retrieval System ◽

Information Retrieval System

Cloud assisted big data information retrieval system for critical data supervision in disaster regions

Computer Communications ◽

10.1016/j.comcom.2019.11.028 ◽

2020 ◽

Vol 151 ◽

pp. 548-555

Author(s):

Chunmei Wang ◽

Fang Qin ◽

Dinesh Jackson Samuel R.

Keyword(s):

Big Data ◽

Information Retrieval ◽

Retrieval System ◽

Information Retrieval System ◽

Critical Data

Mobile based big data design patent image retrieval system via Lp norm deep learning approach

IECON 2015 - 41st Annual Conference of the IEEE Industrial Electronics Society ◽

10.1109/iecon.2015.7392866 ◽

2015 ◽

Author(s):

Jing Su ◽

Bingo W. K. Ling ◽

Qingyun Dai ◽

Jun Xiao ◽

Kim-Fung Tsang

Keyword(s):

Big Data ◽

Deep Learning ◽

Image Retrieval ◽

Retrieval System ◽

Learning Approach ◽

Lp Norm ◽

Design Patent ◽

Image Retrieval System ◽

Data Design
