Creating a Concurrent Overflowing Bloom Filter

10.14293/s2199-1006.1.sor-.ppf4wcp.v1 ◽

2019 ◽

Author(s):

Alex Berliner ◽

Brian Estes ◽

Ebin Scaria

Keyword(s):

Data Structure ◽

Recent Literature ◽

Bloom Filter ◽

Bloom Filters ◽

Probabilistic Data ◽

Additional Element ◽

Marginal Value ◽

The Creation ◽

Probabilistic Data Structure

Bloom filters are an efficient probabilistic data structure used to test whether an element is a member of a set. Each additional element inserted into a Bloom filter has diminishing marginal value, so steps must be taken to maintain scalability. One option is to create a secondary hash set, known as an overflow area, for any hash set in the Bloom filter that has become full. At this time, no Bloom filter implementation provides this overflow system while maintaining concurrency. In this paper, we demonstrate the creation of a concurrent overflow system for Bloom filters. We use the base Bloom filter presented in recent literature and replace its method of dynamically resizing the Bloom filter with our overflow table implementation, which that work suggested as an area for future exploration. We then compare the results of our Bloom filter with those of the previously mentioned implementation and of a standard Bloom filter.
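The overflow idea in this abstract can be illustrated with a small sketch. It is a toy, single-threaded illustration and not the authors' concurrent implementation: the class name `OverflowBloomFilter`, the `max_fill` threshold, and the SHA-256-based indexing are all assumptions made for this example. Each hash function owns one partition (a "hash set"), and when a partition's active area passes the fill threshold, a new overflow area is appended for it.

```python
import hashlib


class OverflowBloomFilter:
    """Toy partitioned Bloom filter with per-partition overflow areas.

    Hypothetical sketch: single-threaded, with no concurrency control.
    """

    def __init__(self, num_partitions=3, bits_per_partition=64, max_fill=0.5):
        self.k = num_partitions
        self.m = bits_per_partition
        self.max_fill = max_fill  # assumed threshold for calling an area "full"
        # Each partition is a list of areas: index 0 is the primary area,
        # later entries are overflow areas. Each area stores 0/1 flags.
        self.partitions = [[bytearray(self.m)] for _ in range(self.k)]

    def _index(self, item, partition, level):
        # Derive one slot index per (partition, level) pair from SHA-256.
        data = f"{partition}:{level}:{item}".encode("utf-8")
        digest = hashlib.sha256(data).digest()
        return int.from_bytes(digest[:8], "big") % self.m

    def add(self, item):
        for p, levels in enumerate(self.partitions):
            # Open a new overflow area once the active area is full enough.
            if sum(levels[-1]) >= self.max_fill * self.m:
                levels.append(bytearray(self.m))
            active = len(levels) - 1
            levels[active][self._index(item, p, active)] = 1

    def __contains__(self, item):
        # Every partition must report the item in its primary area
        # or in one of its overflow areas.
        return all(
            any(area[self._index(item, p, i)] for i, area in enumerate(levels))
            for p, levels in enumerate(self.partitions)
        )


bloom = OverflowBloomFilter()
for n in range(200):
    bloom.add(f"item-{n}")
```

Because an inserted item always sets a bit in the area that was active at insertion time, and lookups check every area, this sketch never yields a false negative. The cost is that lookups in a heavily overflowed partition inspect more areas, and each extra area adds to the false positive probability.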


Malicious Website Detection Using Probabilistic Data Structure Bloom Filter

2019 3rd International Conference on Computing Methodologies and Communication (ICCMC) ◽

10.1109/iccmc.2019.8819818 ◽

2019 ◽

Author(s):

K. Nandhini ◽

Ramesh Balasubramaniam

Keyword(s):

Data Structure ◽

Bloom Filter ◽

Probabilistic Data ◽

Probabilistic Data Structure


Privacy-Enhanced Robust Image Hashing with Bloom Filters

Journal of Cyber Security and Mobility ◽

10.13052/jcsm2245-1439.1014 ◽

2021 ◽

Author(s):

Uwe Breidenbach ◽

Martin Steinebach ◽

Huajian Liu

Keyword(s):

Data Structure ◽

Privacy Protection ◽

Structural Information ◽

Bloom Filter ◽

Error Rates ◽

Bloom Filters ◽

Image Hashing ◽

Robust Image ◽

Probabilistic Data Structure ◽

Robust Image Hashing

Robust image hashes are used to detect known illegal images, even after image processing. This is useful, for example, in a forensic investigation, or for a company that wants to protect its employees and customers by filtering content. The disadvantage of robust hashes is that they leak structural information about the pictures, which can lead to privacy issues. Our scientific contribution is to extend a robust image hash with privacy protection. We thus introduce and discuss such a privacy-preserving concept. The approach uses a probabilistic data structure, known as a Bloom filter, to store robust image hashes. Bloom filters store elements by mapping hashes of each element to an internal data structure. We choose a cryptographic hash function to one-way encrypt and store elements. The privacy of the inserted elements is thus protected. We evaluate our implementation and compare it to its underlying robust image hashing algorithm. In doing so, we show the cost, in terms of error rates, of introducing privacy protection into robust hashing. Finally, we discuss our approach's results and usability, and suggest possible future improvements.
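The storage step described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the class name `Sha256BloomFilter`, the choice of SHA-256, the double-hashing scheme used to derive `num_hashes` positions, and the example hash bytes are all hypothetical. The filter keeps only bit positions derived from a cryptographic digest, so the robust hash itself is never stored.

```python
import hashlib


class Sha256BloomFilter:
    """Bloom filter that stores only bits derived from SHA-256 digests."""

    def __init__(self, num_bits=1024, num_hashes=4):
        self.m = num_bits
        self.k = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, element: bytes):
        # Split one digest into two 64-bit values and combine them
        # (double hashing) to obtain k bit positions.
        digest = hashlib.sha256(element).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force an odd step
        return [(h1 + i * h2) % self.m for i in range(self.k)]

    def add(self, element: bytes):
        for pos in self._positions(element):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, element: bytes):
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(element)
        )


# Hypothetical 64-bit robust image hash, used only as example input.
robust_hash = bytes.fromhex("a3f19c0b5d2e7714")
known_images = Sha256BloomFilter()
known_images.add(robust_hash)
```

One consequence of this design is that a lookup succeeds only for an exactly matching input, since any changed bit in the robust hash yields an unrelated digest. That is consistent with the abstract's point that privacy protection comes at a cost in error rates.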


Probabilistic data structure-based community detection and storage scheme in online social networks

Future Generation Computer Systems ◽

10.1016/j.future.2018.11.026 ◽

2019 ◽

Vol 94 ◽

pp. 173-184 ◽

Cited By ~ 4

Author(s):

Amritpal Singh ◽

Sahil Garg ◽

Shalini Batra ◽

Neeraj Kumar

Keyword(s):

Social Networks ◽

Data Structure ◽

Community Detection ◽

Online Social Networks ◽

Probabilistic Data ◽

Storage Scheme ◽

Probabilistic Data Structure ◽

And Storage


A Load Balancing Mechanism Using Bloom Filter in Storm System

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.543-547.1972 ◽

2014 ◽

Vol 543-547 ◽

pp. 1972-1976

Author(s):

Huai Lin Dong ◽

Ming Yuan He ◽

Qing Feng Wu ◽

Sheng Hang Wu

Keyword(s):

Data Structure ◽

Load Balancing ◽

Real Time ◽

Data Transmission ◽

Streaming Media ◽

Bloom Filter ◽

Time System ◽

Real Time System ◽

Membership Queries ◽

Probabilistic Data Structure

When membership queries are evaluated against a set, performance can be improved by a Bloom filter, which is a space-efficient probabilistic data structure. Because of this space efficiency, a Bloom filter is presented to address the load balancing problem for streaming media information in the Storm system, a free and open-source distributed real-time computation system. This method increases the availability of the server cluster by balancing the workloads among the servers within the cluster. Additionally, it improves the efficiency of the real-time Storm system by saving data transmission time and reducing computational complexity.


A neural data structure for novelty detection

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1814448115 ◽

2018 ◽

Vol 115 (51) ◽

pp. 13093-13098 ◽

Cited By ~ 7

Author(s):

Sanjoy Dasgupta ◽

Timothy C. Sheehan ◽

Charles F. Stevens ◽

Saket Navlakha

Keyword(s):

Data Structure ◽

Computer Science ◽

Novelty Detection ◽

Bloom Filter ◽

Fruit Fly ◽

Fruit Flies ◽

Bloom Filters ◽

Neural Data ◽

Biological Problem ◽

Computational Systems

Novelty detection is a fundamental biological problem that organisms must solve to determine whether a given stimulus departs from those previously experienced. In computer science, this problem is solved efficiently using a data structure called a Bloom filter. We found that the fruit fly olfactory circuit evolved a variant of a Bloom filter to assess the novelty of odors. Compared with a traditional Bloom filter, the fly adjusts novelty responses based on two additional features: the similarity of an odor to previously experienced odors and the time elapsed since the odor was last experienced. We elaborate and validate a framework to predict novelty responses of fruit flies to given pairs of odors. We also translate insights from the fly circuit to develop a class of distance- and time-sensitive Bloom filters that outperform prior filters when evaluated on several biological and computational datasets. Overall, our work illuminates the algorithmic basis of an important neurobiological problem and offers strategies for novelty detection in computational systems.
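To make the time-sensitive idea concrete, here is a small sketch. It is not the fly-inspired construction from the paper: it substitutes a simple timestamped Bloom filter, and the class name `TimeSensitiveBloomFilter`, the decay constant `tau`, and the exponential novelty formula are assumptions chosen for illustration. Each slot stores the time it was last written instead of a single bit, so the novelty score of a previously seen item rises as time passes. Distance sensitivity is not covered here.

```python
import hashlib
import math


class TimeSensitiveBloomFilter:
    """Timestamped Bloom filter: novelty grows with time since last seen."""

    def __init__(self, num_slots=256, num_hashes=3, tau=10.0):
        self.m = num_slots
        self.k = num_hashes
        self.tau = tau  # assumed time scale for novelty recovery
        self.last_seen = [None] * num_slots

    def _positions(self, item: str):
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force an odd step
        return [(h1 + i * h2) % self.m for i in range(self.k)]

    def observe(self, item: str, now: float):
        # Record the observation time in every slot the item maps to.
        for pos in self._positions(item):
            self.last_seen[pos] = now

    def novelty(self, item: str, now: float) -> float:
        stamps = [self.last_seen[pos] for pos in self._positions(item)]
        if any(stamp is None for stamp in stamps):
            return 1.0  # at least one slot was never written: fully novel
        # The oldest stamp is the best available estimate of when the
        # item was last seen; collisions can only make it look more recent.
        elapsed = now - min(stamps)
        return 1.0 - math.exp(-elapsed / self.tau)


odors = TimeSensitiveBloomFilter()
odors.observe("rose", now=0.0)
score_soon = odors.novelty("rose", now=5.0)
score_later = odors.novelty("rose", now=50.0)
```

In this sketch `score_later` exceeds `score_soon`, reflecting the abstract's observation that novelty responses depend on the time elapsed since the odor was last experienced.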
