probabilistic data structure Latest Research Papers

Locality Sensitive Hardware Signature Variants for Hardware Transactional Memory

Lock-based techniques have limitations such as priority inversion, convoying, and deadlock. Lock-free techniques overcome these limitations. Transactional memory (TM) is a leading lock-free technique used in recent multi-core processors such as Intel Haswell and IBM BlueGene/Q. TM must perform data versioning and conflict detection. For conflict detection, TM uses hardware signatures based on a probabilistic data structure called the Bloom filter (BF). Shared-memory conflicts such as RAW, WAR, and WAW hazards are handled by Bloom filters. Hardware signatures store memory addresses in hashed form in Bloom filters. Bloom filters are easy-to-use, efficient data structures that can produce false positives but never false negatives. Locality-sensitive hardware signatures reduce filter occupancy by sharing bits among contiguous memory addresses, which in turn reduces the false positive rate. This paper implements the existing H3-HS and LS-HS proposed by Ricardo Quislant et al. [13]. It also proposes RS-HS, CS-HS, and RO-HS. RO-HS spreads addresses equally among Bloom filters and thereby reduces filter occupancy, which in turn leads to a better false positive rate.
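As a rough illustration of signature-based conflict detection, the sketch below models a read signature and a write signature as small Bloom filters over addresses. It is a software stand-in under stated assumptions: the `Signature` class, the `conflicts` helper, the 64-bit filter size, and the two hash functions are all hypothetical and are not the H3, LS, RS, CS, or RO signatures evaluated in the paper.

```python
class Signature:
    """Toy Bloom-filter signature over memory addresses (illustrative only)."""

    def __init__(self, m_bits: int = 64):
        self.m = m_bits
        self.bits = 0  # Python int used as a bit vector

    def _positions(self, addr: int):
        # Two simple, assumed hash functions (not the H3 family).
        return (addr % self.m, ((addr * 2654435761) >> 7) % self.m)

    def insert(self, addr: int) -> None:
        for p in self._positions(addr):
            self.bits |= 1 << p

    def may_contain(self, addr: int) -> bool:
        # True means "possibly present"; False means "definitely absent".
        return all((self.bits >> p) & 1 for p in self._positions(addr))


def conflicts(addr: int, is_write: bool, other_reads: Signature,
              other_writes: Signature) -> bool:
    # Any access to an address the other transaction wrote conflicts (RAW/WAW).
    if other_writes.may_contain(addr):
        return True
    # A write to an address the other transaction read conflicts (WAR).
    return is_write and other_reads.may_contain(addr)


# Another transaction has read 0x1000 and written 0x2000.
t1_reads, t1_writes = Signature(), Signature()
t1_reads.insert(0x1000)
t1_writes.insert(0x2000)

print(conflicts(0x2000, False, t1_reads, t1_writes))  # True: real RAW conflict
print(conflicts(0x1000, True, t1_reads, t1_writes))   # True: real WAR conflict
print(conflicts(0x1000, False, t1_reads, t1_writes))  # False: read/read is fine
print(conflicts(0x4000, False, t1_reads, t1_writes))  # True: false positive
```

The last call reports a conflict for an address that was never written, because 0x4000 happens to hash to the same bits as 0x2000 in this toy filter. Real conflicts are never missed, which is the property that makes such signatures safe for conflict detection.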

Privacy-Enhanced Robust Image Hashing with Bloom Filters

Journal of Cyber Security and Mobility ◽ 10.13052/jcsm2245-1439.1014 ◽ 2021

Author(s): Uwe Breidenbach ◽ Martin Steinebach ◽ Huajian Liu

Keyword(s): Data Structure ◽ Privacy Protection ◽ Structural Information ◽ Bloom Filter ◽ Error Rates ◽ Bloom Filters ◽ Image Hashing ◽ Robust Image ◽ Probabilistic Data Structure ◽ Robust Image Hashing

Robust image hashes are used to detect known illegal images, even after image processing. This is useful, for example, in a forensic investigation, or for a company that wants to protect its employees and customers by filtering content. The disadvantage of robust hashes is that they leak structural information about the pictures, which can lead to privacy issues. Our scientific contribution is to extend a robust image hash with privacy protection. We introduce and discuss such a privacy-preserving concept. The approach uses a probabilistic data structure known as a Bloom filter to store robust image hashes. Bloom filters store elements by mapping hashes of each element to an internal data structure. We choose a cryptographic hash function to one-way encrypt and store elements. The privacy of the inserted elements is thus protected. We evaluate our implementation and compare it to its underlying robust image hashing algorithm. In doing so, we show the cost, in terms of error rates, of introducing privacy protection into robust hashing. Finally, we discuss our approach's results and usability, and suggest possible future improvements.
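The storage idea described above can be sketched as follows. This is a minimal Bloom filter that derives bit positions from SHA-256, assuming a filter of 1024 bits and 4 positions per element; the `BloomFilter` class, these parameters, and the sample hash values are hypothetical and do not reproduce the authors' implementation or their handling of near-matching hashes.

```python
import hashlib


class BloomFilter:
    """Minimal Bloom filter: m bits, k positions per element from SHA-256."""

    def __init__(self, m_bits: int = 1024, k_hashes: int = 4):
        self.m = m_bits
        self.k = k_hashes
        self.bits = bytearray(m_bits)  # one byte per bit, for readability

    def _positions(self, item: bytes):
        # Prefix the input with the hash index to obtain k different digests.
        for i in range(self.k):
            digest = hashlib.sha256(bytes([i]) + item).digest()
            yield int.from_bytes(digest[:8], "big") % self.m

    def add(self, item: bytes) -> None:
        for pos in self._positions(item):
            self.bits[pos] = 1

    def __contains__(self, item: bytes) -> bool:
        return all(self.bits[pos] for pos in self._positions(item))


# Made-up 64-bit robust hashes standing in for known images.
known = [bytes.fromhex("a3f1c29b7e5504d8"), bytes.fromhex("0f0e0d0c0b0a0908")]
bf = BloomFilter()
for h in known:
    bf.add(h)

print(all(h in bf for h in known))  # True: inserted elements are always found
```

Only bit positions derived from the digest are kept, so the filter does not hold the robust hashes themselves. Lookups for elements that were never inserted can still succeed by chance, which is the error-rate cost the abstract refers to.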

Bloom hash probabilistic data structure and Benaloh Cryptosystem for secured data storage and access control in cloud

Materials Today Proceedings ◽ 10.1016/j.matpr.2021.01.864 ◽ 2021

Author(s): P. Calista Bebe ◽ D. Akila

Keyword(s): Data Structure ◽ Access Control ◽ Data Storage ◽ Probabilistic Data ◽ Probabilistic Data Structure ◽ Secured Data

Integrated Probabilistic Data Structure For Accurate and Scalable Sequence Prediction

Procedia Computer Science ◽ 10.1016/j.procs.2020.03.437 ◽ 2020 ◽ Vol 167 ◽ pp. 2429-2436

Author(s): Soumonos Mukherjee ◽ Uddipta Dutta ◽ Jit Sarkar ◽ Rajkumar R

Keyword(s): Data Structure ◽ Probabilistic Data ◽ Sequence Prediction ◽ Probabilistic Data Structure

GloBiMaps - A Probabilistic Data Structure for In-Memory Processing of Global Raster Datasets

Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems - SIGSPATIAL '19 ◽ 10.1145/3347146.3359086 ◽ 2019

Author(s): Martin Werner

Keyword(s): Data Structure ◽ Probabilistic Data ◽ Memory Processing ◽ Probabilistic Data Structure

Moore Data Clustering Based Bloom Hash Storage for Dimensionality Reduction of Big Data Analytics

International Journal of Recent Technology and Engineering - 2 ◽ 10.35940/ijrte.c6652.098319 ◽ 2019 ◽ Vol 8 (3) ◽ pp. 8178-8184

Keyword(s): Big Data ◽ Data Clustering ◽ Data Analytics ◽ Clustered Data ◽ Big Data Analytics ◽ Space Complexity ◽ Weather Data ◽ Efficient Performance ◽ Probabilistic Data Structure ◽ Hashing Function

Big data contains massive amounts of information that are difficult to manage, acquire, store, and analyze. Clustering data is a demanding problem in big data analytics. Existing clustering techniques do not perform efficiently and have high time complexity. Further, reducing the dimensionality of big data has not been addressed effectively. To overcome these limitations, a Moore Data Clustering based Bloom Hash Storage (MDC-BHS) technique is proposed. The MDC-BHS technique is designed to reduce the dimensionality of big data in less time through clustering. It uses the Moore Data Clustering (MDC) model to group the data in a big dataset with minimum time consumption. After clustering, it employs the Bloom Hash Storage (BHS) model to store the clustered data with minimum space complexity. The BHS model is a space-efficient probabilistic data structure that uses a hashing function to create hash values for the clustered data. The proposed MDC-BHS technique therefore significantly reduces the dimensionality of large datasets. The experimental evaluation is carried out on weather data, measuring clustering time, clustering accuracy, and space complexity with respect to the number of data items. The experimental results demonstrate that the MDC-BHS technique improves clustering accuracy and reduces space complexity compared to state-of-the-art works.

Creating a Concurrent Overflowing Bloom Filter

10.14293/s2199-1006.1.sor-.ppf4wcp.v1 ◽ 2019

Author(s): Alex Berliner ◽ Brian Estes ◽ Ebin Scaria

Keyword(s): Data Structure ◽ Recent Literature ◽ Bloom Filter ◽ Bloom Filters ◽ Probabilistic Data ◽ Additional Element ◽ Marginal Value ◽ The Creation ◽ Probabilistic Data Structure

Bloom filters are an efficient probabilistic data structure used to verify membership of an element inside of a set. There is diminishing marginal value for inserting each additional element into a Bloom filter, and so steps must be taken to maintain scalability. One such option is to create a secondary hash set for a particular hash set in a Bloom filter that has become full, known as an overflow area. At this time, there are no implementations of a Bloom filter that implement this overflow system while maintaining concurrency. In this paper, we demonstrate the creation of a concurrent overflow system for Bloom filters. We use the base Bloom filter presented in recent literature and replace their method of dynamically resizing the Bloom filters with our overflow table implementation, as outlined in one of their suggested areas for future exploration. We then compare the results of our Bloom filter with those from the previously mentioned implementation as well as a standard Bloom filter.
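The diminishing marginal value of each additional insertion can be made concrete with the standard textbook approximation for a plain Bloom filter. The symbols are introduced here for illustration and are not taken from the paper: $m$ is the number of bits, $k$ the number of hash functions, $n$ the number of inserted elements, and $p$ the false positive probability. The formula describes an ordinary Bloom filter, not the overflow design the authors propose.

```latex
p \approx \left(1 - e^{-kn/m}\right)^{k},
\qquad
k_{\text{opt}} = \frac{m}{n}\,\ln 2
```

For fixed $m$ and $k$, $p$ rises toward 1 as $n$ grows, which is why a filter that keeps accepting elements eventually needs resizing or an overflow area.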

Creating a Concurrent Overflowing Bloom Filter

10.14293/s2199-1006.1.sor-.ppzvfcw.v1 ◽ 2019

Author(s): Alex Berliner ◽ Brian Estes ◽ Ebin Scaria

Keyword(s): Data Structure ◽ Recent Literature ◽ Bloom Filter ◽ Bloom Filters ◽ Probabilistic Data ◽ Additional Element ◽ Marginal Value ◽ The Creation ◽ Probabilistic Data Structure

Bloom filters are an efficient probabilistic data structure used to verify membership of an element inside of a set. There is diminishing marginal value for inserting each additional element into a Bloom filter, and so steps must be taken to maintain scalability. One such option is to create a secondary hash set for a particular hash set in a Bloom filter that has become full, known as an overflow area. At this time, there are no implementations of a Bloom filter that implement this overflow system while maintaining concurrency. In this paper, we demonstrate the creation of a concurrent overflow system for Bloom filters. We use the base Bloom filter presented in recent literature and replace their method of dynamically resizing the Bloom filters with our overflow table implementation, as outlined in one of their suggested areas for future exploration. We then compare the results of our Bloom filter with those from the previously mentioned implementation as well as a standard Bloom filter.

Probabilistic data structure-based community detection and storage scheme in online social networks

Future Generation Computer Systems ◽ 10.1016/j.future.2018.11.026 ◽ 2019 ◽ Vol 94 ◽ pp. 173-184 ◽ Cited By ~ 4

Author(s): Amritpal Singh ◽ Sahil Garg ◽ Shalini Batra ◽ Neeraj Kumar

Keyword(s): Social Networks ◽ Data Structure ◽ Community Detection ◽ Online Social Networks ◽ Probabilistic Data ◽ Storage Scheme ◽ Probabilistic Data Structure ◽ And Storage

Provably Secure Private Set Intersection With Constant Communication Complexity

International Journal of Cyber Warfare and Terrorism ◽ 10.4018/ijcwt.2019040104 ◽ 2019 ◽ Vol 9 (2) ◽ pp. 39-64

Author(s): Sumit Kumar Debnath

Keyword(s): Communication Complexity ◽ Bloom Filter ◽ Security And Privacy ◽ Data Sets ◽ Indistinguishability Obfuscation ◽ Security Parameter ◽ Private Data ◽ Set Intersection ◽ Private Set Intersection ◽ Probabilistic Data Structure

Electronic information is increasingly shared among unreliable entities. In this context, one interesting problem involves two parties that want to secretly determine the intersection of their respective private data sets while neither wishes to disclose its whole set to the other. A Private Set Intersection (PSI) protocol can address this problem while preserving security and privacy. In this article, the authors present the first PSI protocol that incurs constant (p(k)) communication complexity with linear computation overhead and is fast even for large input sets, where p(k) is a polynomial in the security parameter k. The security of the scheme is proven in the standard model against semi-honest entities. The authors combine a somewhere statistically binding (SSB) hash function with indistinguishability obfuscation (iO) and the Bloom filter, a space-efficient probabilistic data structure, to design the scheme.

probabilistic data structure
Recently Published Documents

Locality Sensitive Hardware Signature Variants for Hardware Transactional Memory

Privacy-Enhanced Robust Image Hashing with Bloom Filters

Bloom hash probabilistic data structure and Benaloh Cryptosystem for secured data storage and access control in cloud

Integrated Probabilistic Data Structure For Accurate and Scalable Sequence Prediction

GloBiMaps - A Probabilistic Data Structure for In-Memory Processing of Global Raster Datasets

Moore Data Clustering Based Bloom Hash Storage for Dimensionality Reduction of Big Data Analytics

Creating a Concurrent Overflowing Bloom Filter

Creating a Concurrent Overflowing Bloom Filter

Probabilistic data structure-based community detection and storage scheme in online social networks

Provably Secure Private Set Intersection With Constant Communication Complexity
