Developing Bloom Filters for Web Archives’ Holdings

In this paper the tasks of managing the directory in coherence maintenance systems in multiprocessor systems with a large number of processors are solved. In microprocessor systems with a large number of processors (MSLP) the problem of maintaining the coherence of processor caches is significantly complicated. This is due to increased traffic on the memory buses and increased complexity of interprocessor communications. This problem is solved in various ways. In this paper, we propose the use of Bloom filters used to accelerate the determination of an element’s belonging to a certain array. In this article, such filters are used to establish the fact that the processor belongs to some subset of the processors and determine if the processor has a cache line in the set. In the paper, the processes of writing and reading information in the data shared between processors are discussed in detail, as well as the process of data replacement from private caches. The article also shows how the addresses of cache lines and processor numbers are removed from the Bloom filters. The system proposed in this paper allows significantly speeding up the implementation of operations to maintain cache coherence in the MSLP as compared to conventional systems. In terms of performance and additional hardware and software costs, the proposed system is not inferior to the most efficient of similar systems, but on some applications and significantly exceeds them.

Download Full-text

Privacy-Preserving Crowd-Monitoring Using Bloom Filters and Homomorphic Encryption

Proceedings of the 4th International Workshop on Edge Systems, Analytics and Networking ◽

10.1145/3434770.3459735 ◽

2021 ◽

Author(s):

Valeriu-Daniel Stanciu ◽

Maarten van Steen ◽

Ciprian Dobre ◽

Andreas Peter

Keyword(s):

Homomorphic Encryption ◽

Privacy Preserving ◽

Bloom Filters ◽

Crowd Monitoring

Download Full-text

Securing Bloom Filters for Privacy-preserving Record Linkage

Proceedings of the 29th ACM International Conference on Information & Knowledge Management ◽

10.1145/3340531.3412105 ◽

2020 ◽

Author(s):

Thilina Ranbaduge ◽

Rainer Schnell

Keyword(s):

Record Linkage ◽

Privacy Preserving ◽

Bloom Filters

Download Full-text

A Time-aware Random Walk Model for Finding Important Documents in Web Archives

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '15 ◽

10.1145/2766462.2767832 ◽

2015 ◽

Cited By ~ 7

Author(s):

Tu Ngoc Nguyen ◽

Nattiya Kanhabua ◽

Claudia Niederée ◽

Xiaofei Zhu

Keyword(s):

Random Walk ◽

Random Walk Model ◽

Web Archives ◽

Time Aware

Download Full-text

The Neil deGrasse Tyson Problem: Methods for Exploring Base Memes in Web Archives

International Conference on Social Media and Society ◽

10.1145/3400806.3400836 ◽

2020 ◽

Author(s):

Amelia Acker ◽

Anne C. Loos ◽

Julia Sufrin

Keyword(s):

Web Archives

Download Full-text

Implementation of bloom filters in reconfigurable hardware for tracing network attacks

IFAC Proceedings Volumes ◽

10.1016/s1474-6670(17)30203-3 ◽

2006 ◽

Vol 39 (21) ◽

pp. 310-315

Author(s):

Maciej Wołowiec ◽

Jakub Botwicz ◽

Piotr Sapiecha

Keyword(s):

Reconfigurable Hardware ◽

Bloom Filters ◽

Network Attacks

Download Full-text

Providing quality-of-service for frequency-aware Wi-Fi using OFDM-based variable-length Bloom filters

EURASIP Journal on Wireless Communications and Networking ◽

10.1186/1687-1499-2014-152 ◽

2014 ◽

Vol 2014 (1) ◽

Cited By ~ 2

Author(s):

Suchul Lee ◽

Jaehyuk Choi ◽

Joon Yoo ◽

Chong-Kwon Kim

Keyword(s):

Quality Of Service ◽

Variable Length ◽

Bloom Filters

Download Full-text

K-Mer Counting Using Bloom Filters with an FPGA-Attached HMC

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) ◽

10.1109/fccm.2017.23 ◽

2017 ◽

Cited By ~ 9

Author(s):

Nathaniel Mcvicar ◽

Chih-Ching Lin ◽

Scott Hauck

Keyword(s):

Bloom Filters

Download Full-text

A Study of Availability and Recovery of URLs in Library and Information Science Scholarly Journals

Asian Journal of Information Science and Technology ◽

10.51983/ajist-2020.10.1.297 ◽

2020 ◽

Vol 10 (1) ◽

pp. 51-61

Author(s):

B. Niveditha ◽

Mallinath Kumbar

Keyword(s):

Impact Factor ◽

Information Science ◽

Editorial Staff ◽

Time Travel ◽

High Impact ◽

Library And Information Science ◽

High Impact Factor ◽

Scholarly Journals ◽

Web Archives ◽

Research Findings

The present study examines the availability and recovery of web references cited in scholarly journals selected based on their high impact factor published between 2008 and 2017. A PHP script was used to crawl the Uniform Resource Locators (URL) collected from the references. A total of 5720 articles were downloaded and 237418 references were extracted. A total of 33512 URLs were checked for their availability. Further the lexical features of URLs like file extension, path depth, character length and top-level domain was determined. The research findings indicated that out of 33512 web references, 20218 contained URLs, DOIs were found in 12799 references and 495 references contained arXiv or WOS identifier. It was found that 29760 URLs were accessible and the remaining 3752 URLs were missing. Most errors were due to HTTP 404 error code (Not found error). The study also tried to recover the inaccessible URLs through Time Travel. Almost 60.55% of inaccessible URLs were archived in various web archives. The findings of the study will be helpful to authors, publishers, and editorial staff to ensure that web references will be accessible in future.

Download Full-text