distributed caching
Recently Published Documents


TOTAL DOCUMENTS

150
(FIVE YEARS 28)

H-INDEX

21
(FIVE YEARS 4)

2022 ◽  
pp. 245-261
Author(s):  
Geetha J. J. ◽  
Jaya Lakshmi D. S. ◽  
Keerthana Ningaraju L. N.

Distributed caching is one such system used by dynamic high-traffic websites to process the incoming user requests to perform the required tasks in an efficient way. Distributed caching is currently employing hashing algorithm in order to serve its purpose. A significant drawback of hashing in this circumstance is the addition of new servers that would result in a change in the previous hashing method (rehashing), hence, goes into a rigmarole. Thus, we need an effective algorithm to address the problem. This technique has served as a solution for distributed and rehashing problems. Most of upcoming internet of things will have to be latency aware and will not afford the data transmission and computation time in the cloud servers. The real-time processing in proximal distance device would be much needed. Hence, the authors aim to employ a real-time task scheduling algorithm. Computations referring to the user requests that are to be handled by the servers can be efficiently handled by consistent hashing algorithms.


Author(s):  
Bahman Abolhassani ◽  
John Tadrous ◽  
Atilla Eryilmaz

Author(s):  
Tiancheng Qin ◽  
S. Rasoul Etesami

We consider a generalization of the standard cache problem called file-bundle caching, where different queries (tasks), each containing l ≥ 1 files, sequentially arrive. An online algorithm that does not know the sequence of queries ahead of time must adaptively decide on what files to keep in the cache to incur the minimum number of cache misses. Here a cache miss refers to the case where at least one file in a query is missing among the cache files. In the special case where l = 1, this problem reduces to the standard cache problem. We first analyze the performance of the classic least recently used (LRU) algorithm in this setting and show that LRU is a near-optimal online deterministic algorithm for file-bundle caching with regard to competitive ratio. We then extend our results to a generalized ( h,k )-paging problem in this file-bundle setting, where the performance of the online algorithm with a cache size k is compared to an optimal offline benchmark of a smaller cache size h < k . In this latter case, we provide a randomized O ( l ln k / k-h )-competitive algorithm for our generalized ( h, k )-paging problem, which can be viewed as an extension of the classic marking algorithm . We complete this result by providing a matching lower bound for the competitive ratio, indicating that the performance of this modified marking algorithm is within a factor of 2 of any randomized online algorithm. Finally, we look at the distributed version of the file-bundle caching problem where there are m ≥ 1 identical caches in the system. In this case, we show that for m = l + 1 caches, there is a deterministic distributed caching algorithm that is ( l 2 + l )-competitive and a randomized distributed caching algorithm that is O ( l ln ( 2l + 1)-competitive when l ≥ 2. We also provide a general framework to devise other efficient algorithms for the distributed file-bundle caching problem and evaluate the performance of our results through simulations.


2021 ◽  
Vol 251 ◽  
pp. 02052
Author(s):  
Robert Currie ◽  
Wenlong Yuan

To optimise the performance of distributed compute, smaller lightweight storage caches are needed which integrate with existing grid computing workflows. A good solution to provide lightweight storage caches is to use an XRootD-proxy cache. To support distributed lightweight XRootD proxy services across GridPP we have developed a centralised monitoring framework. With the v5 release of XRootD it is possible to build a monitoring framework which collects distributed caching metadata broadcast from multiple sites. To provide the best support for these distributed caches we have built a centralised monitoring service for XRootD storage instances within GridPP. This monitoring solution is built upon experiences presented by CMS in setting up a similar service as part of their AAA system. This new framework is designed to provide remote monitoring of the behaviour, performance, and reliability of distributed XRootD services across the UK. Effort has been made to simplify ease of deployment by remote site administrators. The result of this work is an interactive dashboard system which enables administrators to access real-time metrics on the performance of their lightweight storage systems. This monitoring framework is intended to supplement existing functionality and availability testing metrics by providing detailed information and logging from a site perspective.


Author(s):  
Lalhruaizela Chhangte ◽  
Nikhil Karamchandani ◽  
D Manjunath ◽  
Emanuele Viterbo
Keyword(s):  

2020 ◽  
Vol 66 (1) ◽  
pp. 66-77
Author(s):  
Wei Zhang ◽  
Jian Xiong ◽  
Lin Gui ◽  
Bo Liu ◽  
Meikang Qiu ◽  
...  

2020 ◽  
Vol 245 ◽  
pp. 04043
Author(s):  
B. Galewsky ◽  
R. Gardner ◽  
L. Gray ◽  
M. Neubauer ◽  
J. Pivarski ◽  
...  

We will describe a component of the Intelligent Data Delivery Service being developed in collaboration with IRIS-HEP and the LHC experiments. ServiceX is an experiment-agnostic service to enable on-demand data delivery specifically tailored for nearly-interactive vectorized analysis. This work is motivated by the data engineering challenges posed by HL-LHC data volumes and the increasing popularity of python and Spark-based analysis workflows. ServiceX gives analyzers the ability to query events by dataset metadata. It uses containerized transformations to extract just the data required for the analysis. This operation is colocated with the data to avoid transferring unnecessary branches over the WAN. Simple filtering operations are supported to further reduce the amount of data transferred. Transformed events are cached in a columnar datastore to accelerate delivery of subsequent similar requests. ServiceX will learn commonly related columns and automatically include them in the transformation to increase the potential for cache hits by other users. Selected events are streamed to the analysis system using an efficient wire protocol that can be readily consumed by a variety of computational frameworks. This reduces time-to-insight for physics analysis by delegating to ServiceX the complexity of event selection, slimming, reformatting, and streaming.


Sign in / Sign up

Export Citation Format

Share Document