Reducing Storage Costs of Reconfiguration Contexts by Sharing Instruction Memory Cache Blocks

Author(s):  
Thiago Baldissera Biazus ◽  
Mateus Beck Rutzig

2021 ◽
Vol 17 (2) ◽
pp. 1-45
Author(s):  
Cheng Pan ◽  
Xiaolin Wang ◽  
Yingwei Luo ◽  
Zhenlin Wang

Due to the large data volumes and low-latency requirements of modern web services, an in-memory key-value (KV) cache (e.g., Redis or Memcached) is often an inevitable choice. The in-memory cache holds hot data, reduces request latency, and alleviates the load on backend databases. Inheriting the traditional hardware cache design, many existing KV cache systems still use recency-based cache replacement algorithms, e.g., least recently used (LRU) or its approximations. However, the diversity of miss penalties distinguishes a KV cache from a hardware cache. Inadequate consideration of the miss penalty can substantially compromise space utilization and request service time. KV accesses also demonstrate locality, which needs to be coordinated with miss penalty to guide cache management. In this article, we first discuss how to enhance the existing cache model, the Average Eviction Time model, so that it can model a KV cache. After that, we apply the model to Redis and propose pRedis, Penalty- and Locality-aware Memory Allocation in Redis, which quantitatively synthesizes data locality and miss penalty to guide memory allocation and replacement in Redis. At the same time, we also explore the diurnal behavior of a KV store and exploit long-term reuse. We replace the original passive eviction mechanism with an automatic dump/load mechanism to smooth the transition between access peaks and valleys. Our evaluation shows that pRedis effectively reduces the average and tail access latency with minimal time and space overhead. For both real-world and synthetic workloads, our approach delivers an average of 14.0%∼52.3% latency reduction over a state-of-the-art penalty-aware cache management scheme, Hyperbolic Caching (HC), and shows more quantitative predictability of performance. Moreover, an even lower average latency (by 1.1%∼5.5%) can be obtained by dynamically switching policies between pRedis and HC.
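
The core idea described in the abstract, weighting how long a key has been idle against how expensive it would be to refetch on a miss, can be illustrated with a minimal Python sketch. This is a hedged toy, not the paper's AET-based pRedis implementation: the class name, the penalty/idle-time score, and the monotonic-clock recency tracking are all illustrative assumptions.

```python
import time

class PenaltyAwareCache:
    """Toy KV cache that evicts the key with the lowest (miss penalty / idle time) score.

    Illustrative only: pRedis derives its decisions from an AET-based locality
    model combined with measured miss penalties, not from this simple ratio.
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.data = {}         # key -> value
        self.last_access = {}  # key -> timestamp of last access
        self.penalty = {}      # key -> observed miss penalty (backend fetch latency)

    def get(self, key, fetch_from_backend):
        if key in self.data:
            self.last_access[key] = time.monotonic()
            return self.data[key]
        start = time.monotonic()
        value = fetch_from_backend(key)          # cache miss: pay the penalty
        self.put(key, value, time.monotonic() - start)
        return value

    def put(self, key, value, miss_penalty):
        if key not in self.data and len(self.data) >= self.capacity:
            self._evict()
        self.data[key] = value
        self.last_access[key] = time.monotonic()
        self.penalty[key] = miss_penalty

    def _evict(self):
        now = time.monotonic()
        # Cheap-to-refetch keys that have sat idle for a long time score lowest
        # and are evicted first.
        victim = min(self.data,
                     key=lambda k: self.penalty[k] / (now - self.last_access[k] + 1e-9))
        for table in (self.data, self.last_access, self.penalty):
            del table[victim]
```

A hyperbolic-caching-style scheme would use a similar ratio but with access frequency instead of miss penalty; the point of the sketch is only how penalty and locality can be folded into a single eviction score.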


2021 ◽  
Vol 17 (3) ◽  
pp. 1-35
Author(s):  
Juncheng Yang ◽  
Yao Yue ◽  
K. V. Rashmi

Modern web services use in-memory caching extensively to increase throughput and reduce latency. There have been several workload analyses of production systems that have fueled research in improving the effectiveness of in-memory caching systems. However, the coverage is still sparse considering the wide spectrum of industrial cache use cases. In this work, we significantly further the understanding of real-world cache workloads by collecting production traces from 153 in-memory cache clusters at Twitter, sifting through over 80 TB of data, and sometimes interpreting the workloads in the context of the business logic behind them. We perform a comprehensive analysis to characterize cache workloads based on traffic pattern, time-to-live (TTL), popularity distribution, and size distribution. A fine-grained view of different workloads uncovers the diversity of use cases: many are far more write-heavy or more skewed than previously shown, and some display unique temporal patterns. We also observe that TTL is an important and sometimes defining parameter of cache working sets. Our simulations show that the ideal replacement strategy in production caches can be surprising; for example, FIFO works best for a large number of workloads.
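
The kind of trace-driven comparison the abstract alludes to (replaying requests through a TTL-aware FIFO cache and measuring the miss ratio) can be sketched briefly. The trace format, function name, and synthetic example below are assumptions for illustration, not the study's actual simulator.

```python
from collections import OrderedDict

def simulate_ttl_fifo(trace, capacity):
    """Replay (timestamp, key, ttl) requests through a FIFO cache that honors TTLs.

    Returns the miss ratio. A toy stand-in for trace-driven policy comparison.
    """
    cache = OrderedDict()  # key -> expiry time; insertion order gives FIFO order
    hits = misses = 0
    for ts, key, ttl in trace:
        expiry = cache.get(key)
        if expiry is not None and expiry > ts:
            hits += 1                     # hit: FIFO does not reorder on access
            continue
        misses += 1
        if expiry is not None:
            del cache[key]                # drop the expired entry
        while len(cache) >= capacity:
            cache.popitem(last=False)     # evict the oldest insertion
        cache[key] = ts + ttl
    return misses / (hits + misses)

# Tiny synthetic trace: (timestamp, key, ttl in seconds)
trace = [(0, "a", 10), (1, "b", 10), (2, "a", 10), (15, "a", 10)]
print(simulate_ttl_fifo(trace, capacity=2))   # 0.75: the last "a" has expired
```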


2018 ◽  
Vol 21 (6) ◽  
pp. 633-646 ◽  
Author(s):  
Weiwei Wu ◽  
Minming Li ◽  
Kai Wang ◽  
He Huang ◽  
Enhong Chen

2018 ◽  
Vol 8 (9) ◽  
pp. 1514 ◽  
Author(s):  
Bao Chang ◽  
Hsiu-Fen Tsai ◽  
Yun-Da Lee

This paper first integrates the big data tools Hive, Impala, and SparkSQL, which support SQL-like queries for rapid retrieval from large datasets. The three tools are not only suitable for high-performance data retrieval in business intelligence, but are also low-cost, open-source software solutions for small-to-medium enterprises. In practice, the proposed approach provides an in-memory cache and an on-disk cache to achieve a very fast response to a query when a cache hit occurs. Moreover, this paper develops a so-called platform selection mechanism that chooses the appropriate tool for a given input query both effectively and efficiently. As a result, job execution with the proposed platform selection is 2.63 times faster than Hive in the Case 1 experiment and 4.57 times faster in the Case 2 experiment.
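
The two ideas in the abstract, a two-level (in-memory then on-disk) result cache and a per-query platform selection step, can be sketched as follows. Everything here is assumed for illustration: the cache layout, the run_on callback, the file paths, and the routing heuristics are not the paper's implementation.

```python
import os
import pickle
import hashlib

MEMORY_CACHE = {}                      # level 1: in-memory cache, query hash -> rows
DISK_CACHE_DIR = "/tmp/query_cache"    # level 2: on-disk cache (illustrative path)

def cache_key(sql):
    return hashlib.sha256(sql.encode()).hexdigest()

def select_platform(sql):
    """Crude stand-in for platform selection: route heavy aggregations to
    SparkSQL, simple filtered lookups to Impala, and everything else to Hive."""
    s = sql.lower()
    if "join" in s or "group by" in s:
        return "sparksql"
    if s.strip().startswith("select") and "where" in s:
        return "impala"
    return "hive"

def run_query(sql, run_on):
    """Answer a SQL-like query, preferring the in-memory then the on-disk cache.

    run_on(platform, sql) is an assumed callback that submits the query to the
    chosen engine ("hive", "impala", or "sparksql") and returns the result rows.
    """
    key = cache_key(sql)
    if key in MEMORY_CACHE:                    # fastest path: in-memory hit
        return MEMORY_CACHE[key]
    path = os.path.join(DISK_CACHE_DIR, key)
    if os.path.exists(path):                   # second level: on-disk hit
        with open(path, "rb") as f:
            rows = pickle.load(f)
        MEMORY_CACHE[key] = rows               # promote to the in-memory cache
        return rows
    rows = run_on(select_platform(sql), sql)   # full miss: pick an engine and run
    MEMORY_CACHE[key] = rows
    os.makedirs(DISK_CACHE_DIR, exist_ok=True)
    with open(path, "wb") as f:
        pickle.dump(rows, f)
    return rows
```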

