shared caches
Recently Published Documents


TOTAL DOCUMENTS

63
(FIVE YEARS 19)

H-INDEX

9
(FIVE YEARS 3)

2022 ◽  
Vol 19 (1) ◽  
pp. 1-25
Author(s):  
Muhammad Aditya Sasongko ◽  
Milind Chabbi ◽  
Mandana Bagheri Marzijarani ◽  
Didem Unat

One widely used metric that measures data locality is reuse distance —the number of unique memory locations that are accessed between two consecutive accesses to a particular memory location. State-of-the-art techniques that measure reuse distance in parallel applications rely on simulators or binary instrumentation tools that incur large performance and memory overheads. Moreover, the existing sampling-based tools are limited to measuring reuse distances of a single thread and discard interactions among threads in multi-threaded programs. In this work, we propose ReuseTracker —a fast and accurate reuse distance analyzer that leverages existing hardware features in commodity CPUs. ReuseTracker is designed for multi-threaded programs and takes cache-coherence effects into account. By utilizing hardware features like performance monitoring units and debug registers, ReuseTracker can accurately profile reuse distance in parallel applications with much lower overheads than existing tools. It introduces only 2.9× runtime and 2.8× memory overheads. Our tool achieves 92% accuracy when verified against a newly developed configurable benchmark that can generate a variety of different reuse distance patterns. We demonstrate the tool’s functionality with two use-case scenarios using PARSEC, Rodinia, and Synchrobench benchmark suites where ReuseTracker guides code refactoring in these benchmarks by detecting spatial reuses in shared caches that are also false sharing and successfully predicts whether some benchmarks in these suites can benefit from adjacent cache line prefetch optimization.


2021 ◽  
Author(s):  
Hui Zhao ◽  
Antonio Bazco-Nogueras ◽  
Petros Elia
Keyword(s):  

2021 ◽  
Vol 18 (3) ◽  
pp. 1-25
Author(s):  
Weijia Song ◽  
Christina Delimitrou ◽  
Zhiming Shen ◽  
Robbert Van Renesse ◽  
Hakim Weatherspoon ◽  
...  

Infrastructure-as-a-Service cloud providers sell virtual machines that are only specified in terms of number of CPU cores, amount of memory, and I/O throughput. Performance-critical aspects such as cache sizes and memory latency are missing or reported in ways that make them hard to compare across cloud providers. It is difficult for users to adapt their application’s behavior to the available resources. In this work, we aim to increase the visibility that cloud users have into shared resources on public clouds. Specifically, we present CacheInspector , a lightweight runtime that determines the performance and allocated capacity of shared caches on multi-tenant public clouds. We validate CacheInspector ’s accuracy in a controlled environment, and use it to study the characteristics and variability of cache resources in the cloud, across time, instances, availability regions, and cloud providers. We show that CacheInspector ’s output allows cloud users to tailor their application’s behavior, including their output quality, to avoid suboptimal performance when resources are scarce.


2021 ◽  
Author(s):  
Amin Sarihi ◽  
Ahmad Patooghy ◽  
Mahdi Amininasab ◽  
Mohammad Shokrolah Shirazi ◽  
Abdel‐Hameed A. Badawy

Author(s):  
Javier Díaz ◽  
Pablo Ibáñez ◽  
Teresa Monreal ◽  
Víctor Viñals ◽  
José M. Llabería

2021 ◽  
pp. 1-1
Author(s):  
Monolina Dutta ◽  
Anoop Thomas
Keyword(s):  

2021 ◽  
pp. 1-1
Author(s):  
Shreya Shrestha Meel ◽  
B. Sundar Rajan
Keyword(s):  

2020 ◽  
Vol 69 (10) ◽  
pp. 1487-1499 ◽  
Author(s):  
Jun Xiao ◽  
Sebastian Altmeyer ◽  
Andy D. Pimentel

Sign in / Sign up

Export Citation Format

Share Document