l2 cache Latest Research Papers

Conventional 2-level cache architecture is not efficient in mobile systems when small programs that do not require the large L2 cache run. Bypassing the L2 cache for those small programs has two benefits. When only a single program runs, bypassing the L2 cache allows to power it down removing its leakage energy consumption. When multiple programs run simultaneously on multiple cores, small programs bypass the L2 cache while large programs use it. This decreases conflicts in the L2 cache among those programs increasing overall performance. From our experiments using cycle-accurate performance and energy simulators, our proposed L2 cache architecture supporting bypassing is shown to be effective in reducing L2 cache energy consumption and increasing overall performance of programs.

Download Full-text

L2 Cache Robust Partitioning in Multicore Processors

Proceedings of the 6th Brazilian Technology Symposium (BTSym’20) - Smart Innovation, Systems and Technologies ◽

10.1007/978-3-030-75680-2_72 ◽

2021 ◽

pp. 654-661

Author(s):

Thiago Silva de Oliveira Duarte ◽

Osamu Saotome

Keyword(s):

Multicore Processors ◽

L2 Cache

Download Full-text

Energy-Efficient GPU L2 Cache Design Using Instruction-Level Data Locality Similarity

ACM Transactions on Design Automation of Electronic Systems ◽

10.1145/3408060 ◽

2020 ◽

Vol 25 (6) ◽

pp. 1-18 ◽

Cited By ~ 1

Author(s):

Jingweijia Tan ◽

Kaige Yan ◽

Shuaiwen Leon Song ◽

Xin Fu

Keyword(s):

Energy Efficient ◽

Data Locality ◽

Level Data ◽

L2 Cache ◽

Cache Design

Download Full-text

Fast modeling L2 cache reuse distance histograms using combined locality information from software traces

Journal of Systems Architecture ◽

10.1016/j.sysarc.2020.101745 ◽

2020 ◽

Vol 108 ◽

pp. 101745

Author(s):

Ming Ling ◽

Jiancong Ge ◽

Guangmin Wang

Keyword(s):

Reuse Distance ◽

L2 Cache ◽

Locality Information ◽

Cache Reuse

Download Full-text

Miss Rate Estimation (MRE) an Novel Approach Toward L2 Cache Partitioning Algorithm’s for Multicore System

Advances in Intelligent Systems and Computing - Intelligent System Design ◽

10.1007/978-981-15-5400-1_58 ◽

2020 ◽

pp. 593-603

Author(s):

Pallavi Joshi ◽

M. V. Rathnamma ◽

K. Srujan Raju ◽

Urmila Pawar

Keyword(s):

Cache Partitioning ◽

Rate Estimation ◽

Novel Approach ◽

L2 Cache ◽

Multicore System

Download Full-text

Optimization of Regressions in L2 Cache Verification

2020 Second International Conference on Inventive Research in Computing Applications (ICIRCA) ◽

10.1109/icirca48905.2020.9183338 ◽

2020 ◽

Author(s):

Basaweshwari ◽

H.V. Ravish Aradhya ◽

Robert Chan ◽

Jerry Dai ◽

Pawan Yenamandra

Keyword(s):

L2 Cache

Download Full-text

Execution Model to Reduce the Interference of Shared Memory in ARINC 653 Compliant Multicore RTOS

Applied Sciences ◽

10.3390/app10072464 ◽

2020 ◽

Vol 10 (7) ◽

pp. 2464

Author(s):

Sihyeong Park ◽

Mi-Young Kwon ◽

Hoon-Kyu Kim ◽

Hyungshin Kim

Keyword(s):

Execution Time ◽

Software Verification ◽

Main Memory ◽

Critical Systems ◽

Multicore Architectures ◽

L2 Cache ◽

Execution Model ◽

Time Division ◽

L1 And L2 ◽

Execution Models

Multicore architecture is applied to contemporary avionics systems to deal with complex tasks. However, multicore architectures can cause interference by contention because the cores share hardware resources. This interference reduces the predictable execution time of safety-critical systems, such as avionics systems. To reduce this interference, methods of separating hardware resources or limiting capacity by core have been proposed. Existing studies have modified kernels to control hardware resources. Additionally, an execution model has been proposed that can reduce interference by adjusting the execution order of tasks without software modification. Avionics systems require several rigorous software verification procedures. Therefore, modifying existing software can be costly and time-consuming. In this work, we propose a method to apply execution models proposed in existing studies without modifying commercial real-time operating systems. We implemented the time-division multiple access (TDMA) and acquisition execution restitution (AER) execution models with pseudo-partition and message queuing on VxWorks 653. Moreover, we propose a multi-TDMA model considering the characteristics of the target hardware. For the interference analysis, we measured the L1 and L2 cache misses and the number of main memory requests. We demonstrated that the interference caused by memory sharing was reduced by at least 60% in the execution model. In particular, multi-TDMA doubled utilization compared to TDMA and also reduced the execution time by 20% compared to the AER model.

Download Full-text

Poluição de Cache e Thrashing em Aplicações Paralelas de Alto Desempenho

10.5753/wscad.2019.8683 ◽

2019 ◽

Author(s):

Arthur Krause ◽

Francis Moreira ◽

Valéria Girelli ◽

Philippe Olivier Navaux

Keyword(s):

High Performance ◽

Computer Systems ◽

Memory Access ◽

Replacement Policy ◽

Parallel Applications ◽

Access Time ◽

L2 Cache ◽

Intelligent Management

Conforme os processadores evoluem, o desempenho dos sistemas computacionais se torna cada vez mais limitado pelo tempo de acesso à memória. Caches são empregadas a fim de contornar este problema, mas é necessária uma gerência inteligente dos dados que são armazenados nelas para impedir que problemas como poluição e thrashing degradem seu desempenho. Neste trabalho é apresentada uma análise da poluição de cache e thrashing em aplicações paralelas de alto desempenho. Os resultados mostram que caches com maior associatividade sofrem mais com estes problemas. Até 28% dos cache misses na L1 poderiam ser evitados com uma política de substituição de cache mais inteligente, chegando a até 62% na cache L2 e 98% na LLC. As processors evolve, the performance of computer systems becomes increasingly limited by the memory access time. Caches are employed in order to get around this problem, but an intelligent management of the data that is stored in them is necessary to prevent problems such as pollution and thrashing from degrading their performance. In this work, an analysis of cache and thrashing pollution in high performance parallel applications is presented. The results show that caches with greater associativity suffer more from these problems. Up to 28% of cache misses in the L1 cache could be avoided with a smarter replacement policy, up to 62% in the L2 cache and 98% in the LLC.

Download Full-text

A Varying Processor Cache Sets Architecture

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c5679.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 6141-6145

Keyword(s):

Power Saving ◽

Memory Access ◽

Access Time ◽

Design Time ◽

Cache Size ◽

System A ◽

Line Placement ◽

Proposed Model ◽

L2 Cache ◽

Cache System

Any processor cache has three parameters capacity, line size and associativity. Usually all three are fixed at design time. Algorithms to have variable cache sets are proposed in literature. This paper proposes a method to have variable cache sets logically. The cache comes with fixed sets. The cache is visualized to have logically any number of sets greater than or equal to one. An algorithm for line placement/replacement is proposed in this paper for this model. The proposed model is simulated with SPEC2K benchmarks using Simplescalar Toolkit for two level inclusive set associative cache system. A power saving of 8.4% for L1 cache size 512x4, 17.58% for 1024x4 and 31.3% for 2048x4 is observed compared with traditional set associative cache of same size. A power saving of 7.53% compared with model proposed in literature for L1 size 512x4, 7.64% for 1024x4 and 7.645% for 2048x4 is observed. The L2 cache size is fixed at 2048x8. The average memory access time (AMAT) is found to degrade compared with conventional set associative cache by 19.63% for L1 size of 512x4, 24.68% for 1024x4 and 2048x4. (Abstract)

Download Full-text

Efficient L2 Cache Management to Boost GPGPU Performance

10.4995/thesis/10251/125477 ◽

2019 ◽

Author(s):

Francisco Candel Margaix

Keyword(s):

Cache Management ◽

L2 Cache

Download Full-text

l2 cache
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

An L2 Cache Architecture Supporting Bypassing for Low Energy and High Performance

L2 Cache Robust Partitioning in Multicore Processors

Energy-Efficient GPU L2 Cache Design Using Instruction-Level Data Locality Similarity

Fast modeling L2 cache reuse distance histograms using combined locality information from software traces

Miss Rate Estimation (MRE) an Novel Approach Toward L2 Cache Partitioning Algorithm’s for Multicore System

Optimization of Regressions in L2 Cache Verification

Execution Model to Reduce the Interference of Shared Memory in ARINC 653 Compliant Multicore RTOS

Poluição de Cache e Thrashing em Aplicações Paralelas de Alto Desempenho

A Varying Processor Cache Sets Architecture

Efficient L2 Cache Management to Boost GPGPU Performance

Export Citation Format

l2 cacheRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

An L2 Cache Architecture Supporting Bypassing for Low Energy and High Performance

L2 Cache Robust Partitioning in Multicore Processors

Energy-Efficient GPU L2 Cache Design Using Instruction-Level Data Locality Similarity

Fast modeling L2 cache reuse distance histograms using combined locality information from software traces

Miss Rate Estimation (MRE) an Novel Approach Toward L2 Cache Partitioning Algorithm’s for Multicore System

Optimization of Regressions in L2 Cache Verification

Execution Model to Reduce the Interference of Shared Memory in ARINC 653 Compliant Multicore RTOS

Poluição de Cache e Thrashing em Aplicações Paralelas de Alto Desempenho

A Varying Processor Cache Sets Architecture

Efficient L2 Cache Management to Boost GPGPU Performance

l2 cache
Recently Published Documents