Instruction cache
Recently Published Documents


TOTAL DOCUMENTS: 235 (five years: 11)

H-INDEX: 21 (five years: 0)

Author(s): Tanvir Ahmed Khan, Dexin Zhang, Akshitha Sriraman, Joseph Devietti, Gilles Pokam, ...

Author(s): Qing-Qing Li, Zhi-Guo Yu, Yi Sun, Jing-He Wei, Xiao-Feng Gu

2021
Author(s): Utku Sirin, Pınar Tözün, Danica Porobic, Ahmad Yasin, Anastasia Ailamaki

Abstract: The micro-architectural behavior of traditional disk-based online transaction processing (OLTP) systems has been investigated extensively over the past couple of decades. Results show that traditional OLTP systems mostly under-utilize the available micro-architectural resources. In-memory OLTP systems, on the other hand, process all the data in main memory and can therefore omit the buffer pool. Furthermore, since they are usually designed from scratch, they typically adopt more lightweight concurrency control mechanisms, cache-conscious data structures, and cleaner codebases. Hence, we expect significant differences in micro-architectural behavior when running OLTP on platforms optimized for in-memory processing as opposed to disk-based database systems. In particular, we expect in-memory systems to exploit micro-architectural features such as instruction and data caches significantly better than disk-based systems. This paper sheds light on the micro-architectural behavior of in-memory database systems by analyzing it and contrasting it to the behavior of disk-based systems when running OLTP workloads. The results show that, despite all the design changes, in-memory OLTP exhibits micro-architectural behavior very similar to disk-based OLTP: more than half of the execution time goes to memory stalls, with instruction cache misses and long-latency data misses from the last-level cache (LLC) the dominant factors in overall execution time. Even though ground-up in-memory designs can eliminate the instruction cache misses, the reduction in instruction stalls amplifies the impact of LLC data misses. As a result, only 30% of the CPU cycles are used to retire instructions, while 70% are wasted on stalls, for both traditional disk-based and new-generation in-memory OLTP.
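The retirement-versus-stall split described in this abstract can be approximated on Linux with hardware performance counters. The following is a minimal C sketch using the perf_event_open(2) interface; it is not the authors' measurement methodology, run_workload() is a hypothetical placeholder for the OLTP code under test, and the generic stall events are not supported on every CPU model.

```c
/* Minimal sketch: estimate the fraction of CPU cycles lost to stalls
 * with Linux perf counters. Not the paper's methodology; run_workload()
 * is a hypothetical placeholder for the workload under study. */
#include <linux/perf_event.h>
#include <sys/ioctl.h>
#include <sys/syscall.h>
#include <unistd.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

static long perf_open(uint64_t config) {
    struct perf_event_attr attr;
    memset(&attr, 0, sizeof(attr));
    attr.type = PERF_TYPE_HARDWARE;
    attr.size = sizeof(attr);
    attr.config = config;
    attr.disabled = 1;        /* start counting only after ENABLE */
    attr.exclude_kernel = 1;  /* user-space cycles only */
    return syscall(SYS_perf_event_open, &attr, 0, -1, -1, 0);
}

static void run_workload(void) { /* hypothetical: transaction mix goes here */ }

int main(void) {
    long cycles_fd = perf_open(PERF_COUNT_HW_CPU_CYCLES);
    long fe_fd     = perf_open(PERF_COUNT_HW_STALLED_CYCLES_FRONTEND);
    long be_fd     = perf_open(PERF_COUNT_HW_STALLED_CYCLES_BACKEND);
    if (cycles_fd < 0 || fe_fd < 0 || be_fd < 0) {
        perror("perf_event_open");  /* stall events unsupported on some CPUs */
        return 1;
    }

    ioctl(cycles_fd, PERF_EVENT_IOC_ENABLE, 0);
    ioctl(fe_fd, PERF_EVENT_IOC_ENABLE, 0);
    ioctl(be_fd, PERF_EVENT_IOC_ENABLE, 0);

    run_workload();

    uint64_t cycles = 0, fe = 0, be = 0;
    read(cycles_fd, &cycles, sizeof(cycles));
    read(fe_fd, &fe, sizeof(fe));
    read(be_fd, &be, sizeof(be));

    /* Front-end stalls cover instruction-fetch problems (e.g. L1-I misses);
     * back-end stalls cover data-side delays (e.g. LLC data misses). */
    printf("front-end stall fraction: %.1f%%\n", 100.0 * fe / cycles);
    printf("back-end  stall fraction: %.1f%%\n", 100.0 * be / cycles);
    return 0;
}
```

The two stall counters can overlap in a given cycle, so this is only a coarse approximation of a full top-down breakdown, but it is enough to reproduce the flavor of the 30% retiring / 70% stalled result on one's own workload.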


2021
Author(s): Chun-Chang Yu, Yu Hen Hu, Yi-Chang Lu, Charlie Chung-Ping Chen

IEEE Access, 2020, Vol 8, pp. 208003-208015
Author(s): Tingxu Zhang, Wenguang Zheng, Yingyuan Xiao, Guangping Xu

2020, pp. 1-1
Author(s): Jinkwon Kim, Seokin Hong, Jeongkyu Hong, Soontae Kim

2019, Vol 28 (12), pp. 1950203
Author(s): Sajjad Rostami-Sani, Mojtaba Valinataj, Saeideh Alinezhad Chamazcoti

The cache system dissipates a significant amount of energy compared to the other memory components. This is intensified when a cache is designed with a set-associative structure to improve system performance, because the parallel accesses to the entries of a set for tag comparisons consume even more energy. In this paper, a novel method combining a counting Bloom filter with partial tags is proposed to mitigate the energy consumption of set-associative caches. This hybrid method noticeably decreases cache energy consumption, especially in highly-associative instruction caches. It uses an enhanced counting Bloom filter to predict cache misses with high accuracy, together with partial tags that decrease the overall cache size. This way, unnecessary tag comparisons are avoided and the cache energy consumption is considerably reduced. Based on the simulation results, the proposed method provides an energy reduction of 22% to 31% for 4-way to 32-way set-associative L1 caches larger than 16 kB running the MiBench programs. These improvements are attained with negligible system performance degradation compared to a traditional cache system.
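To make the miss-prediction idea concrete, here is a minimal C sketch of a plain counting Bloom filter sitting in front of the tag array. It is a generic illustration, not the paper's enhanced filter: the counter count, counter width, and hash constants are arbitrary assumptions.

```c
/* Minimal sketch of cache-miss prediction with a counting Bloom filter.
 * Generic CBF, not the paper's enhanced design; sizes and hash constants
 * are arbitrary illustrative choices. */
#include <stdbool.h>
#include <stdint.h>

#define CBF_COUNTERS 4096               /* power of two */
#define CBF_MASK     (CBF_COUNTERS - 1)

static uint8_t cbf[CBF_COUNTERS];       /* small saturating counters */

/* Two multiplicative hashes over the cache-block address. */
static unsigned h1(uint64_t blk) { return (unsigned)((blk * 0x9E3779B97F4A7C15ULL) >> 52) & CBF_MASK; }
static unsigned h2(uint64_t blk) { return (unsigned)((blk * 0xC2B2AE3D27D4EB4FULL) >> 52) & CBF_MASK; }

/* Called when a block is filled into the cache. */
void cbf_insert(uint64_t blk) {
    if (cbf[h1(blk)] < UINT8_MAX) cbf[h1(blk)]++;
    if (cbf[h2(blk)] < UINT8_MAX) cbf[h2(blk)]++;
}

/* Called when a block is evicted; counters (rather than single bits)
 * are what make deletion possible. */
void cbf_evict(uint64_t blk) {
    if (cbf[h1(blk)] > 0) cbf[h1(blk)]--;
    if (cbf[h2(blk)] > 0) cbf[h2(blk)]--;
}

/* A zero counter means the block is definitely absent (no false
 * negatives), so all tag comparisons for the set can be skipped and
 * their energy saved. A nonzero result may be a false positive, in
 * which case the tag compare proceeds as usual. */
bool cbf_maybe_cached(uint64_t blk) {
    return cbf[h1(blk)] > 0 && cbf[h2(blk)] > 0;
}
```

Because the filter has no false negatives, a predicted miss lets the lookup skip every way of the set; on a predicted hit, the paper's scheme then compares shortened partial tags rather than full tags, trimming the lookup energy further.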

