register file
Recently Published Documents

Total documents: 482 (five years: 32)
H-index: 24 (five years: 1)

2021
Author(s): Ilya Tuzov, Pablo Andreu, Laura Medina, Tomas Picornell, Antonio Robles, ...

2021
Author(s): Martin Košt'ál, Michal Sojka

Electronics, 2021, Vol. 10 (18), pp. 2286
Author(s): Yohan Ko

From early design phases to final release, the reliability of modern embedded systems against soft errors should be carefully considered. Several schemes have been proposed to protect embedded systems against soft errors, but they are not always functional or robust, and they incur expensive overheads in terms of hardware area, performance, and power consumption. Thus, system designers need to estimate reliability quantitatively in order to apply appropriate protection techniques to resource-constrained embedded systems. Vulnerability modeling based on lifetime analysis is one of the most efficient ways to quantify system reliability against soft errors. However, lifetime analysis can be inaccurate, mainly because it fails to comprehensively capture several system-level masking effects. This study analyzes and characterizes microarchitecture-level and software-level masking effects by developing an automated framework that performs exhaustive fault injections (i.e., soft errors) on top of the cycle-accurate gem5 simulator. We injected faults into the register file because errors in the register file can easily propagate to other components in a processor. We found that only 5% of injected faults cause system failures on average across benchmarks, mainly from the MiBench suite. Further analyses showed that 71% of soft errors are overwritten by write operations before being used, and another 20% are never used by the CPU after injection. The remainder are masked by several software-level masking effects, such as dynamically dead instructions, compare and logical instructions whose results do not change, and incorrect control flows that do not affect program outputs.
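To make the masking categories above concrete, the following is a minimal Python sketch of single-bit fault injection into a register-file model and the classification of outcomes (overwritten before use, never used, or propagated). It is only an illustration of the idea, not the authors' gem5-based framework; names such as inject_bit_flip and classify_outcome are hypothetical.

```python
import random

# Illustrative sketch of single-bit fault injection into a register file model.
# Not the gem5-based framework from the paper; all names are hypothetical.

NUM_REGS = 32
REG_WIDTH = 32  # bits per register

def inject_bit_flip(regfile, reg_idx, bit_idx):
    """Flip one bit of one register and return the corrupted register file."""
    corrupted = list(regfile)
    corrupted[reg_idx] ^= (1 << bit_idx)
    return corrupted

def classify_outcome(trace, reg_idx, inject_cycle):
    """Classify a fault by the first access to the faulty register after injection.

    `trace` is a list of (cycle, op, reg) tuples, with op being 'read' or 'write'.
    A write before any read masks the fault; no access at all means it is unused.
    """
    for cycle, op, reg in trace:
        if cycle <= inject_cycle or reg != reg_idx:
            continue
        if op == 'write':
            return 'masked_by_overwrite'
        if op == 'read':
            return 'propagated'   # may still be masked at the software level
    return 'unused'

# Example: a fault injected at cycle 10 into r5, which is overwritten at cycle 12.
trace = [(12, 'write', 5), (20, 'read', 5)]
regfile = [0] * NUM_REGS
faulty = inject_bit_flip(regfile, reg_idx=5, bit_idx=random.randrange(REG_WIDTH))
print(classify_outcome(trace, reg_idx=5, inject_cycle=10))  # masked_by_overwrite
```

A full campaign would repeat this for every register, bit position, and injection cycle, which is the exhaustive sweep the abstract describes.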


2021
Author(s): Ayazulla Khan Patan, Dimitrios Stathis, Pudi Dhilleswararao, Yu Yang, Srinivas Boppu, ...

2021, Vol. 18 (3), pp. 1-22
Author(s): Ricardo Alves, Stefanos Kaxiras, David Black-Schaffer

Achieving low load-to-use latency with low energy and storage overheads is critical for performance. Existing techniques either prefetch into the pipeline (via address prediction and validation) or provide data reuse in the pipeline (via register sharing or L0 caches). These techniques provide a range of tradeoffs between latency, reuse, and overhead. In this work, we present a pipeline prefetching technique that achieves state-of-the-art performance and data reuse without additional data storage, data movement, or validation overheads by adding address tags to the register file. Our addition of register file tags allows us to forward (reuse) load data from the register file with no additional data movement, keep the data alive in the register file beyond the instruction’s lifetime to increase temporal reuse, and coalesce prefetch requests to achieve spatial reuse. Further, we show that we can use the existing memory order violation detection hardware to validate prefetches and data forwards without additional overhead. Our design achieves the performance of existing pipeline prefetching while also forwarding 32% of the loads from the register file (compared to 15% in state-of-the-art register sharing), delivering a 16% reduction in L1 dynamic energy (1.6% total processor energy), with an area overhead of less than 0.5%.
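As a rough illustration of the core mechanism, the sketch below models a register file whose entries carry address tags, so a later load to the same address can be forwarded from the register file instead of accessing the L1 cache. This is a simplified functional model under assumed names (TaggedRegisterFile, l1_read), not the paper's microarchitecture; it omits prefetch validation, store invalidation, and request coalescing.

```python
# Sketch of forwarding load data from an address-tagged register file.
# Simplified functional model; names and structure are assumptions.

class TaggedRegisterFile:
    def __init__(self, num_regs):
        self.values = [0] * num_regs
        self.addr_tags = {}  # load address -> physical register holding that data

    def write_load(self, reg, addr, value):
        """Record a completed load: store the value and tag the register with its address."""
        self.values[reg] = value
        # Drop any stale tag that pointed at this register before retagging it.
        self.addr_tags = {a: r for a, r in self.addr_tags.items() if r != reg}
        self.addr_tags[addr] = reg

    def try_forward(self, addr):
        """Return (hit, value): forward from the register file if an address tag matches."""
        reg = self.addr_tags.get(addr)
        if reg is None:
            return False, None
        return True, self.values[reg]

def load(rf, addr, l1_read):
    hit, value = rf.try_forward(addr)
    if hit:
        return value          # reuse data already in the register file, no L1 access
    return l1_read(addr)      # miss: fall back to the L1 cache / prefetch path

# Usage: the second load to 0x1000 is served from the register file.
rf = TaggedRegisterFile(num_regs=32)
rf.write_load(reg=3, addr=0x1000, value=42)
print(load(rf, 0x1000, l1_read=lambda a: 0))  # 42, forwarded without touching L1
```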


2021, pp. 105076
Author(s): A. Mohammaden, M.E. Fouda, Ihsen Alouani, Lobna A. Said, Ahmed G. Radwan

Author(s): Yara M. Abdelaal, M. Fayez, Samy Ghoniemy, Ehab Abozinadah, H. M. Faheem

Face detection algorithms vary in speed and performance on GPUs. Different algorithms can report different speeds on different GPUs, and these differences are not governed by linear or near-linear approximations. This is due to many factors, such as register file size, GPU occupancy rate, memory speed, and double-precision throughput. This paper studies the most common face detection algorithms, LBP and Haar-like features, and examines the bottlenecks associated with deploying both algorithms on different GPU architectures. The study focuses on these bottlenecks and the techniques to resolve them based on the specifications of the different GPUs.
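To make the occupancy factor mentioned above concrete, the following is a small Python sketch of the standard occupancy arithmetic: per-thread register usage bounds how many warps can be resident on a streaming multiprocessor at once. The SM parameters used here (register file size, warp limits) are illustrative assumptions, not figures from the paper.

```python
# Rough occupancy arithmetic: per-thread register usage limits resident warps per SM.
# The SM parameters below are illustrative assumptions.

def occupancy(regs_per_thread,
              threads_per_block=256,
              regfile_per_sm=65536,   # 32-bit registers per SM (assumed)
              max_warps_per_sm=64,
              warp_size=32):
    warps_per_block = threads_per_block // warp_size
    regs_per_block = regs_per_thread * threads_per_block
    # How many blocks fit in the register file, and how many warps that yields.
    blocks_by_regs = regfile_per_sm // regs_per_block
    resident_warps = min(blocks_by_regs * warps_per_block, max_warps_per_sm)
    return resident_warps / max_warps_per_sm

# A kernel using 32 registers/thread vs. one using 96 registers/thread:
print(occupancy(32))  # 1.0  -> the register file is not the limiter
print(occupancy(96))  # 0.25 -> register pressure caps the resident warps
```

Lower occupancy reduces the GPU's ability to hide memory latency, which is one way a register-hungry detection kernel can end up memory-bound on one architecture and compute-bound on another.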

