scholarly journals Understanding Soft Error Resiliency of Blue Gene/Q Compute Chip through Hardware Proton Irradiation and Software Fault Injection

Author(s):  
Chen-Yong Cher ◽  
Meeta S. Gupta ◽  
Pradip Bose ◽  
K. Paul Muller
Author(s):  
Chen-Yong Cher ◽  
K. Paul Muller ◽  
Ruud A. Haring ◽  
David L. Satterfield ◽  
Thomas E. Musta ◽  
...  

2005 ◽  
Author(s):  
P.K. Tapadiya ◽  
D.R. Avresky

Author(s):  
Qiang Guan ◽  
Nathan DeBardeleben ◽  
Sean Blanchard ◽  
Song Fu ◽  
Claude H. Davis IV ◽  
...  

As the high performance computing (HPC) community continues to push towards exascale computing, HPC applications of today are only affected by soft errors to a small degree but we expect that this will become a more serious issue as HPC systems grow. We propose F-SEFI, a Fine-grained Soft Error Fault Injector, as a tool for profiling software robustness against soft errors. We utilize soft error injection to mimic the impact of errors on logic circuit behavior. Leveraging the open source virtual machine hypervisor QEMU, F-SEFI enables users to modify emulated machine instructions to introduce soft errors. F-SEFI can control what application, which sub-function, when and how to inject soft errors with different granularities, without interference to other applications that share the same environment. We demonstrate use cases of F-SEFI on several benchmark applications with different characteristics to show how data corruption can propagate to incorrect results. The findings from the fault injection campaign can be used for designing robust software and power-efficient hardware.


2017 ◽  
Vol 50 ◽  
pp. 102-112 ◽  
Author(s):  
Maha Kooli ◽  
Firas Kaddachi ◽  
Giorgio Di Natale ◽  
Alberto Bosio ◽  
Pascal Benoit ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document