Fault-tolerant scheduling of multicore mixed-criticality systems under permanent failures

Author(s):  
Zaid Al-bayati ◽  
Brett H. Meyer ◽  
Haibo Zeng
Author(s):  
Jian (Denny) Lin ◽  
Albert M. K. Cheng ◽  
Doug Steel ◽  
Michael Yu-Chi Wu ◽  
Nanfei Sun

Enabling computer tasks with different levels of criticality running on a common hardware platform has been an increasingly important trend in the design of real-time and embedded systems. On such systems, a real-time task may exhibit different WCETs (Worst Case Execution Times) in different criticality modes. It is well-known that traditional real-time scheduling methods are not applicable to ensure the timely requirement of the mixed-criticality tasks. In this paper, the authors study a problem of scheduling real-time, mixed-criticality tasks with fault tolerance. An optimal, off-line algorithm is designed to guarantee the most tasks completing successfully when the system runs into the high-criticality mode. A formal proof of the optimality is given. Also, a novel on-line slack-reclaiming algorithm is proposed to recover from computing faults before the tasks' deadline during the run-time. Simulations show that an improvement of about 30% in performance is obtained by using the slack-reclaiming method.


2016 ◽  
Vol 26 (01) ◽  
pp. 1750016 ◽  
Author(s):  
Junlong Zhou ◽  
Min Yin ◽  
Zhifang Li ◽  
Kun Cao ◽  
Jianming Yan ◽  
...  

Integration of safety-critical tasks with different certification requirements onto a common hardware platform has become a growing tendency in the design of real-time and embedded systems. In the past decade, great efforts have been made to develop techniques for handling uncertainties in task worst-case execution time, quality-of-service, and schedulability of mixed-criticality systems. However, few works take fault-tolerance as a design requirement. In this paper, we address the scheduling of fault-tolerant mixed-criticality systems to ensure the safety of tasks at different levels of criticalities in the presence of transient faults. We adopt task re-execution as the fault-tolerant technique. Extensive simulations were performed to validate the effectiveness of our algorithm. Simulation results show that our algorithm results in up to [Formula: see text] and [Formula: see text] improvement in system reliability and schedule feasibility as compared to existing techniques, which contributes to a more safe system.


2019 ◽  
Vol 100 ◽  
pp. 165-175 ◽  
Author(s):  
Kun Cao ◽  
Guo Xu ◽  
Junlong Zhou ◽  
Mingsong Chen ◽  
Tongquan Wei ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document