Modeling fault-tolerant system behavior

A fault-tolerant system incorporating an Ada executive and 1750A processors

10.1145/339665.339692 ◽

1986 ◽

Author(s):

David Butler

Keyword(s):

Fault Tolerant ◽

Fault Tolerant System

Fault-Tolerant Strategy for Real-Time System Based on Evolvable Hardware

Journal of Circuits System and Computers ◽

10.1142/s0218126617501110 ◽

2017 ◽

Vol 26 (07) ◽

pp. 1750111 ◽

Cited By ~ 2

Author(s):

Jie Wang ◽

Jiwei Liu

Keyword(s):

Real Time ◽

Recovery Time ◽

Fault Tolerant ◽

Fault Tree Analysis ◽

Repair Process ◽

Evolvable Hardware ◽

Target System ◽

Time System ◽

Real Time System ◽

Fault Tolerant System

Evolvable hardware (EHW) is widely used in the design of fault-tolerant systems. A fault-tolerant system is in practice a real-time system, so the recovery time must be taken into account in fault detection and recovery. However, when EHW is applied, this real-time characteristic is usually ignored. In this paper, a fault-tolerant strategy based on EHW is proposed. The recovery time, predicted by fault tree analysis (FTA), is treated as a constraint. A configuration library is set up in the design phase to accelerate the repair of anticipated faults. An evolvable algorithm (EA) based on similarity is applied to evolve the repair circuit for unanticipated faults. When the library reaches its upper limit, the target system is reconfigured by the EA-repair technology. Extensive experiments show that the method improves the fault tolerance of the system while satisfying the real-time requirement on an FPGA platform. In a long-running system, the method maintains a higher fault recovery rate.
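
The two-tier repair flow described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the bit-tuple encoding of faults and configurations, the `repair` and `toy_fitness` names, the generation budget standing in for the recovery-time constraint, and the `capacity` handling are all assumptions made for illustration.

```python
import random


def hamming(a, b):
    """Number of positions at which two equal-length bit tuples differ."""
    return sum(x != y for x, y in zip(a, b))


def repair(fault, library, fitness, budget, rng, capacity=8):
    """Return (configuration, generations_used) for a fault signature.

    Anticipated faults are served directly from the library. Unanticipated
    faults are repaired by a simple (1+1) evolutionary search seeded with the
    configuration of the most similar known fault. `budget` is a hypothetical
    stand-in for the recovery-time constraint.
    """
    if fault in library:
        return library[fault], 0  # anticipated fault: no evolution needed

    # Seed the search from the most similar known fault (assumed similarity measure).
    nearest = min(library, key=lambda known: hamming(known, fault))
    best = library[nearest]
    best_score = fitness(best, fault)
    target = len(best)  # assumption: a perfect repair scores one point per bit

    generations = 0
    while best_score < target and generations < budget:
        generations += 1
        i = rng.randrange(len(best))
        child = best[:i] + (1 - best[i],) + best[i + 1:]  # flip one bit
        score = fitness(child, fault)
        if score >= best_score:
            best, best_score = child, score

    if best_score < target:
        return None, generations  # budget exhausted before a repair was found
    if len(library) < capacity:
        library[fault] = best  # remember the repair while the library has room
    return best, generations


def toy_fitness(config, fault):
    """Toy objective: the working configuration is the complement of the fault."""
    return sum(c == 1 - f for c, f in zip(config, fault))
```

With a library of `{(0, 0, 0, 1): (1, 1, 1, 0)}`, a call for the known fault returns immediately with zero generations, while a call for `(0, 0, 1, 1)` evolves a new configuration and stores it, so a repeat of that fault becomes a library hit.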

Comparing the Performance of Reference Trajectory Management and Controller Reconfiguration in Attitude Fault Tolerant Control

MATEC Web of Conferences ◽

10.1051/matecconf/201815104008 ◽

2018 ◽

Vol 151 ◽

pp. 04008

Author(s):

Rouzbeh Moradi ◽

Alireza Alikhani ◽

Mohsen Fathi Jegarkandi

Keyword(s):

Control Problem ◽

Closed Loop ◽

Fault Tolerant ◽

Dynamic Performance ◽

Fault Tolerant Control ◽

Spacecraft Attitude ◽

Reference Trajectory ◽

System Behavior ◽

Closed Loop System ◽

Reference Trajectories

Reference trajectory management is a method that modifies the reference trajectories of a faulty system. The modified reference trajectories define new maneuvers that let the system retain its pre-fault dynamic performance. Controller reconfiguration is another method for handling faults in the system, for instance by adjusting the controller parameters (coefficients). Both methods have been considered in the literature and have been shown to be capable of handling various faults. However, they have not been compared sufficiently. In this paper, a controller reconfiguration mechanism and a reference trajectory management scheme are proposed for the spacecraft attitude fault-tolerant control problem. The two methods are then compared under the same conditions, and it is shown that the proposed controller reconfiguration performs better than the proposed reference trajectory management. The reason is that controller reconfiguration has more variables with which to modify the closed-loop system behavior.

Does your fault-tolerant system tolerate faults?

Proceedings of the ACM Symposium on Cloud Computing - SoCC '18 ◽

10.1145/3267809.3275451 ◽

2018 ◽

Author(s):

Kamala Ramasubramanian ◽

Peter Alvaro

Keyword(s):

Fault Tolerant ◽

Fault Tolerant System

FT-EST Framework: Reliability Estimation for the Purposes of Fault-Tolerant System Design Automation

2018 21st Euromicro Conference on Digital System Design (DSD) ◽

10.1109/dsd.2018.00053 ◽

2018 ◽

Cited By ~ 4

Author(s):

Jakub Lojda ◽

Jakub Podivinsky ◽

Ondrej Cekan ◽

Richard Panek ◽

Zdenek Kotasek

Keyword(s):

System Design ◽

Design Automation ◽

Fault Tolerant ◽

Reliability Estimation ◽

Fault Tolerant System

Overview of a Fault-Tolerant System

Fault-Tolerant Parallel and Distributed Systems ◽

10.1007/978-1-4615-5449-3_6 ◽

1998 ◽

pp. 109-121

Author(s):

Angelo Pruscino

Keyword(s):

Fault Tolerant ◽

Fault Tolerant System

Fault-Tolerant system design in multiple operating modes using a structural model

Advances in Safety, Reliability and Risk Management ◽

10.1201/b11433-77 ◽

2011 ◽

pp. 549-556 ◽

Cited By ~ 4

Author(s):

B Conrard ◽

V Cocquempot ◽

S Mili

Keyword(s):

System Design ◽

Structural Model ◽

Fault Tolerant ◽

Fault Tolerant System ◽

Operating Modes

Log Replication in Raft vs Kafka

Studia Universitatis Babeș-Bolyai Informatica ◽

10.24193/subbi.2020.2.05 ◽

2020 ◽

Vol 65 (2) ◽

pp. 66

Author(s):

M. Petrescu ◽

R. Petrescu

Keyword(s):

Distributed Systems ◽

Fault Tolerant ◽

Consensus Algorithm ◽

Correct Operation ◽

Consensus Algorithms ◽

Fault Tolerant System ◽

Multiple Algorithms

The implementation of a fault-tolerant system requires some type of consensus algorithm for correct operation. From Paxos to Viewstamped Replication and Raft, multiple algorithms have been developed to handle this problem. This paper presents and compares the Raft algorithm and Apache Kafka, a distributed messaging system which, although at a higher level, implements many concepts present in Raft (strong leadership, append-only log, log compaction, etc.). This shows that mechanisms conceived to handle one class of problems (consensus algorithms) are also very useful for handling a larger category of problems in distributed systems.
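
Two of the concepts named in this abstract, strong leadership and an append-only log, can be sketched in a few lines. This is a deliberately simplified in-process illustration, not Raft or Kafka themselves: terms, elections, log repair for lagging followers, and log compaction are omitted, and the `Replica` and `Leader` classes are hypothetical.

```python
class Replica:
    """A follower holding an append-only log."""

    def __init__(self, name, up=True):
        self.name = name
        self.up = up
        self.log = []

    def append(self, index, entry):
        """Accept the entry only if it extends the log at exactly `index`."""
        if not self.up or index != len(self.log):
            return False
        self.log.append(entry)
        return True


class Leader:
    """The single writer: every entry flows from the leader to followers."""

    def __init__(self, followers):
        self.log = []
        self.followers = followers
        self.commit_index = -1  # index of the last majority-acknowledged entry

    def propose(self, entry):
        """Append locally, replicate, and commit once a majority holds the entry."""
        index = len(self.log)
        self.log.append(entry)
        acks = 1  # the leader's own copy counts
        for follower in self.followers:
            if follower.append(index, entry):
                acks += 1
        cluster_size = len(self.followers) + 1
        if acks > cluster_size // 2:
            self.commit_index = index
            return True
        return False
```

In a three-node cluster with one follower down, a proposal still commits because two of three copies exist. With both followers down, the entry stays in the leader's log but is not committed.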

SOFTWARE IMPLEMENTED HARDWARE-TRANSIENT FAULTS DETECTION

International Journal of Computing ◽

10.47839/ijc.5.1.377 ◽

2014 ◽

pp. 26-30

Author(s):

Goutam Kumar Saha

Keyword(s):

Error Correction ◽

Fault Tolerant ◽

Low Cost ◽

Transient Faults ◽

Fault Tolerant System ◽

Cost Approach ◽

Run Time ◽

Commodity Systems ◽

Fail Safe ◽

Register Error

This paper examines a software-implemented self-checking technique that is capable of detecting hardware-transient faults in processor registers. The proposed approach is intended to detect run-time transient bit errors in memory and in the processor status register. Error correction is not considered here. However, this low-cost approach is intended for adoption in commodity systems that use ordinary off-the-shelf microprocessors, with the aim of detecting operational faults and thereby obtaining a fail-safe kind of fault-tolerant system.
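
The abstract does not spell out the checking scheme, so the following is only a generic sketch of software self-checking, assuming a complemented shadow copy of each protected word. The `CheckedWord` class, the `TransientFaultError` exception, and the 32-bit word width are hypothetical choices. As in the abstract, the sketch detects errors but does not correct them.

```python
MASK = 0xFFFFFFFF  # assumption: 32-bit words


class TransientFaultError(RuntimeError):
    """Raised when a stored word and its shadow copy disagree."""


class CheckedWord:
    """A word stored alongside its bitwise complement for run-time checking."""

    def __init__(self, value):
        self.store(value)

    def store(self, value):
        self.value = value & MASK
        self.shadow = ~value & MASK  # complemented shadow copy

    def load(self):
        # A healthy pair XORs to all ones; a flipped bit in either copy breaks this.
        if (self.value ^ self.shadow) != MASK:
            raise TransientFaultError("bit error detected; entering fail-safe path")
        return self.value
```

A caller would wrap `load()` in a handler that moves the system to a safe state, which matches the fail-safe rather than fail-operational goal stated above.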

System-Level Reliability and Sensitivity Analyses for Three Fault-Tolerant System Architectures

Dependable Computing and Fault-Tolerant Systems - Dependable Computing for Critical Applications 4 ◽

10.1007/978-3-7091-9396-9_37 ◽

1995 ◽

pp. 459-477 ◽

Cited By ~ 2

Author(s):

Joanne Bechta Dugan ◽

Michael R. Lyu

Keyword(s):

Fault Tolerant ◽

System Level ◽

Sensitivity Analyses ◽

System Architectures ◽

Fault Tolerant System