Recovery Time and Fault Tolerance Improvement for Circuits mapped on SRAM-based FPGAs

Application checkpointing is a widely used recovery mechanism that consists of saving an application's state periodically to be used in case of a failure. In this study we investigate the utilisation of distributed checkpointing for replicated state machines. Conventionally, for replicated state machines, checkpointing information is stored in a replicated way in each of the replicas or separately in a single instance. Applying distributed checkpointing provides a means to adjust the level of fault tolerance of the checkpointing approach by giving away from recovery time. We use a local cluster and cloud environment to examine the effects of distributed checkpointing in a simple state machine example and compare the results with conventional approaches. As expected, distributed checkpointing gains from memory consumption and utilise different levels of fault tolerance while performing worse in terms of recovery time.

Download Full-text

Fault Tolerance Improvement for Cloud Data Center

Journal of Communications ◽

10.12720/jcm.12.7.412-418 ◽

2017 ◽

pp. 412-418

Author(s):

Humphrey Emesowum ◽

◽

Athanasios Paraskelidis ◽

Mo Adda

Keyword(s):

Fault Tolerance ◽

Data Center ◽

Cloud Data Center ◽

Cloud Data ◽

Tolerance Improvement

Download Full-text

Redundant logic insertion and fault tolerance improvement in combinational circuits

2017 International Conference on Circuits, System and Simulation (ICCSS) ◽

10.1109/cirsyssim.2017.8023171 ◽

2017 ◽

Cited By ~ 2

Author(s):

P. Balasubramanian ◽

R. T. Naayagi

Keyword(s):

Fault Tolerance ◽

Combinational Circuits ◽

Tolerance Improvement

Download Full-text

Fault Tolerance Improvement through Architecture Change in Artificial Neural Networks

Advances in Computation and Intelligence - Lecture Notes in Computer Science ◽

10.1007/978-3-540-92137-0_28 ◽

2008 ◽

pp. 248-257 ◽

Cited By ~ 6

Author(s):

Fernando Morgado Dias ◽

Ana Antunes

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Fault Tolerance ◽

Artificial Neural ◽

Tolerance Improvement

Download Full-text

Fault Tolerance Improvement of IPM Type BLDC Motor Considering Winding Configuration under a Stator Inter-Turn Fault Condition

The Transactions of The Korean Institute of Electrical Engineers ◽

10.5370/kiee.2011.60.3.524 ◽

2011 ◽

Vol 60 (3) ◽

pp. 524-530

Author(s):

Hee-Woon Kim ◽

Jin-Gyu Yoon ◽

Jin Hur

Keyword(s):

Fault Tolerance ◽

Bldc Motor ◽

Tolerance Improvement

Download Full-text

Increasing fault tolerance of data plane on the internet of things using the software-defined networks

PeerJ Computer Science ◽

10.7717/peerj-cs.543 ◽

2021 ◽

Vol 7 ◽

pp. e543

Author(s):

Katayoun Bakhshi Kiadehi ◽

Amir Masoud Rahmani ◽

Amir Sabbagh Molahosseini

Keyword(s):

Fault Tolerance ◽

Internet Of Things ◽

Service Quality ◽

Packet Loss ◽

Recovery Time ◽

The Internet ◽

Software Defined Networks ◽

Data Plane ◽

Shared Risk ◽

The Internet Of Things

Considering the Internet of Things (IoT) impact in today’s world, uninterrupted service is essential, and recovery has received more attention than ever before. Fault-tolerance (FT) is an essential aspect of network resilience. Fault-tolerance mechanisms are required to ensure high availability and high reliability in systems. The advent of software-defined networking (SDN) in the IoT plays a significant role in providing a reliable communication platform. This paper proposes a data plane fault-tolerant architecture using the concepts of software-defined networks for IoT environments. In this work, a mathematical model called Shared Risk Link Group (SRLG) calculates redundant paths as the primary and backup non-overlapping paths between network equipment. In addition to the fault tolerance, service quality was considered in the proposed schemes. Putting the percentage of link bandwidth usage and the rate of link delay in calculating link costs makes it possible to calculate two completely non-overlapping paths with the best condition. We compare our two proposed dynamic schemes with the hybrid disjoint paths (Hybrid_DP) method and our previous work. IoT developments, wireless and wired equipment are now used in many industrial and commercial applications, so the proposed hybrid dynamic method supports both wired and wireless devices; furthermore multiple link failures will be supported in the two proposed dynamic schemes. Simulation results indicate that, while reducing the error recovery time, the two proposed dynamic designs lead to improved service quality parameters such as packet loss and delay compared to the Hybrid_DP method. The results show that in case of a link failure in the network, the proposed hybrid dynamic scheme’s recovery time is approximately 12 ms. Furthermore, in the proposed hybrid dynamic scheme, on average, the recovery time, the packet loss, and the delay improved by 22.39%, 8.2%, 5.66%, compared to the Hybrid_DP method, respectively.

Download Full-text

Fault-tolerance improvement of planar adaptive routing based on detailed traffic analysis

2007 22nd international symposium on computer and information sciences ◽

10.1109/iscis.2007.4456897 ◽

2007 ◽

Author(s):

A. Shamaei ◽

A. Nayebi ◽

H. Sarbazi-Azad

Keyword(s):

Fault Tolerance ◽

Adaptive Routing ◽

Traffic Analysis ◽

Tolerance Improvement

Download Full-text