Recovery Time and Fault Tolerance Improvement for Circuits mapped on SRAM-based FPGAs

2014 ◽  
Vol 30 (4) ◽  
pp. 425-442 ◽  
Author(s):  
Anees Ullah ◽  
Luca Sterpone
Author(s):  
Humphrey Emesowum ◽  
◽  
Athanasios Paraskelidis ◽  
Mo Adda

2021 ◽  
Vol 22 (1) ◽  
pp. 67-79
Author(s):  
Özdinç Çelikel ◽  
Tolga Ovatman

Application checkpointing is a widely used recovery mechanism that consists of saving an application's state periodically to be used in case of a failure. In this study we investigate the utilisation of distributed checkpointing for replicated state machines. Conventionally, for replicated state machines, checkpointing information is stored in a replicated way in each of the replicas or separately in a single instance. Applying distributed checkpointing provides a means to adjust the level of fault tolerance of the checkpointing approach by giving away from recovery time. We use a local cluster and cloud environment to examine the effects of distributed checkpointing in a simple state machine example and compare the results with conventional approaches. As expected, distributed checkpointing gains from memory consumption and utilise different levels of fault tolerance while performing worse in terms of recovery time.


2017 ◽  
pp. 412-418
Author(s):  
Humphrey Emesowum ◽  
◽  
Athanasios Paraskelidis ◽  
Mo Adda

2021 ◽  
Vol 7 ◽  
pp. e543
Author(s):  
Katayoun Bakhshi Kiadehi ◽  
Amir Masoud Rahmani ◽  
Amir Sabbagh Molahosseini

Considering the Internet of Things (IoT) impact in today’s world, uninterrupted service is essential, and recovery has received more attention than ever before. Fault-tolerance (FT) is an essential aspect of network resilience. Fault-tolerance mechanisms are required to ensure high availability and high reliability in systems. The advent of software-defined networking (SDN) in the IoT plays a significant role in providing a reliable communication platform. This paper proposes a data plane fault-tolerant architecture using the concepts of software-defined networks for IoT environments. In this work, a mathematical model called Shared Risk Link Group (SRLG) calculates redundant paths as the primary and backup non-overlapping paths between network equipment. In addition to the fault tolerance, service quality was considered in the proposed schemes. Putting the percentage of link bandwidth usage and the rate of link delay in calculating link costs makes it possible to calculate two completely non-overlapping paths with the best condition. We compare our two proposed dynamic schemes with the hybrid disjoint paths (Hybrid_DP) method and our previous work. IoT developments, wireless and wired equipment are now used in many industrial and commercial applications, so the proposed hybrid dynamic method supports both wired and wireless devices; furthermore multiple link failures will be supported in the two proposed dynamic schemes. Simulation results indicate that, while reducing the error recovery time, the two proposed dynamic designs lead to improved service quality parameters such as packet loss and delay compared to the Hybrid_DP method. The results show that in case of a link failure in the network, the proposed hybrid dynamic scheme’s recovery time is approximately 12 ms. Furthermore, in the proposed hybrid dynamic scheme, on average, the recovery time, the packet loss, and the delay improved by 22.39%, 8.2%, 5.66%, compared to the Hybrid_DP method, respectively.


Sign in / Sign up

Export Citation Format

Share Document