An adaptive fault tolerance strategy for cloud storage

2021 ◽  
pp. 102217
Author(s):  
Yu Wu ◽  
Duo Liu ◽  
Xianzhang Chen ◽  
Jinting Ren ◽  
Renping Liu ◽  
...  

2011 ◽  
Vol 6 (4) ◽  
pp. 34-41
Author(s):  
Zheng Chen ◽  
Yang Xu ◽  
Xiaojing Wang ◽  
Yili Jin

1992 ◽  
Vol 02 (03) ◽  
pp. 281-304
Author(s):  
SANJAY P. POPLI ◽  
MAGDY A. BAYOUMI ◽  
AKASH TYAGI

Real-time digital signal processing (DSP) applications require high-performance parallel architectures that are also reliable. VLSI arrays are good candidates for providing the high throughput these applications require. These arrays, which consist of a number of regularly interconnected processing elements (PEs), will not function correctly in the presence of even a single fault in any PE. Fault tolerance has therefore become a vital design criterion for VLSI arrays. In this paper, a fault tolerance strategy for VLSI arrays is proposed that significantly improves the reliability of the system. The scheme is composed of two phases: testing and locating faults (fault detection and diagnosis), and reconfiguration. The first phase employs an on-line error detection technique that strikes a compromise between the space and time redundancy approaches. This concurrent error detection technique reduces the rollback time considerably. The reconfiguration phase is carried out by a global controller responsible for changing the states of the switches in the interconnection network. Backtracking is introduced into the algorithm to maximize processor utilization while keeping the interconnection network as simple as possible. Finally, a reliability analysis of the scheme using a Markov model and a comparison with some previous schemes are given.
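The reliability gain from spare PEs can be illustrated with a short sketch. It is not the Markov model from the paper: it assumes independent PE failures, a fixed per-PE reliability, and ideal reconfiguration (any spare can replace any faulty PE), which reduces the problem to a k-out-of-n calculation. The function name and parameters are hypothetical.

```python
from math import comb


def array_reliability(n_active: int, n_spares: int, r_pe: float) -> float:
    """Probability that at least n_active of (n_active + n_spares) PEs work.

    Assumptions (not from the paper): PEs fail independently, each works
    with probability r_pe, and reconfiguration is ideal.
    """
    total = n_active + n_spares
    return sum(
        comb(total, k) * r_pe**k * (1.0 - r_pe) ** (total - k)
        for k in range(n_active, total + 1)
    )


# With no spares, a single faulty PE breaks the array, so R = r_pe ** n.
no_spares = array_reliability(4, 0, 0.9)
# One spare lets the array survive any single PE fault.
one_spare = array_reliability(4, 1, 0.9)
```

Under these assumptions a single spare raises the reliability of a four-PE array well above the unprotected case; a real scheme would score lower because switches and the controller can also fail.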


2013 ◽  
Vol 2013 ◽  
pp. 1-11
Author(s):  
Chao Yin ◽  
Changsheng Xie ◽  
Jiguang Wan ◽  
Chih-Cheng Hung ◽  
Jinjiang Liu ◽  
...  

To protect data in cloud storage, fault tolerance and efficient recovery are very important. Recent studies have developed numerous solutions based on erasure coding techniques that solve this problem using functional repair. However, two limitations remain. The first is consistency, since the Encoding Matrix (EM) differs among clouds. The other is repair bandwidth, which is a widely shared concern. We address these two problems from both theoretical and practical perspectives. We developed BMCloud, a new cloud storage system that aims to reduce repair bandwidth and maintenance cost. The system employs both functional repair and exact repair and inherits the advantages of both. We propose the JUDGE_STYLE algorithm, which decides whether the system should adopt exact repair or functional repair. We implemented a networked storage system prototype and demonstrated our findings. Compared with existing solutions, BMCloud can be used in engineering practice to significantly reduce repair bandwidth and maintenance cost.
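As a rough illustration of what exact repair means, the sketch below uses a single XOR parity block, the simplest erasure code. It is not BMCloud's code or its JUDGE_STYLE algorithm; the function names are hypothetical and all blocks are assumed to have equal length. Exact repair rebuilds the lost block byte for byte, whereas functional repair would only need to produce a block that keeps the stripe recoverable.

```python
from functools import reduce


def xor_blocks(blocks):
    """Byte-wise XOR of equal-length blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def encode(data_blocks):
    """Return the stripe: the data blocks followed by one XOR parity block."""
    return list(data_blocks) + [xor_blocks(data_blocks)]


def exact_repair(stripe, lost_index):
    """Rebuild the block at lost_index from all surviving blocks.

    The XOR of every block in the stripe is zero, so the XOR of the
    survivors equals the missing block exactly.
    """
    survivors = [block for i, block in enumerate(stripe) if i != lost_index]
    return xor_blocks(survivors)


stripe = encode([b"abcd", b"efgh", b"ijkl"])
rebuilt = exact_repair(stripe, 1)  # recovers the second data block
```

Note that this repair reads every surviving block, which is the repair-bandwidth cost that more elaborate codes try to reduce.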


2014 ◽  
Vol 511-512 ◽  
pp. 1012-1016
Author(s):  
Zhi Qiang Wang ◽  
Xiao Long Li ◽  
Qing Zhen Wang

For current sensor failures on a maglev train, an active fault tolerance control strategy based on feedback gain reconfiguration is proposed. A fault diagnosis unit based on a state observer is designed to monitor the output of the current sensor, and the diagnosis result is used to switch the control strategy. Simulation results indicate that the fault tolerance strategy meets the demands of the system.
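Observer-based sensor fault detection can be sketched as follows. This is a minimal illustration, not the maglev model from the paper: the scalar plant, observer gain, bias fault, and threshold are all assumed values, and the function name is hypothetical. An observer predicts the sensor reading, and a fault is flagged when the residual (measured minus predicted) exceeds a threshold.

```python
def first_fault_step(n_steps=50, fault_step=20, bias=0.5, threshold=0.1):
    """Return the first step at which the residual exceeds the threshold.

    Assumed plant:    x[k+1]     = a*x[k] + b*u
    Assumed observer: x_hat[k+1] = a*x_hat[k] + b*u + gain*(y[k] - x_hat[k])
    The sensor reads y = x, plus a constant bias from fault_step onward.
    Returns None if no fault is detected.
    """
    a, b, gain, u = 0.9, 0.1, 0.5, 1.0  # illustrative values only
    x = x_hat = 0.0
    for k in range(n_steps):
        faulty = fault_step is not None and k >= fault_step
        y = x + (bias if faulty else 0.0)      # sensor reading
        residual = y - x_hat                   # measured minus predicted
        if abs(residual) > threshold:
            return k                           # diagnosis: sensor fault
        x = a * x + b * u                      # true plant update
        x_hat = a * x_hat + b * u + gain * residual  # observer update
    return None


detected_at = first_fault_step()               # bias injected at step 20
healthy = first_fault_step(fault_step=None)    # no fault injected
```

In an active fault-tolerant scheme, the detection result would then trigger the switch to a reconfigured control law; that step is omitted here.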

