Efficiency-First Fault-Tolerant Replica Scheduling Strategy for Reliability Constrained Cloud Application

Author(s):  
Yingxue Zhang ◽  
Guisheng Fan ◽  
Huiqun Yu ◽  
Xingpeng Chen
2007 ◽  
Vol 1 (3) ◽  
pp. 329-337
Author(s):  
Yuanzhuo Wang ◽  
Chuang Lin ◽  
Yang Yang ◽  
Zhiguang Shan

2021 ◽  
Vol 2021 ◽  
pp. 1-17
Author(s):  
Zheng Liu ◽  
Guisheng Fan ◽  
Huiqun Yu ◽  
Liqiong Chen

Microservice architecture is a cloud-native architectural style, which has attracted extensive attention from the scientific research and industry communities to benefit independent development and deployment. However, due to the complexity of cloud-based platforms, the design of fault-tolerant strategies for microservice-oriented cloud applications becomes challenging. In order to improve the quality of service, it is essential to focus on the microservice with more criticality and maximize the reliability of the entire cloud application. This paper studies the modeling and analysis of service reliability in the cloud environment. Firstly, a formal description language is defined to model microservice, user request, and container accurately. Secondly, the reliability analysis is conducted to measure a critical microservice’s fluctuation and vibration attributes within a period, and the related properties of the constructed model are analyzed. Thirdly, a fault-tolerant strategy with redundancy operation has been proposed to optimize cloud application reliability. Finally, the effectiveness of the method is verified by experiments. The simulation results show that the algorithm obtains the maximum benefits and has high performance through several experiments.


2021 ◽  
Author(s):  
QIN Jun ◽  
SONG Yanyan ◽  
ZONG Ping

With the rapid development and popularization of information technology, cloud computing technology provides a good environment for solving massive data processing. Hadoop is an open-source implementation of MapReduce and has the ability to process large amounts of data. Aiming at the shortcomings of the fault-tolerant technology in the MapReduce programming model, this paper proposes a reliability task scheduling strategy that introduces a failure recovery mechanism, evaluates the trustworthiness of resource nodes in the cloud environment, establishes a trustworthiness model, and avoids task allocation to low reliability node, causing the task to be re-executed, wasting time and resources. Finally, the simulation platform CloudSim verifies the validity and stability of the task scheduling algorithm and scheduling model proposed in this paper.


2021 ◽  
Vol 12 (05) ◽  
pp. 01-09
Author(s):  
Jun QIN ◽  
Yanyan SONG ◽  
Ping ZONG

MapReduce is a distributed computing model for cloud computing to process massive data. It simplifies the writing of distributed parallel programs. For the fault-tolerant technology in the MapReduce programming model, tasks may be allocated to nodes with low reliability. It causes the task to be reexecuted, wasting time and resources. This paper proposes a reliability task scheduling strategy with a failure recovery mechanism, evaluates the trustworthiness of resource nodes in the cloud environment and builds a trustworthiness model. By using the simulation platform CloudSim, the stability of the task scheduling algorithm and scheduling model are verified in this paper.


2012 ◽  
Vol 62 (2) ◽  
pp. 855-873 ◽  
Author(s):  
Babar Nazir ◽  
Kalim Qureshi ◽  
Paul Manuel

Sign in / Sign up

Export Citation Format

Share Document