Dynamic Software Aging Detection-Based Fault Tolerant Software Rejuvenation Model for Virtualized Environment

Author(s):  
I. M. Umesh ◽  
G N. Srinivasan
2021 ◽  
Author(s):  
Jigar Patel

When a fault-tolerant layered distributed system continues its operation despite the presence of component failures, its performance is usually degraded. Its performance can also be degraded if it is executing continuously for a long period of time due to a phenomenon known as software aging. To prevent unexpected or unplanned outages due to aging, a pro-active technique called software rejuvenation can be employed. This technique involves gracefully terminating an application and immediately restarting it with a refreshed internal state. For proper modeling of these systems, their performance and dependiability characteristics need to be considered in a unified way, called performability. This thesis proposes a new model called "Rejuvenated-FTLQN", to evaluate the effects of software aging and rejuvenation on performability of these layered systems. Specifically a Layered Queueing Network (LQN) is used for performance analysis and a Multi State Fault Tree (MSFT) is used for dependability analysis. The model is also used to study the impact of performing rejuvenation, time to perform rejuvenation and rejuvenation frequencey on performability of a system. A software tool called "Rejuvenated-FTLQNS" has been developed to automate the model solution.


2021 ◽  
Author(s):  
Jigar Patel

When a fault-tolerant layered distributed system continues its operation despite the presence of component failures, its performance is usually degraded. Its performance can also be degraded if it is executing continuously for a long period of time due to a phenomenon known as software aging. To prevent unexpected or unplanned outages due to aging, a pro-active technique called software rejuvenation can be employed. This technique involves gracefully terminating an application and immediately restarting it with a refreshed internal state. For proper modeling of these systems, their performance and dependiability characteristics need to be considered in a unified way, called performability. This thesis proposes a new model called "Rejuvenated-FTLQN", to evaluate the effects of software aging and rejuvenation on performability of these layered systems. Specifically a Layered Queueing Network (LQN) is used for performance analysis and a Multi State Fault Tree (MSFT) is used for dependability analysis. The model is also used to study the impact of performing rejuvenation, time to perform rejuvenation and rejuvenation frequencey on performability of a system. A software tool called "Rejuvenated-FTLQNS" has been developed to automate the model solution.


Author(s):  
Nagaraj G Cholli ◽  
Srinivasan G N

A software aging in convoluted system refers to the situation where software degrades with span of time. This phenomenon, which may eventually lead to system performance degradation or crash/hang failure, is the result of depletion of operating system resources, data deception and numerical error assembly. A technique called software rejuvenation has been incorporated, which essentially involves periodic aborting an application or a system, flushing its intramural state and re-starting it. A main issue in rejuvenation is to discover ideal time to initiate software rejuvenation. Software rejuvenation is a proactive technique that allows preventing the occurrence of software failing. A novel approach called Smart interval and payload (SIP) policy is introduced to overcome all the hurdles in the present scenario based on Software Rejuvenation approaches. SIP policy accepts time from user and optimizes the rejuvenation time whenever workload is variable; otherwise the system is rejuvenated at its rejuvenation point. SIP policy avoids software failure and it helps to achieve high availability of convoluted system.


Author(s):  
Shruthi P. ◽  
Nagaraj G. Cholli

Cloud Computing is the environment in which several virtual machines (VM) run concurrently on physical machines. The cloud computing infrastructure hosts multiple cloud service segments that communicate with each other using the interfaces. This creates distributed computing environment. During operation, the software systems accumulate errors or garbage that leads to system failure and other hazardous consequences. This status is called software aging. Software aging happens because of memory fragmentation, resource consumption in large scale and accumulation of numerical error. Software aging degrads the performance that may result in system failure. This happens because of premature resource exhaustion. This issue cannot be determined during software testing phase because of the dynamic nature of operation. The errors that cause software aging are of special types. These errors do not disturb the software functionality but target the response time and its environment. This issue is to be resolved only during run time as it occurs because of the dynamic nature of the problem. To alleviate the impact of software aging, software rejuvenation technique is being used. Rejuvenation process reboots the system or re-initiates the softwares. This avoids faults or failure. Software rejuvenation removes accumulated error conditions, frees up deadlocks and defragments operating system resources like memory. Hence, it avoids future failures of system that may happen due to software aging. As service availability is crucial, software rejuvenation is to be carried out at defined schedules without disrupting the service. The presence of Software rejuvenation techniques can make software systems more trustworthy. Software designers are using this concept to improve the quality and reliability of the software. Software aging and rejuvenation has generated a lot of research interest in recent years. This work reviews some of the research works related to detection of software aging and identifies research gaps.


2015 ◽  
Vol 25 (09n10) ◽  
pp. 1491-1513 ◽  
Author(s):  
Jean Rahme ◽  
Haiping Xu

Correctly measuring the reliability and availability of a cloud-based system is critical for evaluating its system performance. Due to the promised high reliability of physical facilities provided for cloud services, software faults have become one of the major factors for the failures of cloud-based systems. In this paper, we focus on the software aging phenomenon where system performance may be progressively degraded due to exhaustion of system resources, fragmentation and accumulation of errors. We use a proactive technique, called software rejuvenation, to counteract the software aging problem. The dynamic fault tree (DFT) formalism is adopted to model the system reliability before and during a software rejuvenation process in an aging cloud-based system. A novel analytical approach is presented to derive the reliability function of a cloud-based Hot SPare (HSP) gate, which is further verified using Continuous Time Markov Chains (CTMC) for its correctness. We use a case study of a cloud-based system to illustrate the validity of our approach. Based on the reliability analytical results, we show how cost-effective software rejuvenation schedules can be created to keep the system reliability consistently staying above a predefined critical level.


Author(s):  
KAZUKI IWAMOTO ◽  
TADASHI DOHI ◽  
NAOTO KAIO

Software rejuvenation is a preventive and proactive solution that is particularly useful for counteracting the phenomenon of software aging. In this article, we consider the similar periodic software rejuvenation model to Garg et al.13 under the different operation circumstance. That is, we model the stochastic behavior of telecommunication billing applications by using a discrete-time Markov regenerative process, and determine the optimal periodic software rejuvenation schedule maximizing the so-called cost effectiveness, in discrete-time setting. Also, we provide a statistically non-parametric method to estimate the optimal software rejuvenation schedule, based on the discrete total time on test concept. Numerical examples are devoted to illustrate the determination/estimation of the optimal software rejuvenation schedule and to examine the asymptotic behavior of the estimator developed here.


Sign in / Sign up

Export Citation Format

Share Document