Genetic multistart algorithm for the design of fault-tolerant systems

Author(s):  
K Echtle ◽  
I Eusgeld ◽  
D Hirsch

This paper presents a new approach to the multiobjective design of fault-tolerant systems. The design objectives are fault tolerance and cost. Reducing the cost is of particular importance for fault-tolerant systems because the overhead caused by redundant components is considerable. The new design method consists of a special genetic algorithm that is tailored to the particular issues of fault-tolerant systems. The interface of the present tool ePADuGA (elitist and Pareto-based Approach to Design fault-tolerant systems using a Genetic Algorithm) allows for adaptation to various fields of application. The degree of fault tolerance is measured by the number of tolerated faults rather than traditional reliability metrics, because reliability numbers are mostly unknown during early design phases. The special features of the genetic algorithm comprise a graph-oriented representation of systems (which are the individuals during the evolutionary process), a simple yet expressive fault model, a very efficient procedure for fault-tolerance evaluation, and a Pareto-oriented fitness function. In a genetic algorithm generating thousands of individuals, a very fast evaluation of each individual is mandatory. For this purpose, state-space-oriented evaluation methods have been cut down to an extremely simple function which is still sufficient to assess the fault tolerance of individuals. An innovative aspect is also a multistart technique to find a Pareto solution set, which is independent of any parameters. In this paper, experimental results are presented showing the feasibility of the approach as well as the usefulness of the final fault-tolerant architectures, particularly in the field of mechatronic systems.

Author(s):  
A Thompson

Artificial evolution can integrate fault tolerance considerations into the automatic design process, producing inherently fault-tolerant designs without explicit redundant parts. Population dynamics can give rise to some level of fault tolerance ‘for free’. Requirements for fault tolerance can also be incorporated into the fitness function. The practicalities of these methods are investigated, grounded in the study of a real-world evolved electronic control system for a robot.


Another alternative derating, which was described in the previous chapter, is application of fault tolerant structures for the power converter. Fault tolerance is the property that enables a system to continue operating properly in the event of a failure of (or one or more faults within) some of its components. Fault tolerant systems are systems that can be operating after fault occurrence with no degraded performance in their basic functional requirements. This is the main difference between fault tolerant systems and derated systems. In this chapter, some methods for fault tolerance in electric power converters are presented. Fault tolerance is almost the only method for achieving a desired reliability in a converter that operates with non-zero fault possibility. There are two main approaches for this aim: re-configuration of the faulty system and using redundant systems. Redundancy is the provision of functional capabilities that would be unnecessary in a fault-free environment. Various types of redundant systems as passive and active redundancy are described and their application in power supply systems is presented. A new approach for a reliable and fault tolerant power supply is proposed and justified with experimental results. The concept of fault tolerance in electrical machines is presented.


Author(s):  
Wenbing Zhao

The use of good random numbers is crucial to the security of many mission-critical systems. However, when such systems are replicated for Byzantine fault tolerance, a serious issue arises, i.e., how do we preserve the integrity of the systems while ensuring strong replica consistency? Despite the fact that there exists a large body of work on how to render replicas deterministic under the benign fault model, the solutions regarding the random number control are often overly simplistic without regard to the security requirement, and hence, they are not suitable for practical Byzantine fault tolerance. In this chapter, we present a novel integrity-preserving replica coordination algorithm for Byzantine fault tolerant systems. The central idea behind our CD-BFT algorithm is that all random numbers to be used by the replicas are collectively determined, based on the contributions made by a quorum of replicas, at least f+1 of which are not faulty.


Author(s):  
I.V. Asharina

This three-part paper analyzes existing approaches and methods of organizing failure- and fault-tolerant computing in distributed multicomputer systems (DMCS), identifies and provides rationale for a list of issues to be solved. We review the application areas of failure- and fault- tolerant control systems for complex network and distributed objects. The second part further investigates the issues of organizing failure- and fault- tolerance in the DMCS. The systemic, functional, and test diagnostics are viewed as the basis for building unattended failure- and fault-tolerant systems. We introduce the concept of self-managed degradation (when the DMCS eventually proceeds to a safe shutdown at a critical level of degradation) as a means to increase the DMCS active life.


2021 ◽  
Vol 2021 ◽  
pp. 1-6
Author(s):  
Shuang Wang

The two most important factors that must be considered in the wireless sensor networks are energy efficiency and fault tolerance. Multipath routing is an effective method to improve the fault tolerance in wireless sensor networks. By taking the energy consumption into consideration, in this paper, a multipath routing algorithm for the wireless sensor networks based on the genetic algorithm is proposed. The proposed algorithm computes the fitness function by using the distance between nodes in the network and then generates the routing scheme at the base station. The routing scheme is shared with all the nodes of the entire network, to realize the multipath routing for each node. Finally, the simulation experiment is used to verify the validity of our method, and the results show that the routing method in this paper has a better effect.


Author(s):  
Wenbing Zhao

The use of good random numbers is crucial to the security of many mission-critical systems. However, when such systems are replicated for Byzantine fault tolerance, a serious issue arises (i.e., how do we preserve the integrity of the systems while ensuring strong replica consistency?). Despite the fact that there exists a large body of work on how to render replicas deterministic under the benign fault model, the solutions regarding the random number control are often overly simplistic without regard to the security requirement, and hence, they are not suitable for practical Byzantine fault tolerance. In this chapter, the authors present a novel integrity-preserving replica coordination algorithm for Byzantine fault tolerant systems. The central idea behind our CD-BFT algorithm is that all random numbers to be used by the replicas are collectively determined, based on the contributions made by a quorum of replicas, at least f+1 of which are not faulty.


2012 ◽  
Vol 472-475 ◽  
pp. 2235-2240 ◽  
Author(s):  
Tu Lan ◽  
Sheng Ju Tang ◽  
Bo Chen ◽  
De Ke Guo

Because of the outstanding advantages of genetic algorithms in solving optimization problems of engineering design, in this paper, using genetic algorithm combined with Kansei Engineering as a optimal design method for product’s semantics modeling design, and using real-coded way to characterize shape characteristics of CNC machine tools, comprehensive accounted of user’s semantic needs and color Mido, then, established the fitness function, combined with the key components to complete the design. By case analysis shows that the method is feasible, and lays a foundation for intelligent design of CNC machine tools.


Author(s):  
Sastry Kodanda Rama Jammalamadaka ◽  
Kamesh Bala Krishna Duvvuri ◽  
Devi Anusha CH ◽  
Padmini P ◽  
Siva Anjaneyulu G

<span>Cloud computing technologies and infrastructure facilities are coming up in a big way making it cost effective for the users to implement their IT based solutions to run business in most cost-effective and economical way. Many intricate issues however, have cropped-up which must be addressed to be able to use clouds the purpose for which they are designed and implemented. Among all, fault tolerance and securing the data stored on the clouds takes most of the importance. Continuous availability of the services is dependent on many factors. Faults bound to happen within a network, software, and platform or within the infrastructure which are all used for establishing the cloud. The network that connects various servers, devices, peripherals etc., have to be fault tolerant to start-with so that intended and un-interrupted services to the user can be made available. A novel network design method that leads to achieve high availability of the network and thereby the cloud itself has been presented in this paper</span>


2012 ◽  
Vol 546-547 ◽  
pp. 1574-1579
Author(s):  
Zhi Wen Xiong ◽  
Wen Feng Wang ◽  
Hong Zeng

Fault tolerant is one of major requirements for embedded systems. As the embedded systems become more and more complex, more chances for various fault. When design embedded system developer has to handle these faults. Before handling faults designer has to identify and understand the types and nature of faults.Faults is the sources for low dependability, faults can be hardware and software. Hardware faults can be distinguished from systematic faults like software or design errors. The Fault can be deleted, such as extensive testing or formal verification and tolerated by fault tolerance techniques. We restrict ourselves to the problem of fault tolerance and refer to other methods for troubleshooting.This paper discusses a new design method about the fault tolerant system of embedded system. We designed a fault tolerant system of data acquisition system in dynamically re-configurable FPGA. The experiment results show that the system not only be able to higher self-adaptive ability and reliability, but also can Through the FGPA to complete a specific algorithm.


Author(s):  
Sastry Kodanda Rama Jammalamadaka ◽  
Kamesh Bala Krishna Duvvuri ◽  
Devi Anusha CH ◽  
Padmini P ◽  
Siva Anjaneyulu G

<span>Cloud computing technologies and infrastructure facilities are coming up in a big way making it cost effective for the users to implement their IT based solutions to run business in most cost-effective and economical way. Many intricate issues however, have cropped-up which must be addressed to be able to use clouds the purpose for which they are designed and implemented. Among all, fault tolerance and securing the data stored on the clouds takes most of the importance. Continuous availability of the services is dependent on many factors. Faults bound to happen within a network, software, and platform or within the infrastructure which are all used for establishing the cloud. The network that connects various servers, devices, peripherals etc., have to be fault tolerant to start-with so that intended and un-interrupted services to the user can be made available. A novel network design method that leads to achieve high availability of the network and thereby the cloud itself has been presented in this paper</span>


Sign in / Sign up

Export Citation Format

Share Document