A PARALLEL PROBABILISTIC SYSTEM-LEVEL FAULT DIAGNOSIS APPROACH FOR LARGE MULTIPROCESSOR SYSTEMS

In this paper, we present a system-level fault identification algorithm, based on a parallel genetic algorithm, for diagnosing faulty nodes in large heterogeneous systems. The algorithm relies on a probabilistic model in which each node fails with an a priori probability p. The assumptions concerning test outcomes are the same as in the PMC model: fault-free testers always give correct test outcomes, whereas the outcomes of faulty testers are totally unpredictable. The parallel diagnosis algorithm was implemented and simulated on randomly generated large systems. The proposed parallelization is intended to speed up the evolutionary diagnosis approach, reducing the computation time by evolving several sub-populations in parallel. Simulation results show that the parallel diagnosis improved the efficiency of the evolutionary approach by diagnosing faulty situations faster, making it a viable alternative to existing diagnosis techniques. Moreover, the evolutionary approach still provides good results even when extreme non-diagnosable faulty situations are considered.
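The test-outcome assumptions described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`sample_faults`, `pmc_syndrome`, `is_consistent`) and the ring-shaped test assignment are hypothetical, and the consistency check is only one plausible ingredient of a fitness function for a genetic algorithm.

```python
import random

def sample_faults(nodes, p, rng):
    """Each node fails independently with a priori probability p."""
    return {n for n in nodes if rng.random() < p}

def pmc_syndrome(tests, faulty, rng):
    """Generate test outcomes under PMC-style assumptions.

    tests  : iterable of (tester, tested) pairs
    faulty : set of nodes that are actually faulty
    Outcome 1 means "tested node reported faulty", 0 means "reported fault-free".
    """
    syndrome = {}
    for tester, tested in tests:
        if tester in faulty:
            # A faulty tester is unpredictable: model it as a coin flip.
            syndrome[(tester, tested)] = rng.randint(0, 1)
        else:
            # A fault-free tester always reports the truth.
            syndrome[(tester, tested)] = 1 if tested in faulty else 0
    return syndrome

def is_consistent(candidate, syndrome):
    """True if the candidate fault set could have produced the syndrome.

    Only outcomes reported by testers assumed fault-free are constrained.
    """
    return all(
        outcome == (1 if tested in candidate else 0)
        for (tester, tested), outcome in syndrome.items()
        if tester not in candidate
    )

# Hypothetical 5-node system: node i tests nodes i+1 and i+2 (mod 5).
nodes = range(5)
tests = [(i, (i + k) % 5) for i in nodes for k in (1, 2)]
syndrome = pmc_syndrome(tests, faulty={2}, rng=random.Random(0))
```

In this toy system, `is_consistent({2}, syndrome)` holds for any random seed, while the empty fault set is rejected because fault-free node 0 reports node 2 as faulty.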

Accelerating simulations using REDCHEM_v0.0 for atmospheric chemistry mechanism reduction

Geoscientific Model Development ◽

10.5194/gmd-11-3391-2018 ◽

2018 ◽

Vol 11 (8) ◽

pp. 3391-3407 ◽

Cited By ~ 1

Author(s):

Zacharias Marinou Nikolaou ◽

Jyh-Yuan Chen ◽

Yiannis Proestos ◽

Jos Lelieveld ◽

Rolf Sander

Keyword(s):

Atmospheric Chemistry ◽

Transport Model ◽

A Priori ◽

Chemical Mechanism ◽

Climate Modelling ◽

Mechanism Reduction ◽

Computational Speed ◽

Speed Up ◽

Spatio Temporal ◽

Skeletal Mechanism

Abstract. Chemical mechanism reduction is common practice in combustion research for accelerating numerical simulations; however, there have been limited applications of this practice in atmospheric chemistry. In this study, we employ a powerful reduction method to produce a skeletal mechanism of an atmospheric chemistry code that is commonly used in air quality and climate modelling. The skeletal mechanism is developed using input data from a model scenario. Its performance is then evaluated both a priori against the model scenario results and a posteriori by implementing the skeletal mechanism in a chemistry transport model, namely the Weather Research and Forecasting code with Chemistry. Preliminary results indicate a substantial computational speed-up in both cases, with a minimal loss of accuracy with regard to the simulated spatio-temporal mixing ratio of the target species, which was selected to be ozone.

The Regularized Weak Functional Matching Pursuit for linear inverse problems

Journal of Inverse and Ill-Posed Problems ◽

10.1515/jiip-2018-0013 ◽

2019 ◽

Vol 27 (3) ◽

pp. 317-340 ◽

Cited By ~ 3

Author(s):

Max Kontak ◽

Volker Michel

Keyword(s):

Inverse Problems ◽

Matching Pursuit ◽

A Priori ◽

Computation Time ◽

Point Of View ◽

Computational Point ◽

Linear Inverse Problems ◽

Infinite Dimensional ◽

Frame Condition ◽

Ill Posed

Abstract. In this work, we present the so-called Regularized Weak Functional Matching Pursuit (RWFMP) algorithm, which is a weak greedy algorithm for linear ill-posed inverse problems. In comparison to the Regularized Functional Matching Pursuit (RFMP), on which it is based, the RWFMP possesses an improved theoretical analysis including the guaranteed existence of the iterates, the convergence of the algorithm for inverse problems in infinite-dimensional Hilbert spaces, and a convergence rate, which is also valid for the particular case of the RFMP. Another improvement is the removal of the previously required and difficult-to-verify semi-frame condition. Furthermore, we provide an a priori parameter choice rule for the RWFMP, which yields a convergent regularization. Finally, we give a numerical example, which shows that the “weak” approach is also beneficial from the computational point of view. By applying an improved search strategy in the algorithm, which is motivated by the weak approach, we can save up to 90% of computation time in comparison to the RFMP, whereas the accuracy of the solution does not change as much.

Improving ozone profile retrieval from spaceborne UV backscatter spectrometers using convergence behaviour diagnostics

Atmospheric Measurement Techniques ◽

10.5194/amt-3-1555-2010 ◽

2010 ◽

Vol 3 (6) ◽

pp. 1555-1568 ◽

Cited By ~ 13

Author(s):

B. Mijling ◽

O. N. E. Tuinder ◽

R. F. van Oss ◽

R. J. van der A

Keyword(s):

Cross Sections ◽

A Priori ◽

Computation Time ◽

External Input ◽

Computational Time ◽

Ozone Profile ◽

Global Performance ◽

Convergence Behaviour ◽

Low Cloud ◽

Average Computation Time

Abstract. The Ozone Profile Algorithm (OPERA), developed at KNMI, retrieves the vertical ozone distribution from nadir spectral satellite measurements of backscattered sunlight in the ultraviolet and visible wavelength range. To produce consistent global datasets, the algorithm needs to have good global performance, while a short computation time facilitates its use in near-real-time applications. To test the global performance of the algorithm, we use the convergence behaviour as a diagnostic tool for the ozone profile retrievals from the GOME instrument (on board ERS-2) for February and October 1998. In this way, we uncover different classes of retrieval problems, related to the South Atlantic Anomaly, low cloud fractions over deserts, desert dust outflow over the ocean, and the intertropical convergence zone. The influence of the first guess and the external input data, including the ozone cross-sections and the ozone climatologies, on the retrieval performance is also investigated. By using a priori ozone profiles which are selected based on the expected total ozone column, retrieval problems due to anomalous ozone distributions (such as in the ozone hole) can be avoided. With the algorithm adaptations applied, the convergence statistics improve considerably, not only increasing the number of successful retrievals but also reducing the average computation time, owing to fewer iteration steps per retrieval. For February 1998, non-convergence was brought down from 10.7% to 2.1%, while the mean number of iteration steps (which dominates the computational time) dropped 26% from 5.11 to 3.79.
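As a quick check of the figures quoted in this abstract, the relative reductions can be worked out as follows. The 80% relative reduction in non-convergence is derived here from the quoted percentages and is not stated in the source.

```latex
\frac{5.11 - 3.79}{5.11} = \frac{1.32}{5.11} \approx 0.258 \approx 26\,\%
\qquad
10.7\,\% - 2.1\,\% = 8.6 \text{ percentage points},
\quad
\frac{10.7 - 2.1}{10.7} \approx 0.80
```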

Parallel Algorithms for Spatial Rainfall Distribution

Jurnal INKOM ◽

10.14203/j.inkom.383 ◽

2014 ◽

Vol 8 (1) ◽

pp. 29 ◽

Cited By ~ 1

Author(s):

Arnida Lailatul Latifah ◽

Adi Nurhadiyatna

Keyword(s):

Parallel Algorithms ◽

Computation Time ◽

Computational Time ◽

Rainfall Distribution ◽

Flood Modelling ◽

Computation Efficiency ◽

Distance Weighting ◽

Speed Up ◽

Important Input ◽

Serial Algorithms

This paper proposes parallel algorithms for precipitation data in flood modelling, applied specifically to spatial rainfall distribution. As an important input in flood modelling, spatial distribution of rainfall is always needed as a pre-conditioned model. Two interpolation methods, inverse distance weighting (IDW) and ordinary kriging (OK), are discussed. Both are developed as parallel algorithms in order to reduce the computational time. To measure the computation efficiency, the performance of the parallel algorithms is compared to that of the serial algorithms for both methods. Findings indicate that: (1) the computation time of the OK algorithm is up to 23% longer than that of IDW; (2) the computation time of the OK and IDW algorithms increases linearly with the number of cells/points; (3) the computation time of the parallel algorithms for both methods decays exponentially with the number of processors, with a decay factor of 0.52 for IDW and 0.53 for OK; (4) the parallel algorithms achieve near-ideal speed-up.
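For readers unfamiliar with IDW, the following is a minimal serial sketch in Python of the standard inverse distance weighting formula. It is not the paper's code: the function name `idw`, the station tuples and the default exponent `power=2.0` are assumptions made for illustration.

```python
import math

def idw(stations, query, power=2.0):
    """Inverse distance weighting estimate at a query point.

    stations : list of (x, y, rainfall) tuples (hypothetical gauge data)
    query    : (x, y) location of the grid cell to estimate
    power    : distance exponent; 2.0 is a common default, assumed here
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for x, y, value in stations:
        distance = math.hypot(query[0] - x, query[1] - y)
        if distance == 0.0:
            # The query coincides with a station: return its value directly.
            return value
        weight = 1.0 / distance ** power
        weighted_sum += weight * value
        weight_total += weight
    return weighted_sum / weight_total

def interpolate_grid(stations, cells, power=2.0):
    """Serial loop over grid cells; every cell is computed independently."""
    return [idw(stations, cell, power) for cell in cells]
```

Because each grid cell depends only on the station list, the loop in `interpolate_grid` can be split across processors without communication between cells, which is the property a parallel version would exploit.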

Hyper-optimized tensor network contraction

Quantum ◽

10.22331/q-2021-03-15-410 ◽

2021 ◽

Vol 5 ◽

pp. 410

Author(s):

Johnnie Gray ◽

Stefanos Kourtis

Keyword(s):

Computation Time ◽

Quantum Circuit ◽

Optimization Approach ◽

Many Body ◽

Tensor Networks ◽

Randomized Protocols ◽

Classical Simulation ◽

Tensor Network ◽

Speed Up ◽

Many Body Systems

Tensor networks represent the state of the art in computational methods across many disciplines, including the classical simulation of quantum many-body systems and quantum circuits. Several applications of current interest give rise to tensor networks with irregular geometries. Finding the best possible contraction path for such networks is a central problem, with an exponential effect on computation time and memory footprint. In this work, we implement new randomized protocols that find very high quality contraction paths for arbitrary and large tensor networks. We test our methods on a variety of benchmarks, including the random quantum circuit instances recently implemented on Google quantum chips. We find that the paths obtained can be very close to optimal, and often many orders of magnitude better than the most established approaches. As different underlying geometries suit different methods, we also introduce a hyper-optimization approach, where both the method applied and its algorithmic parameters are tuned during the path finding. The increase in quality of contraction schemes found has significant practical implications for the simulation of quantum many-body systems and particularly for the benchmarking of new quantum chips. Concretely, we estimate a speed-up of over 10,000× compared to the original expectation for the classical simulation of the Sycamore 'supremacy' circuits.
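Why the contraction path matters can be seen even in the simplest tensor network, a chain of three matrices. The Python sketch below is a toy illustration and not the method of the paper: the helper `contract` and the matrix shapes are hypothetical, and cost is counted as the number of scalar multiplications.

```python
def contract(a, b):
    """Cost and result shape of multiplying an a[0] x a[1] by a b[0] x b[1] matrix."""
    assert a[1] == b[0], "inner dimensions must match"
    return a[0] * a[1] * b[1], (a[0], b[1])

# Hypothetical shapes for a three-matrix chain A @ B @ C.
A, B, C = (10, 1000), (1000, 10), (10, 1000)

# Path 1: contract (A @ B) first, then multiply by C.
cost_ab, shape_ab = contract(A, B)
cost_ab_c, _ = contract(shape_ab, C)
left_first = cost_ab + cost_ab_c      # 100_000 + 100_000

# Path 2: contract (B @ C) first, then multiply A by the result.
cost_bc, shape_bc = contract(B, C)
cost_a_bc, _ = contract(A, shape_bc)
right_first = cost_bc + cost_a_bc     # 10_000_000 + 10_000_000

print(left_first, right_first, right_first // left_first)
```

Both paths give the same product, yet the second needs 100 times more multiplications. In larger, irregular networks the number of possible paths grows very quickly, which is why path search itself becomes an optimization problem.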

Two-Step Process Identification With Correlation Analysis and Least-Squares Parameter Estimation

Journal of Dynamic Systems Measurement and Control ◽

10.1115/1.3426840 ◽

1974 ◽

Vol 96 (4) ◽

pp. 426-432 ◽

Cited By ~ 24

Author(s):

R. Isermann ◽

U. Bauer

Keyword(s):

Correlation Analysis ◽

Process Model ◽

A Priori ◽

Industrial Process ◽

Computation Time ◽

Small Computer ◽

Nonparametric Model ◽

Identification Method ◽

On Line ◽

Process Computer

An identification method is described which first identifies a linear nonparametric model (cross-correlation function, impulse response) by correlation analysis and then estimates the parameters of a parametric model (discrete transfer function). It also includes a method for detecting the model order and the time delay. The performance, the computational expense and the overall reliability of this method are compared with those of five other identification methods. This two-step identification method, which can be applied off-line or on-line, is especially suited to identification by process computers, since it has the following properties: little a priori knowledge about the structure of the process model is required; the computation time is very short; little computer storage is needed; no initial values of matrices and parameters are necessary; and no divergence is possible for the on-line version. Results of an on-line identification of an industrial process with a process computer are shown.

Parallel-Pipelined Video Processing in Multicore Heterogeneous Systems on Chip

Proceedings of Universities ELECTRONICS ◽

10.24151/1561-5405-2021-26-2-172-183 ◽

2021 ◽

Vol 26 (2) ◽

pp. 172-183

Author(s):

E.S. Yanakova ◽

G.T. Macharadze ◽

L.G. Gagarina ◽

A.A. Shvachko ◽

...

Keyword(s):

Video Processing ◽

High Efficiency ◽

Heterogeneous Systems ◽

Video Data ◽

System Level ◽

Video Information ◽

Systems On Chip ◽

The Face ◽

Parallel Pipelined ◽

On Chip

The shift from homogeneous to heterogeneous architectures yields advantages in efficiency, size, weight and power consumption, which is especially important for built-in solutions. However, developing parallel software for heterogeneous computer systems is a rather complex task because of the requirements of high efficiency, ease of programming and scalability. This paper investigates the efficiency of parallel-pipelined processing of video information in multiprocessor heterogeneous systems on a chip (SoC) that combine units such as DSP, GPU, ISP, VDP, VPU and others. A typical scheme of parallel-pipelined processing of video data using various accelerators is presented. A scheme of parallel-pipelined video data processing on the heterogeneous SoC 1892VM248 has been developed. Methods of efficient parallel-pipelined processing of video data in heterogeneous computers (SoC), covering the operating system level, the programming technologies level and the application level, are proposed. A comparative analysis of the most common programming technologies, such as OpenCL, OpenMP, MPI and OpenAMP, has been performed. The analysis shows that, depending on the intended purpose of the device, two programming paradigms should be applied: one based on OpenCL technology (for built-in systems) and one based on MPI technology (for inter-cell and inter-processor interaction). The results obtained for parallel-pipelined processing in a face recognition task confirm the effectiveness of the chosen solutions.

Simulation-Based Scheduling of Waterway Projects Using a Parallel Genetic Algorithm

Transportation Systems and Engineering ◽

10.4018/978-1-4666-8473-7.ch016 ◽

2015 ◽

pp. 334-347 ◽

Cited By ~ 2

Author(s):

Ning Yang ◽

Shiaaulir Wang ◽

Paul Schonfeld

Keyword(s):

Genetic Algorithm ◽

Parallel Computing ◽

Message Passing ◽

Message Passing Interface ◽

Computation Time ◽

Parallel Genetic Algorithm ◽

Simulation Based ◽

Multiple Processors ◽

Simulation Based Optimization ◽

Speed Up

A Parallel Genetic Algorithm (PGA) is used for a simulation-based optimization of waterway project schedules. This PGA is designed to distribute a Genetic Algorithm application over multiple processors in order to speed up the solution search procedure for a very large combinatorial problem. The proposed PGA is based on a global parallel model, which is also called a master-slave model. A Message-Passing Interface (MPI) is used in developing the parallel computing program. A case study is presented, whose results show how the adaptation of a simulation-based optimization algorithm to parallel computing can greatly reduce computation time. Additional techniques that are found to further improve the PGA performance include: (1) choosing an appropriate task distribution method, (2) distributing simulation replications instead of different solutions, (3) avoiding the simulation of duplicate solutions, (4) avoiding running multiple simulations simultaneously on shared-memory processors, and (5) avoiding the use of multiple processors that belong to different clusters (physical sub-networks).
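The global (master-slave) model and technique (3) can be sketched in Python as follows. This is an illustration under several assumptions, not the authors' program: a thread pool stands in for MPI worker processes, `simulate` is a hypothetical placeholder for the expensive waterway simulation, and the genetic algorithm is reduced to truncation selection plus swap mutation.

```python
import random
from concurrent.futures import ThreadPoolExecutor

def simulate(schedule):
    """Hypothetical stand-in for an expensive simulation; lower cost is better."""
    return sum(abs(project - slot) for slot, project in enumerate(schedule))

def evaluate_population(population, cache, workers=4):
    """Master step: send only not-yet-simulated schedules to the workers."""
    todo = list({s for s in population if s not in cache})  # skip duplicates
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for schedule, cost in zip(todo, pool.map(simulate, todo)):
            cache[schedule] = cost
    return [cache[s] for s in population]

def mutate(schedule, rng):
    """Swap two projects in the schedule."""
    i, j = rng.sample(range(len(schedule)), 2)
    child = list(schedule)
    child[i], child[j] = child[j], child[i]
    return tuple(child)

def run_pga(n_projects=6, pop_size=12, generations=20, seed=0):
    rng = random.Random(seed)
    population = [tuple(rng.sample(range(n_projects), n_projects))
                  for _ in range(pop_size)]
    cache = {}  # schedule -> simulated cost, shared across generations
    for _ in range(generations):
        costs = evaluate_population(population, cache)
        ranked = [s for _, s in sorted(zip(costs, population))]
        parents = ranked[: pop_size // 2]  # the best half survives unchanged
        children = [mutate(rng.choice(parents), rng)
                    for _ in range(pop_size - len(parents))]
        population = parents + children
    costs = evaluate_population(population, cache)
    best_cost, best = min(zip(costs, population))
    return best, best_cost
```

The master keeps the population and the cache, while workers only run simulations, so selection and mutation stay serial. With a real CPU-bound simulation, processes or MPI ranks would replace the threads used here.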

Electronic System Level Design of Heterogeneous Systems: a Motor Speed Control System Case Study

2019 17th IEEE International New Circuits and Systems Conference (NEWCAS) ◽

10.1109/newcas44328.2019.8961289 ◽

2019 ◽

Author(s):

Breytner Fernandez-Mesa ◽

Liliana Andrade ◽

Frederic Perrot

Keyword(s):

Control System ◽

Heterogeneous Systems ◽

Speed Control ◽

Electronic System ◽

System Level ◽

System Level Design ◽

Motor Speed ◽

Electronic System Level ◽

Speed Control System

Parallel computing applied to auto-tuning of state feedback speed controller for PMSM drive

ITM Web of Conferences ◽

10.1051/itmconf/20192801031 ◽

2019 ◽

Vol 28 ◽

pp. 01031

Author(s):

Rafal Szczepanski ◽

Tomasz Tarczewski ◽

Lech M. Grzesiak

Keyword(s):

Parallel Computing ◽

State Feedback ◽

Computation Time ◽

The Other ◽

Speed Controller ◽

Speed Up ◽

Tuning Process ◽

Auto Tuning ◽

Reduce Computation Time ◽

Simulation Time

Nowadays, simulation is an inseparable part of a researcher's work, and its computation time may significantly exceed the experiment time. Multi-core processors, however, make it possible to reduce the overall simulation time through parallel computing. In this paper, parallel computing is used to speed up the auto-tuning process of a state feedback speed controller for a PMSM drive.
