APPLICATION OF PERFORMANCE REDUCTION METHODS FOR MINIMIZATION OF ANALYZED NUMBER OF PARALLEL PROGRAM VARIANTS

Author(s):  
A. I. Dordopulo

In this paper, we review and compare methods of parallel application development based on automatic program parallelization for computer systems with shared and distributed memory, and on reduction of the hardware costs and performance of the information graph for reconfigurable computer systems. As the number of computational units or the dimension of the problem grows, the complexity of automatically parallelizing a procedural program increases significantly; as a result, obtaining parallelization results in acceptable time on state-of-the-art computer systems is highly problematic. In reconfigurable computer systems, a parallel program is created by reducing the fully parallel information graph of the problem. The information graph describes the parallelization and pipelining of computations. In addition to the traditionally practiced reduction of the number of basic subgraphs, reductions of the number of computational operations and of the data bit width can be used to scale performance or hardware costs. We prove that the methods of reducing the hardware costs and performance of the information graph require considerably fewer steps to adapt a parallel application to reconfigurable computer system architectures than automatic parallelization does. We prove a theorem on the coefficient value under sequential reduction, a theorem on increasing the reduction coefficient to a custom value, and a theorem on the commutativity of different reduction transformations. These theorems help to find a rational sequence of reduction transformations.
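The abstract does not reproduce the theorem statements; a minimal formal sketch, in our own notation rather than the paper's, captures the multiplicative and commutative behavior it describes:

```latex
% Our notation, not the paper's: R_{k} denotes a reduction transformation
% with coefficient k applied to an information graph G.
\[
  (R_{k_n} \circ \dots \circ R_{k_1})(G) = R_{k}(G),
  \qquad k = \prod_{i=1}^{n} k_i ,
\]
% i.e., coefficients of sequential reductions compose multiplicatively,
% never additively, and reductions of different types (by basic subgraphs,
% by computational operations, by bit width) commute:
\[
  R^{\mathrm{sub}} \circ R^{\mathrm{op}} = R^{\mathrm{op}} \circ R^{\mathrm{sub}} .
\]
```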

Author(s):  
И.И. Левин ◽  
А.И. Дордопуло

To solve applied problems whose hardware costs exceed the available computing resource of FPGA-based computer systems, we developed an original technique for mapping the information graph of an application program to the architecture of a reconfigurable computer system. The technique is based on performance reduction methods, which reduce the performance of an applied task and, with it, the hardware costs of its implementation, thereby making the problem solvable on the available computing resource. We demonstrate that the hardware costs of implementing the computing structure decrease only under reduction of the number of basic subgraphs, of the number of computing devices in a basic subgraph, and of the data bit width. The influence of sequential reduction transformations on the computing structure of a problem is examined. We prove theorems on representing the reduction coefficient as a product of the coefficients of successive reductions, on the impossibility of additively increasing the reduction coefficient under sequential reductions, and on the commutativity of superpositions of different reductions. These theorems and their corollaries allow us to formulate the basic principles of a method of reduction transformations of the problem's information graph for adaptation to the architecture of a hybrid reconfigurable computer system. A distinctive feature of the technique is the comparatively small number of transformations required for a balanced reduction of the problem's information graph and for the implementation of calculations on a reconfigurable computer system. For the developed technique, we estimated the maximal number of transformations and found a decrease in the number of analyzed reduction variants from each class. The proposed technique significantly reduces the time needed to create the computational structure of a parallel program adapted to the architecture and configuration of a reconfigurable computer system. Furthermore, it allows this process to be automated with specialized software while providing at least 50-75% of the efficiency of solutions of the same problems developed by specialists.
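As a hedged illustration of how these theorems shape a reduction procedure (the cost model, names, and the fixed coefficient of 2 below are our assumptions, not the authors' software):

```python
# Illustrative sketch (ours, not the authors'): sequential reduction of an
# information graph's hardware cost by the three reduction classes named in
# the abstract: number of basic subgraphs, devices per subgraph, bit width.
from dataclasses import dataclass

@dataclass
class GraphCost:
    subgraphs: int      # number of basic subgraphs
    devices: int        # computing devices per basic subgraph
    bit_width: int      # data bit width

    def hardware_cost(self) -> int:
        # Assumed cost model: cost scales with all three factors.
        return self.subgraphs * self.devices * self.bit_width

def reduce_until_fits(cost: GraphCost, available: int) -> list[str]:
    """Greedily apply coefficient-2 reductions until the graph fits.

    The theorems imply the total reduction coefficient is the product of
    the step coefficients and that different reduction types commute, so
    a greedy order is as good as any other.
    """
    steps = []
    for attr in ("subgraphs", "devices", "bit_width"):
        while cost.hardware_cost() > available and getattr(cost, attr) > 1:
            setattr(cost, attr, getattr(cost, attr) // 2)
            steps.append(f"halve {attr}")
    return steps

# Example: a graph costing 4096 units on a system with 1000 available.
g = GraphCost(subgraphs=8, devices=16, bit_width=32)
print(reduce_until_fits(g, available=1000), g.hardware_cost())
```

Because coefficients multiply and the transformation types commute, such a pass needs only logarithmically many steps, which is consistent with the small number of analyzed variants the technique claims.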


Performance ◽  
1959 ◽  
pp. 1-5:20
Author(s):  
Kenneth J. Lush ◽  
John K. Moakes

2006 ◽  
Vol 21 (3) ◽  
pp. 205-219 ◽  
Author(s):  
RICHARD ANTHONY

This paper presents an empirical investigation of policy-based self-management techniques for parallel applications executing in loosely-coupled environments. The dynamic and heterogeneous nature of these environments is discussed and the special considerations for parallel applications are identified. An adaptive strategy for the run-time deployment of tasks of parallel applications is presented. The strategy is based on embedding numerous policies which are informed by contextual and environmental inputs. The policies govern various aspects of behaviour, enhancing flexibility so that the goals of efficiency and performance are achieved despite high levels of environmental variability. A prototype self-managing parallel application is used as a vehicle to explore the feasibility and benefits of the strategy. In particular, several aspects of stability are investigated. The implementation and behaviour of three policies are discussed and sample results examined.
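The paper gives no code; the sketch below is our own hedged illustration (the node model, thresholds, and names are assumptions) of how one embedded deployment policy might map environmental inputs to behaviour:

```python
# Hedged illustration of the embedded-policy idea: a run-time policy maps
# contextual/environmental inputs to a task-deployment decision for a
# parallel application in a loosely-coupled, variable environment.
from dataclasses import dataclass

@dataclass
class NodeState:
    name: str
    load: float          # normalized CPU load, 0..1
    latency_ms: float    # observed network latency to this node

def deployment_policy(nodes: list[NodeState],
                      max_load: float = 0.8,
                      max_latency_ms: float = 50.0) -> list[NodeState]:
    """Select nodes eligible for new tasks under the current environment.

    A stability-oriented rule: prefer lightly loaded, low-latency nodes,
    and fall back to the least-loaded node if none qualifies, so the
    application keeps making progress despite environmental variability.
    """
    eligible = [n for n in nodes
                if n.load < max_load and n.latency_ms < max_latency_ms]
    if not eligible:  # degraded environment: fall back rather than stall
        eligible = [min(nodes, key=lambda n: n.load)]
    return sorted(eligible, key=lambda n: (n.load, n.latency_ms))

nodes = [NodeState("a", 0.9, 12.0), NodeState("b", 0.3, 40.0),
         NodeState("c", 0.5, 90.0)]
print([n.name for n in deployment_policy(nodes)])  # -> ['b']
```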


Author(s):  
Dominik Strzałka

The problem of modeling different parts of computer systems requires accurate statistical tools. Cache memory is an inherent part of modern computer systems, in which the hierarchical memory structure plays a key role in the behavior and performance of the whole system. In the case of Windows operating systems, the cache is the place in the memory subsystem where the I/O system puts recently used data from disk. This paper presents some preliminary results on the statistical behavior of one selected system counter. The obtained results show that the real phenomena that appear during human-computer interaction can be expressed in terms of non-extensive statistics, related to Tsallis' proposal of a new entropy definition.
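For reference, the standard discrete form of the Tsallis entropy the abstract alludes to can be sketched as follows; the distribution and q values are illustrative only:

```python
# Minimal sketch of the (standard) discrete Tsallis entropy, with the
# Boltzmann constant set to 1 and q the non-extensivity parameter:
#   S_q = (1 - sum_i p_i**q) / (q - 1),  recovering Shannon entropy as q -> 1
import math

def tsallis_entropy(p: list[float], q: float) -> float:
    if abs(q - 1.0) < 1e-12:                 # limit q -> 1: Shannon entropy
        return -sum(pi * math.log(pi) for pi in p if pi > 0)
    return (1.0 - sum(pi ** q for pi in p)) / (q - 1.0)

dist = [0.7, 0.2, 0.1]                 # e.g., a normalized counter histogram
print(tsallis_entropy(dist, q=1.0))    # ~0.802 (Shannon, in nats)
print(tsallis_entropy(dist, q=1.5))    # non-extensive case
```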


Author(s):  
Zhengjing Shen ◽  
Wuli Chu

Sediment erosion is recognized as a serious engineering problem in slurry handling, for example in screw centrifugal pumps, which have a wide efficiency region and non-clogging performance. In the present study, a screw centrifugal pump was simulated based on the Euler-Lagrange method. The McLaury model was adopted for erosion prediction of the flow passage components. By analyzing the correlation factor functions contained in the erosion model and performing preliminary research with a simplified model, particle velocity, particle shape factor, and particle concentration were selected as the influencing factors for analyzing the quantitative relationship among particle parameters, erosion wear, and the performance of the screw centrifugal pump. The results show that the erosion of the volute casing is higher than that of the impeller, and the erosion rate of the suction side is higher than that of the pressure side. Particle velocity is positively correlated with erosion wear and with the rate of pump performance reduction, whereas an increase in particle shape factor shows the opposite trend. The erosion rate increases sharply and then slowly as particle concentration increases, because the adhesion effect of sand particles in the volute casing inhibits the total erosion wear. An increase in erosion rate accelerates the reduction of pump performance, and pump efficiency decreases more significantly once the erosion rate grows beyond a certain extent. The results of this study are of great significance for further optimization of the hydraulic and structural design of screw centrifugal pumps.
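The paper's model constants are not given here; the sketch below shows a McLaury-type erosion correlation in the general form commonly implemented in CFD codes, with placeholder constants that are our assumptions, not the paper's values:

```python
# Hedged sketch of a McLaury-type erosion correlation (constants below are
# placeholders, not the paper's calibrated values):
#   ER = C * BH**(-0.59) * Fs * V**n * f(theta)
# where BH is wall Brinell hardness, Fs a particle shape factor (~1.0 sharp,
# ~0.2 fully rounded), V particle impact velocity, theta impact angle, and
# f(theta) an empirical impact-angle function.
import math

def impact_angle_function(theta_rad: float) -> float:
    # Placeholder piecewise angle function of the typical McLaury form:
    # polynomial at shallow angles, trigonometric at steep angles.
    theta_lim = math.radians(15.0)
    if theta_rad <= theta_lim:
        return 22.7 * theta_rad - 38.4 * theta_rad ** 2
    return (3.147 * math.cos(theta_rad) ** 2 * math.sin(theta_rad)
            + 0.3609 * math.sin(theta_rad) ** 2 + 2.532)

def erosion_ratio(v: float, theta_rad: float, fs: float,
                  bh: float = 120.0, c: float = 2.4e-7,
                  n: float = 1.73) -> float:
    """Mass of wall material removed per mass of impacting particles."""
    return c * bh ** (-0.59) * fs * v ** n * impact_angle_function(theta_rad)

# The velocity exponent n (typically ~1.7-2.4) explains the strong positive
# correlation between particle velocity and wear reported in the abstract.
print(erosion_ratio(v=15.0, theta_rad=math.radians(30.0), fs=0.5))
```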


2013 ◽  
Vol 347-350 ◽  
pp. 2747-2751 ◽  
Author(s):  
Zhi Ming Feng ◽  
Yi Dan Su

Item-item collaborative filtering is widely used in item recommender systems because of its good recommendation quality. However, when facing a large number of items, its performance degrades, because a very large item-comparison dataset must be built to find similar items. K-means clustering classifies well and performs well even when the processed dataset is very large, but cold start is a problem for k-means, and extra work is required to use it for item recommendation. We use simulated annealing to combine the two methods, fixing the problems mentioned above while exploiting their respective advantages, to obtain better recommendation quality and performance. The experimental results show that using simulated annealing to combine clustering and collaborative filtering in an item recommendation system yields more stable recommendation results of better quality.
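The paper's exact algorithm is not reproduced here; a minimal sketch under our own assumptions (the energy function, move set, and constants are ours) shows how simulated annealing can drive the clustering that the item-item similarity search is then restricted to:

```python
# Hedged sketch: simulated annealing over k-means-style cluster assignments,
# so item-item similarities need only be computed within a cluster (shrinking
# the comparison dataset) while annealing escapes poor cold-start initializations.
import math
import random

import numpy as np

def energy(items: np.ndarray, assign: np.ndarray, k: int) -> float:
    # k-means objective: total within-cluster scatter (lower is better).
    return float(sum(
        np.sum((items[assign == c] - items[assign == c].mean(axis=0)) ** 2)
        for c in range(k) if np.any(assign == c)))

def anneal(items: np.ndarray, k: int, t0: float = 1.0, cooling: float = 0.95,
           steps: int = 2000, seed: int = 0) -> np.ndarray:
    rng = random.Random(seed)
    assign = np.array([rng.randrange(k) for _ in range(len(items))])
    e, t = energy(items, assign, k), t0
    for _ in range(steps):
        i, c = rng.randrange(len(items)), rng.randrange(k)  # random move
        old = assign[i]
        assign[i] = c
        e_new = energy(items, assign, k)
        # Metropolis criterion: accept worse moves with probability exp(-dE/T).
        if e_new <= e or rng.random() < math.exp((e - e_new) / max(t, 1e-9)):
            e = e_new
        else:
            assign[i] = old
        t *= cooling
    return assign

# Toy item feature vectors (rows = items); recommendations would then compare
# only items that share a cluster label.
items = np.array([[5, 0], [4, 1], [0, 5], [1, 4], [5, 1], [0, 4]], float)
print(anneal(items, k=2))
```

Restricting the item-item similarity computation to items sharing a cluster label is what removes the large comparison dataset the abstract identifies as the bottleneck.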


1992 ◽  
Vol 1 (1) ◽  
pp. 51-66 ◽  
Author(s):  
Ian Foster ◽  
Robert Olson ◽  
Steven Tuecke

We describe the PCN programming system, focusing on those features designed to improve the productivity of scientists and engineers using parallel supercomputers. These features include a simple notation for the concise specification of concurrent algorithms, the ability to incorporate existing Fortran and C code into parallel applications, facilities for reusing parallel program components, a portable toolkit that allows applications to be developed on a workstation or small parallel computer and run unchanged on supercomputers, and integrated debugging and performance analysis tools. We survey representative scientific applications and identify problem classes for which PCN has proved particularly useful.

