scholarly journals Scheduling in Heterogeneous Distributed Computing Systems Based on Internal Structure of Parallel Tasks Graphs with Meta-Heuristics

2020 ◽  
Vol 10 (18) ◽  
pp. 6611
Author(s):  
Apolinar Velarde Martinez

The problem of scheduling parallel tasks graphs (PTGs) represented by directed acyclic graphs (DAGs) in heterogeneous distributed computing systems (HDCSs) is considered an nondeterministic polynomial time (NP) problem due to the diversity of characteristics and parameters, generally opposed, intended to be optimized. The PTGs are scheduled by a scheduler that determines the best location for the sub-tasks that constitute the PTGs and is responsible for allocating the resources of the HDCS to the sub-tasks of the PTGs. To optimize scheduling and allocations, the scheduler extracts characteristics from the internal structure of the PTGs. The prevailing characteristic in existing research is the critical path (CP), which is limited to providing execution paths of PTGs; considering this limitation, we extend the array method proposed in Velarde, which extracts two additional characteristics to the CP: the layering and the density of the graph for scheduling. These characteristics are represented as integer values of the PTGs to be scheduled; the values obtained from the characteristics are stored in arrays representing populations that are evaluated with the heuristic univariate marginal distribution algorithm (UMDA) and in terms of comparison with the genetic algorithm. With the best allocations produced by the algorithms, two performance parameters are evaluated: makespan and waiting time. The results indicate that when more PTGs characteristics are considered, resource allocations are optimized, and scheduling times are reduced. The results obtained with the heuristic algorithms show that UMDA provides shorter scheduling and allocation times compared with the genetic algorithm; UMDA widely distributes the sub-tasks in the clusters, whereas the genetic algorithm compacts the assignments of the PTGs in the clusters with a longer convergence time that translates into longer scheduling and allocation times. Extensive explanations of these conclusions are provided in this work, based on the conducted experiments.

2019 ◽  
Author(s):  
Jaime Freire de Souza ◽  
Hermes Senger ◽  
Fabricio A. B. Silva

Bag-of-Tasks (BoT) applications are parallel applications composed of independent (i.e., embarrassingly parallel) tasks, which do not communicate with each other, may depend upon one or more input files, and can be executed in any order. BoT applications are very frequent in several scientific areas, and it is the ideal application class for execution on large distributed computing systems composed of hundreds to many thousands of computational resources. This paper focusses on the scalability of BoT applications running on large heterogeneous distributed computing systems organized as a master-slave platform. The results demonstrate that heterogeneous master-slave platforms can achieve higher scalability than homogeneous platforms for the execution of BoT applications, when the computational power of individual nodes in the homogeneous platform is fixed. However, when individual nodes of the homogeneous platform can scale-up, experiments show that master-slave platforms can achieve near linear speedups.


2016 ◽  
Vol 16 (1) ◽  
pp. 69-78
Author(s):  
Altaf Hussain ◽  
Faisal Azam ◽  
Muhammad Sharif ◽  
Mussarat Yasmin ◽  
Sajjad Mohsin

Heterogeneous Distributed Computing Systems (HeDCS) efficiently utilize the heterogeneity of diverse computational resources which are interlinked through high speed networks for executing a group of computing intensive applications. Directed acyclic graphs (DAGs) are usually used to represent these parallel applications with varied computational requirements and constraints. The optimal scheduling of the given set of precedence constrained tasks to available resources is a core concern in HeDCS and is known to be NP Complete problem. Non deterministic nature of application programs and heterogeneous environment are the main challenges in designing, implementing and analyzing phases of task scheduling techniques. A myriad of heuristic and meta-heuristic approaches have been proposed in the literature to solve this complex problem. The basic purpose of this study is to cover ANN based task scheduling strategies in the distributed computing environment perspective. Further existing scheduling heuristics could be classified in a new state of art classification including the description of frequently used parameters in the mentioned scheduling strategies. The flexible and powerful nature of ANN for identifying the data patterns, underlying time and other constraints and learning capabilities have shown to be a promising candidate among other heuristics.Nepal Journal of Science and Technology Vol. 16, No.1 (2015) pp. 69-78


Sign in / Sign up

Export Citation Format

Share Document