HMDS: A Makespan Minimizing DAG Scheduler for Heterogeneous Distributed Systems

The problem of scheduling Directed Acyclic Graphs in order to minimize makespan ( schedule length ), is known to be a challenging and computationally hard problem. Therefore, researchers have endeavored towards the design of various heuristic solution generation techniques both for homogeneous as well as heterogeneous computing platforms. This work first presents HMDS-Bl , a list-based heuristic makespan minimization algorithm for task graphs on fully connected heterogeneous platforms. Subsequently, HMDS-Bl has been enhanced by empowering it with a low-overhead depth-first branch and bound based search approach, resulting in a new algorithm called HMDS . HMDS has been equipped with a set of novel tunable pruning mechanisms, which allow the designer to obtain a judicious balance between performance ( makespan ) and solution generation times, depending on the specific scenario at hand. Experimental analyses using randomly generated DAGs as well as benchmark task graphs, have shown that HMDS is able to comprehensively outperform state-of-the-art algorithms such as HEFT , PEFT , PPTS , etc., in terms of archived makespans while incurring bounded additional computation time overhead.

Download Full-text

A Tabu Search Approach to the Optimal Sequential Partitions of Directed Acyclic Graphs

IEEJ Transactions on Electronics Information and Systems ◽

10.1541/ieejeiss1987.116.10_1149 ◽

1996 ◽

Vol 116 (10) ◽

pp. 1149-1157 ◽

Cited By ~ 1

Author(s):

Taichi Kaji ◽

Azuma Ohuchi

Keyword(s):

Tabu Search ◽

Directed Acyclic Graphs ◽

Acyclic Graphs ◽

Search Approach

Download Full-text

A tabu search approach to the optimal sequential partitions of directed acyclic graphs

Electrical Engineering in Japan ◽

10.1002/(sici)1520-6416(199706)119:4<42::aid-eej5>3.0.co;2-j ◽

1997 ◽

Vol 119 (4) ◽

pp. 42-51

Author(s):

Taichi Kaji ◽

Azuma Ohuchi

Keyword(s):

Tabu Search ◽

Directed Acyclic Graphs ◽

Acyclic Graphs ◽

Search Approach

Download Full-text

New Method for Bayesian Network Learning

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001419590055 ◽

2018 ◽

Vol 33 (02) ◽

pp. 1959005 ◽

Cited By ~ 1

Author(s):

Y. Benmouna ◽

M. Benazzouz ◽

M. A. Chikh ◽

S. Mahmoudi

Keyword(s):

Computation Time ◽

Directed Acyclic Graphs ◽

New Method ◽

Data Sets ◽

Network Learning ◽

Memory Overhead ◽

Multiple Data ◽

Acyclic Graphs ◽

Multiple Data Sets ◽

Selection Of

This paper presents a new method for learning the structure of Bayesian Networks. Broadly speaking, we leverage the Branch and Bound (B&B) to derive the best Directed Acyclic Graphs (DAGs) that describes the structure of the network. Our contribution consists in introducing two main heuristics: the first one allows the selection of the graph that has the best score among those that contain less cycles, the second one eliminates the shortest cycle from the selected graph; it aims to reduce the number of explored nodes. Our experimental study asserts that the suggested proposal improves the results for multiple data sets. These facts are confirmed by the reduction of the computation time and the memory overhead.

Download Full-text

Task Scheduling in Heterogeneous Multiprocessor Environments – An Efficient ACO-Based Approach

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v10.i1.pp320-329 ◽

2018 ◽

Vol 10 (1) ◽

pp. 320 ◽

Cited By ~ 2

Author(s):

Nekiesha Edward ◽

Jeffrey Elcock

Keyword(s):

Task Scheduling ◽

Heterogeneous Computing ◽

Optimization Technique ◽

Shortest Paths ◽

Directed Acyclic Graphs ◽

Food Sources ◽

Scheduling Problem ◽

Ant Colonies ◽

Acyclic Graphs ◽

Critical Issues

In heterogeneous computing environments, finding optimized solutions continues to be one of the most important and yet, very challenging problems. Task scheduling in such environments is NP-hard, so efficient mapping of tasks to the processors remains one of the most critical issues to be tackled. For several types of applications, the task scheduling problem is crucial, and across the literature, a number of algorithms with several different approaches have been proposed. One such effective approach is known as Ant Colony Optimization (ACO). This popular optimization technique is inspired by the capabilities of ant colonies to find the shortest paths between their nests and food sources. Consequently, we propose an ACO-based algorithm, called rACS, as a solution to the task scheduling problem. Our algorithm utilizes pheromone and a priority-based heuristic, known as the upward rank value, as well as an insertion-based policy and a pheromone aging mechanism to guide the ants to high quality solutions. To evaluate the performance of our algorithm, we compared our algorithm with the ACS algorithm and the ACO-TMS algorithm using randomly generated directed acyclic graphs (DAGs). The simulation results indicated that our algorithm experienced comparable or even better performance, than the selected algorithms.

Download Full-text

AutoShrink: A Topology-Aware NAS for Discovering Efficient Neural Architecture

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6163 ◽

2020 ◽

Vol 34 (04) ◽

pp. 6829-6836

Author(s):

Tunhou Zhang ◽

Hsin-Pai Cheng ◽

Zhenwen Li ◽

Feng Yan ◽

Chengyu Huang ◽

...

Keyword(s):

Cell Structure ◽

Search Time ◽

Building Blocks ◽

Search Space ◽

Directed Acyclic Graphs ◽

Neural Architecture ◽

Cell Structures ◽

Acyclic Graphs ◽

Search Approach ◽

Network Patterns

Resource is an important constraint when deploying Deep Neural Networks (DNNs) on mobile and edge devices. Existing works commonly adopt the cell-based search approach, which limits the flexibility of network patterns in learned cell structures. Moreover, due to the topology-agnostic nature of existing works, including both cell-based and node-based approaches, the search process is time consuming and the performance of found architecture may be sub-optimal. To address these problems, we propose AutoShrink, a topology-aware Neural Architecture Search (NAS) for searching efficient building blocks of neural architectures. Our method is node-based and thus can learn flexible network patterns in cell structures within a topological search space. Directed Acyclic Graphs (DAGs) are used to abstract DNN architectures and progressively optimize the cell structure through edge shrinking. As the search space intrinsically reduces as the edges are progressively shrunk, AutoShrink explores more flexible search space with even less search time. We evaluate AutoShrink on image classification and language tasks by crafting ShrinkCNN and ShrinkRNN models. ShrinkCNN is able to achieve up to 48% parameter reduction and save 34% Multiply-Accumulates (MACs) on ImageNet-1K with comparable accuracy of state-of-the-art (SOTA) models. Specifically, both ShrinkCNN and ShrinkRNN are crafted within 1.5 GPU hours, which is 7.2× and 6.7× faster than the crafting time of SOTA CNN and RNN models, respectively.

Download Full-text