Symbiosis: Network-aware task scheduling in data-parallel frameworks

Author(s): Jingjie Jiang ◽ Shiyao Ma ◽ Bo Li ◽ Baochun Li

2010 ◽ Vol 14 (2) ◽ pp. 183-197
Author(s): Hyuck Han ◽ Hyungsoo Jung ◽ Hyeonsang Eom ◽ Heon Y. Yeom

Electronics ◽ 2021 ◽ Vol 10 (16) ◽ pp. 1874
Author(s): Yao Zhao ◽ Jian Dong ◽ Hongwei Liu ◽ Jin Wu ◽ Yanxin Liu

Directed acyclic graph (DAG)-aware task scheduling algorithms have been studied extensively in recent years and have achieved significant performance improvements on data-parallel analytics platforms. However, current DAG-aware task scheduling algorithms, among which HEFT and GRAPHENE are notable, pay little attention to the cache management policy, which plays a vital role in in-memory data-parallel systems such as Spark. Cache management policies designed for Spark perform poorly under DAG-aware task scheduling, leading to cache misses and performance degradation. In this study, we propose a new cache management policy, Long-Running Stage Set First (LSF), which makes full use of task dependencies to optimize cache management under DAG-aware scheduling algorithms. LSF computes the caching and prefetching priorities of resilient distributed datasets (RDDs) according to their unprocessed workloads and their significance in parallel scheduling, both of which are key factors in DAG-aware scheduling. Moreover, we present a cache-aware task scheduling algorithm based on LSF that reduces resource fragmentation during computation. Experiments demonstrate that, compared to DAG-aware scheduling algorithms with LRU and MRD, the same algorithms with LSF reduce the job completion time (JCT) by up to 42% and 30%, respectively. The proposed cache-aware scheduling algorithm also achieves about a 12% reduction in average JCT compared to GRAPHENE with LSF.
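The core idea of the abstract, prioritizing cached RDDs by the unprocessed workload of the stages that still depend on them, can be illustrated with a minimal sketch. The names below (`Stage`, `cache_priority`, `pick_eviction_victim`) and the runtime-estimate fields are assumptions for illustration, not the paper's actual implementation; a real system would derive stage runtimes and dependencies from the DAG scheduler.

```python
# Illustrative sketch of an LSF-style cache priority: an RDD that still
# feeds long-running, unfinished stages should stay cached; an RDD whose
# remaining dependent workload is small is the eviction candidate.
# All names and fields here are hypothetical, not from the paper.

from dataclasses import dataclass


@dataclass
class Stage:
    est_runtime: float   # estimated runtime of the stage, in seconds
    done: bool = False   # whether the stage has finished


def cache_priority(consumers):
    """Total unprocessed workload of the stages that still need this RDD.
    Higher priority => keep the RDD in cache longer."""
    return sum(s.est_runtime for s in consumers if not s.done)


def pick_eviction_victim(cache):
    """cache: dict mapping rdd_id -> list of consumer Stages.
    Evict the RDD with the smallest remaining dependent workload."""
    return min(cache, key=lambda rdd: cache_priority(cache[rdd]))
```

For example, an RDD still needed by stages worth 420 s of remaining work outranks one whose consumers are nearly finished, so `pick_eviction_victim` evicts the latter. This contrasts with LRU, which ignores the DAG entirely and evicts by recency.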


2006 ◽ Author(s): Patrice D. Tremoulet ◽ Kathleen M. Stibler ◽ Patrick Craven ◽ Joyce Barton ◽ Adam Gifford ◽ ...

Author(s): Shailendra Raghuvanshi ◽ Priyanka Dubey

Load balancing of non-preemptive independent tasks on virtual machines (VMs) is an important aspect of task scheduling in clouds. Whenever certain VMs are overloaded and the remaining VMs are underloaded, the load has to be rebalanced to achieve optimal machine utilization. In this paper, we propose an algorithm named honey-bee-behavior-inspired load balancing, which aims to achieve a well-balanced load across virtual machines and thereby maximize throughput. The proposed algorithm also balances the priorities of tasks on the machines so that the waiting time of tasks in the queue is minimized. We compared the proposed algorithm with existing load balancing and scheduling algorithms. The experimental results, obtained with the WorkflowSim simulator in Java, show a significant improvement in average execution time and a reduction in the waiting time of queued tasks compared with existing algorithms.
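The honey-bee analogy described above, where tasks removed from overloaded VMs behave like scout bees that settle on underloaded VMs, can be sketched as follows. This is a minimal illustration under assumed names (`balance`, a load `threshold` of 1.2x the average); it omits the paper's priority handling and is not the authors' actual Java/WorkflowSim implementation.

```python
# Hypothetical sketch of honey-bee-inspired load balancing: while a VM's
# load exceeds a threshold above the average, pop a task from it (the
# "scout bee") and place it on the currently least-loaded VM.

from statistics import mean


def balance(vms, threshold=1.2):
    """vms: dict mapping vm_id -> list of task loads (e.g. run times).
    Mutates and returns vms with overloaded VMs drained toward the
    least-loaded VM. threshold scales the average load that triggers
    a migration (an assumed tuning knob, not from the paper)."""
    avg = mean(sum(tasks) for tasks in vms.values())
    for vm_id, tasks in vms.items():
        # Keep shedding tasks while this VM stays above the threshold.
        while tasks and sum(tasks) > threshold * avg:
            task = tasks.pop()  # removed task acts as a scout bee
            target = min(vms, key=lambda v: sum(vms[v]))
            vms[target].append(task)
    return vms
```

For instance, starting from loads `{"vm1": [5, 5, 5, 5], "vm2": [2]}` (sums 20 and 2 against an average of 11), two tasks migrate from `vm1` to `vm2`, leaving sums of 10 and 12, both under the 13.2 threshold. The total load is conserved; only its placement changes.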

