E-BaTS: Energy-Aware Scheduling for Bag-of-Task Applications in HPC Clusters

High-Performance Computing (HPC) systems consume large amounts of energy. As the energy consumption predictions for HPC show increasing numbers, it is important to make users aware of the energy spent for the execution of their applications. Drawing from our experience with exposing cost and performance in public clouds, in this paper we present a generic mechanism to compute fast and accurate estimates for the tradeoffs between the performance (expressed as makespan) and the energy consumption of applications running on HPC clusters. We validate our approach by implementing it in a prototype, called E-BaTS and validating it with a wide variety of HPC bags-of-tasks. Our experiments show that E-BaTS produces conservative estimates with errors below 5%, while requiring at most 12% of the energy and time of an exhaustive search for providing configurations close to the optimal ones in terms of trade-offs between energy consumption and makespan.

Download Full-text

Editorial for the third international conference on energy-aware high performance computing

Computer Science - Research and Development ◽

10.1007/s00450-012-0231-3 ◽

2012 ◽

Vol 29 (2) ◽

pp. 95-96

Author(s):

Timo Minartz ◽

Thomas Ludwig

Keyword(s):

High Performance Computing ◽

High Performance ◽

Energy Aware ◽

International Conference ◽

The Third ◽

Performance Computing

Download Full-text

Approach towards an energy-aware and energy-efficient high performance computing environment

2011 IEEE 7th International Conference on Intelligent Computer Communication and Processing ◽

10.1109/iccp.2011.6047921 ◽

2011 ◽

Cited By ~ 2

Author(s):

Alexander Kipp ◽

Jia Liu ◽

Tao Jiang ◽

Dmitry Khabi ◽

Yevgeniya Kovalenko ◽

...

Keyword(s):

High Performance Computing ◽

Energy Efficient ◽

High Performance ◽

Computing Environment ◽

Energy Aware ◽

Performance Computing

Download Full-text

Adaptive estimation and prediction of power and performance in high performance computing

Computer Science - Research and Development ◽

10.1007/s00450-010-0125-1 ◽

2010 ◽

Vol 25 (3-4) ◽

pp. 177-186 ◽

Cited By ~ 5

Author(s):

Reza Zamani ◽

Ahmad Afsahi

Keyword(s):

High Performance Computing ◽

High Performance ◽

Adaptive Estimation ◽

Estimation And Prediction ◽

And Performance ◽

Performance Computing

Download Full-text

Design and performance measurement of a high-performance computing cluster

2012 IEEE International Instrumentation and Measurement Technology Conference Proceedings ◽

10.1109/i2mtc.2012.6229359 ◽

2012 ◽

Cited By ~ 2

Author(s):

Kiran George ◽

Vivek Venugopal

Keyword(s):

Performance Measurement ◽

High Performance Computing ◽

High Performance ◽

And Performance ◽

High Performance Computing Cluster ◽

Performance Computing ◽

Computing Cluster

Download Full-text

Editorial: High-performance computing system architectures: design and performance

IET Computers & Digital Techniques ◽

10.1049/iet-cdt.2012.0114 ◽

2012 ◽

Vol 6 (5) ◽

pp. 257-258

Author(s):

N. Bagherzadeh ◽

H. Sarbazi-Azad

Keyword(s):

High Performance Computing ◽

High Performance ◽

Computing System ◽

System Architectures ◽

High Performance Computing System ◽

And Performance ◽

Performance Computing

Download Full-text

Reducing energy usage in resource-intensive Java-based scientific applications via micro-benchmark based code refactorings

Computer Science and Information Systems ◽

10.2298/csis180608009l ◽

2019 ◽

Vol 16 (2) ◽

pp. 541-564

Author(s):

Mathias Longo ◽

Ana Rodriguez ◽

Cristian Mateos ◽

Alejandro Zunino

Keyword(s):

Machine Learning ◽

High Performance Computing ◽

Computer Simulations ◽

High Performance ◽

Machine Learning Algorithms ◽

Energy Usage ◽

Energy Aware ◽

Scientific Application ◽

Performance Computing ◽

Reducing Energy

In-silico research has grown considerably. Today?s scientific code involves long-running computer simulations and hence powerful computing infrastructures are needed. Traditionally, research in high-performance computing has focused on executing code as fast as possible, while energy has been recently recognized as another goal to consider. Yet, energy-driven research has mostly focused on the hardware and middleware layers, but few efforts target the application level, where many energy-aware optimizations are possible. We revisit a catalog of Java primitives commonly used in OO scientific programming, or micro-benchmarks, to identify energy-friendly versions of the same primitive. We then apply the micro-benchmarks to classical scientific application kernels and machine learning algorithms for both single-thread and multi-thread implementations on a server. Energy usage reductions at the micro-benchmark level are substantial, while for applications obtained reductions range from 3.90% to 99.18%.

Download Full-text

Energy-Aware High-Performance Computing: Survey of State-of-the-Art Tools, Techniques, and Environments

Scientific Programming ◽

10.1155/2019/8348791 ◽

2019 ◽

Vol 2019 ◽

pp. 1-19 ◽

Cited By ~ 4

Author(s):

Pawel Czarnul ◽

Jerzy Proficz ◽

Adam Krzywaniak

Keyword(s):

High Performance Computing ◽

High Performance ◽

Hybrid Methods ◽

State Of The Art ◽

Control Methods ◽

Energy Aware ◽

Power Capping ◽

Power Limits ◽

Performance Computing

The paper presents state of the art of energy-aware high-performance computing (HPC), in particular identification and classification of approaches by system and device types, optimization metrics, and energy/power control methods. System types include single device, clusters, grids, and clouds while considered device types include CPUs, GPUs, multiprocessor, and hybrid systems. Optimization goals include various combinations of metrics such as execution time, energy consumption, and temperature with consideration of imposed power limits. Control methods include scheduling, DVFS/DFS/DCT, power capping with programmatic APIs such as Intel RAPL, NVIDIA NVML, as well as application optimizations, and hybrid methods. We discuss tools and APIs for energy/power management as well as tools and environments for prediction and/or simulation of energy/power consumption in modern HPC systems. Finally, programming examples, i.e., applications and benchmarks used in particular works are discussed. Based on our review, we identified a set of open areas and important up-to-date problems concerning methods and tools for modern HPC systems allowing energy-aware processing.

Download Full-text

Comparative Study of Runtime Systems for Energy-Aware High-Performance Computing

Handbook of Energy-Aware and Green Computing, Volume 2 ◽

10.1201/b11640-9 ◽

2013 ◽

pp. 85-106

Keyword(s):

Comparative Study ◽

High Performance Computing ◽

High Performance ◽

Runtime Systems ◽

Energy Aware ◽

Performance Computing

Download Full-text

Flood Prediction Model Simulation With Heterogeneous Trade-Offs In High Performance Computing Framework

ECMS 2015 Proceedings edited by: Valeri M. Mladenov, Petia Georgieva, Grisha Spasov, Galidiya Petrova ◽

10.7148/2015-0115 ◽

2015 ◽

Cited By ~ 3

Author(s):

Antonio Portero ◽

Radim Vavrik ◽

Stepan Kuchar ◽

Martin Golasowski ◽

Vit Vondrak ◽

...

Keyword(s):

Prediction Model ◽

High Performance Computing ◽

High Performance ◽

Model Simulation ◽

Flood Prediction ◽

Trade Offs ◽

Computing Framework ◽

Performance Computing

Download Full-text

Accurate Energy and Performance Prediction for Frequency-Scaled GPU Kernels

Computation ◽

10.3390/computation8020037 ◽

2020 ◽

Vol 8 (2) ◽

pp. 37

Author(s):

Kaijie Fan ◽

Biagio Cosenza ◽

Ben Juurlink

Keyword(s):

Energy Consumption ◽

Performance Prediction ◽

High Performance ◽

Pareto Set ◽

Large Set ◽

Balance Performance ◽

Multi Objective ◽

Dynamic Voltage ◽

And Performance ◽

Performance Computing

Energy optimization is an increasingly important aspect of today’s high-performance computing applications. In particular, dynamic voltage and frequency scaling (DVFS) has become a widely adopted solution to balance performance and energy consumption, and hardware vendors provide management libraries that allow the programmer to change both memory and core frequencies manually to minimize energy consumption while maximizing performance. This article focuses on modeling the energy consumption and speedup of GPU applications while using different frequency configurations. The task is not straightforward, because of the large set of possible and uniformly distributed configurations and because of the multi-objective nature of the problem, which minimizes energy consumption and maximizes performance. This article proposes a machine learning-based method to predict the best core and memory frequency configurations on GPUs for an input OpenCL kernel. The method is based on two models for speedup and normalized energy predictions over the default frequency configuration. Those are later combined into a multi-objective approach that predicts a Pareto-set of frequency configurations. Results show that our approach is very accurate at predicting extema and the Pareto set, and finds frequency configurations that dominate the default configuration in either energy or performance.

Download Full-text