Energy-Aware High-Performance Computing: Survey of State-of-the-Art Tools, Techniques, and Environments

The paper presents state of the art of energy-aware high-performance computing (HPC), in particular identification and classification of approaches by system and device types, optimization metrics, and energy/power control methods. System types include single device, clusters, grids, and clouds while considered device types include CPUs, GPUs, multiprocessor, and hybrid systems. Optimization goals include various combinations of metrics such as execution time, energy consumption, and temperature with consideration of imposed power limits. Control methods include scheduling, DVFS/DFS/DCT, power capping with programmatic APIs such as Intel RAPL, NVIDIA NVML, as well as application optimizations, and hybrid methods. We discuss tools and APIs for energy/power management as well as tools and environments for prediction and/or simulation of energy/power consumption in modern HPC systems. Finally, programming examples, i.e., applications and benchmarks used in particular works are discussed. Based on our review, we identified a set of open areas and important up-to-date problems concerning methods and tools for modern HPC systems allowing energy-aware processing.

Download Full-text

Editorial for the third international conference on energy-aware high performance computing

Computer Science - Research and Development ◽

10.1007/s00450-012-0231-3 ◽

2012 ◽

Vol 29 (2) ◽

pp. 95-96

Author(s):

Timo Minartz ◽

Thomas Ludwig

Keyword(s):

High Performance Computing ◽

High Performance ◽

Energy Aware ◽

International Conference ◽

The Third ◽

Performance Computing

Download Full-text

Architecture for the Integration of High Performance Computing Applications in PLM

Volume 2: 27th Computers and Information in Engineering Conference, Parts A and B ◽

10.1115/detc2007-35185 ◽

2007 ◽

Author(s):

Reiner Anderl ◽

Orkun Yaman

Keyword(s):

Data Management ◽

High Performance Computing ◽

High Performance ◽

State Of The Art ◽

Reference Information ◽

Simulation Domain ◽

Architectural Framework ◽

Industrial Context ◽

Performance Computing ◽

Integrate Data

High Performance Computing (HPC) has become ubiquitous for simulations in the industrial context. To identify the requirements for integration of HPC-relevant data and processes a survey has been conducted concerning the German car manufacturers and service and component suppliers. This contribution presents the results of the evaluation and suggests an architecture concept to integrate data and workflows related with CAE and HPC-facilities in PLM. It describes the state of the art of HPC-applications within the simulation domain. Intensive efforts are currently invested on CAE-data management. However, an approach to systematic data management of HPC does not exist. This study states importance of an integrating approach for data management of HPC-applications and develops an architectural framework to implement HPC-data management into the existing PLM landscape. Requirements on key functionalities and interfaces are defined as well as a framework for a reference information model is conceptualized.

Download Full-text

High performance computing RBFN for classification of sizable microarrays

2018 4th International Conference on Recent Advances in Information Technology (RAIT) ◽

10.1109/rait.2018.8389053 ◽

2018 ◽

Author(s):

Ransingh B. Ray ◽

Mukesh Kumar ◽

Santanu K. Rath

Keyword(s):

High Performance Computing ◽

High Performance ◽

Performance Computing

Download Full-text

High-performance computing systems: Status and outlook

Acta Numerica ◽

10.1017/s0962492912000050 ◽

2012 ◽

Vol 21 ◽

pp. 379-474 ◽

Cited By ~ 36

Author(s):

J. J. Dongarra ◽

A. J. van der Steen

Keyword(s):

High Performance Computing ◽

High Performance ◽

State Of The Art ◽

Computing Systems ◽

Future Developments ◽

Steady Growth ◽

Current State ◽

Near Future ◽

Performance Computing ◽

Shed Light

This article describes the current state of the art of high-performance computing systems, and attempts to shed light on near-future developments that might prolong the steady growth in speed of such systems, which has been one of their most remarkable characteristics. We review the different ways devised to speed them up, both with regard to components and their architecture. In addition, we discuss the requirements for software that can take advantage of existing and future architectures.

Download Full-text

Approach towards an energy-aware and energy-efficient high performance computing environment

2011 IEEE 7th International Conference on Intelligent Computer Communication and Processing ◽

10.1109/iccp.2011.6047921 ◽

2011 ◽

Cited By ~ 2

Author(s):

Alexander Kipp ◽

Jia Liu ◽

Tao Jiang ◽

Dmitry Khabi ◽

Yevgeniya Kovalenko ◽

...

Keyword(s):

High Performance Computing ◽

Energy Efficient ◽

High Performance ◽

Computing Environment ◽

Energy Aware ◽

Performance Computing

Download Full-text

Resilient gossip-inspired all-reduce algorithms for high-performance computing: Potential, limitations, and open questions

The International Journal of High Performance Computing Applications ◽

10.1177/1094342018762531 ◽

2018 ◽

Vol 33 (2) ◽

pp. 366-383

Author(s):

Marc Casas ◽

Wilfried N Gansterer ◽

Elias Wimmer

Keyword(s):

Fault Tolerance ◽

High Performance Computing ◽

High Performance ◽

State Of The Art ◽

The State ◽

Reduction Algorithm ◽

Data Corruption ◽

Parallel Reduction ◽

Open Questions ◽

Performance Computing

We investigate the usefulness of gossip-based reduction algorithms in a high-performance computing (HPC) context. We compare them to state-of-the-art deterministic parallel reduction algorithms in terms of fault tolerance and resilience against silent data corruption (SDC) as well as in terms of performance and scalability. New gossip-based reduction algorithms are proposed, which significantly improve the state-of-the-art in terms of resilience against SDC. Moreover, a new gossip-inspired reduction algorithm is proposed, which promises a much more competitive runtime performance in an HPC context than classical gossip-based algorithms, in particular for low accuracy requirements.

Download Full-text

State of the Art and Future Trends in Data Reduction for High-Performance Computing

Supercomputing Frontiers and Innovations ◽

10.14529/jsfi200101 ◽

2020 ◽

Vol 7 (1) ◽

Keyword(s):

High Performance Computing ◽

Data Reduction ◽

High Performance ◽

State Of The Art ◽

Future Trends ◽

Performance Computing

Download Full-text

Reducing energy usage in resource-intensive Java-based scientific applications via micro-benchmark based code refactorings

Computer Science and Information Systems ◽

10.2298/csis180608009l ◽

2019 ◽

Vol 16 (2) ◽

pp. 541-564

Author(s):

Mathias Longo ◽

Ana Rodriguez ◽

Cristian Mateos ◽

Alejandro Zunino

Keyword(s):

Machine Learning ◽

High Performance Computing ◽

Computer Simulations ◽

High Performance ◽

Machine Learning Algorithms ◽

Energy Usage ◽

Energy Aware ◽

Scientific Application ◽

Performance Computing ◽

Reducing Energy

In-silico research has grown considerably. Today?s scientific code involves long-running computer simulations and hence powerful computing infrastructures are needed. Traditionally, research in high-performance computing has focused on executing code as fast as possible, while energy has been recently recognized as another goal to consider. Yet, energy-driven research has mostly focused on the hardware and middleware layers, but few efforts target the application level, where many energy-aware optimizations are possible. We revisit a catalog of Java primitives commonly used in OO scientific programming, or micro-benchmarks, to identify energy-friendly versions of the same primitive. We then apply the micro-benchmarks to classical scientific application kernels and machine learning algorithms for both single-thread and multi-thread implementations on a server. Energy usage reductions at the micro-benchmark level are substantial, while for applications obtained reductions range from 3.90% to 99.18%.

Download Full-text