Thermal and Mechanical Design of the Fastest Supercomputer of the World in Cognitive Systems: IBM POWER AC 922

ASME 2019 International Technical Conference and Exhibition on Packaging and Integration of Electronic and Photonic Microsystems ◽

10.1115/ipack2019-6444 ◽

2019 ◽

Cited By ~ 2

Author(s):

Anil Yuksel ◽

Vic Mahaney ◽

Chris Marroquin ◽

Shurong Tian ◽

Mark Hoffmeyer ◽

...

Keyword(s):

High Performance ◽

Mechanical Design ◽

Cognitive Systems ◽

Design Strategies ◽

Electronic Components ◽

New Era ◽

The World ◽

Computing Performance ◽

Petaflops Computing ◽

Performance Computing

Abstract: High performance computing (HPC), artificial intelligence (AI), and cognitive systems have initiated a new era of computing. Efficient thermal management of these systems has become vital because of the increasing power density of their electronic components. In 2018, IBM delivered Summit, the fastest supercomputer in the world, with 200 petaflops of computing performance on LINPACK benchmarks. The system is both air and water cooled; water cools the components with the highest power dissipation, namely the IBM POWER9 processors and NVIDIA GPUs. In this paper, we give an overview of the thermal and mechanical design strategies applied to these systems. For the air-cooled systems, we discuss the fan and heat sink designs, as well as the preheating effect on the PCI section. The liquid-cooled system has a unique coldplate design that cools the processors and the GPUs with water. We examine the water flow path design for the processor and the GPUs and report the thermal performance of the coldplate. An overview of cooling assemblies in the servers, such as TIMs and air baffles, is also given. Moreover, unit and rack manifolds are investigated, and flow and pressure distributions at the node and rack level are provided.

An Overview of Thermal and Mechanical Design, Control, and Testing of the World's Most Powerful and Fastest Supercomputer

Journal of Electronic Packaging ◽

10.1115/1.4046847 ◽

2020 ◽

Vol 143 (1) ◽

Author(s):

Anil Yuksel ◽

Vic Mahaney ◽

Chris Marroquin ◽

Shurong Tian ◽

Mark Hoffmeyer ◽

...

Keyword(s):

Water Flow ◽

Graphics Processing Units ◽

High Performance ◽

Mechanical Design ◽

Control Strategies ◽

Thermal Control ◽

Cold Plate ◽

Design Strategies ◽

Electronic Components ◽

Central Processing

Abstract: A new era of computing has begun with the development of high-performance computing (HPC), artificial intelligence (AI), machine learning (ML), and cognitive systems. Dramatic increases in the power density of electronic components have driven the design of efficient thermal management technologies for these systems. In 2018, IBM designed and delivered the most powerful and fastest supercomputers in the world, Summit and Sierra, with 200 petaflops of peak computing performance on LINPACK benchmarks. These systems, called IBM POWER AC922, are both air and liquid cooled; in the liquid-cooled systems, water cools the high-power electronic components, including the IBM POWER9 processors and NVIDIA graphics processing units (GPUs). In this paper, we give an overview of the thermal and mechanical design strategies applied to these systems. Testing and experimental analysis are provided and compared with computational modeling. Thermal control strategies are investigated to optimize overall system efficiency. For the air-cooled systems, we discuss the fan and heat sink designs, as well as the preheating effect on the PCIe section. For the liquid-cooled systems, which have a unique cold plate design that cools the processors and the GPUs with water, we examine the water flow path design for the central processing units (CPUs) and the GPUs, and the thermal performance of the cold plate. An overview of the cooling assemblies in these systems, such as TIMs and air baffles, is given. Unit and rack manifolds and the rear door heat exchanger (RDHx) are investigated. Water flow and pressure distributions at the node and rack level are provided.

News & Trends - Is high-performance computing entering a new era?

IEEE Internet Computing ◽

10.1109/mic.2004.1273479 ◽

2004 ◽

Vol 8 (2) ◽

pp. 9-11

Author(s):

G. Goth

Keyword(s):

High Performance Computing ◽

High Performance ◽

New Era ◽

Performance Computing

On Construction of a Diskless Cluster Computing Environment in a Computer Classroom

International Journal of Grid and High Performance Computing ◽

10.4018/jghpc.2012100105 ◽

2012 ◽

Vol 4 (4) ◽

pp. 68-88

Author(s):

Chao-Tung Yang ◽

Wen-Feng Hsieh

Keyword(s):

High Performance ◽

Cluster Computing ◽

Relevant Information ◽

Computing Environment ◽

Cluster Architecture ◽

Computer Classroom ◽

Computation Node ◽

Cluster Environment ◽

Computing Performance ◽

Performance Computing

This paper implements and evaluates a high-performance computing environment that clusters idle campus PCs (personal computers) as diskless slave nodes in order to make the most of their combined computing capacity. Two cluster platforms, BCCD and DRBL, are used to compare computing performance, and DRBL is shown to perform better than BCCD in this experiment. DRBL was originally created to support instruction on a Free Software teaching platform. To that end, DRBL is deployed in a computer classroom with 32 PCs so that the PCs can be switched manually or automatically among different operating systems (OS). The bioinformatics program mpiBLAST also runs smoothly on the cluster architecture. For management purposes, the state of each computation node in the clusters is monitored with “Ganglia”, an existing open-source tool. The authors gather the CPU, memory, and network load information for each computation node in every network section. By comparing several aspects of performance, including swap performance and different network environments, they attempt to identify the best cluster environment for a school computer classroom. Finally, HPL from HPCC is used to demonstrate cluster performance.

ThunderX2 Performance and Energy-Efficiency for HPC Workloads

Computation ◽

10.3390/computation8010020 ◽

2020 ◽

Vol 8 (1) ◽

pp. 20 ◽

Cited By ~ 1

Author(s):

Enrico Calore ◽

Alessandro Gabbana ◽

Sebastiano Fabio Schifano ◽

Raffaele Tripiccione

Keyword(s):

Energy Efficiency ◽

High Performance ◽

Data Centers ◽

Parallel Systems ◽

High Energy ◽

Building Systems ◽

Computing Performance ◽

High Energy Efficiency ◽

Power And Energy ◽

Performance Computing

In recent years, the energy efficiency of HPC systems has become of paramount importance for environmental, technical, and economic reasons. Several projects have investigated the use of different processors and accelerators in the quest to build systems able to achieve high energy efficiency levels for data centers and HPC installations. In this context, the Arm CPU architecture has received a lot of attention given its wide use in low-power and energy-limited applications, but server-grade processors have appeared on the market only recently. In this study, we targeted the Marvell ThunderX2, one of the latest Arm-based processors developed to fit the requirements of high performance computing applications. Our interest is mainly in its assessment in the context of large HPC installations, and thus we evaluated both computing performance and energy efficiency, using the ERT benchmark and two production-ready HPC applications. Finally, we compared the results with other processors commonly used in large parallel systems and highlighted the characteristics of applications that could benefit from the ThunderX2 architecture in terms of both computing performance and energy efficiency. To this end, we also describe how ERT has been modified and optimized for the ThunderX2, and how to monitor power drain while running applications on this processor.

Evaluating high-level design strategies on FPGAs for high-performance computing

2017 27th International Conference on Field Programmable Logic and Applications (FPL) ◽

10.23919/fpl.2017.8056756 ◽

2017 ◽

Author(s):

Artur Podobas ◽

Hamid Reza Zohouri ◽

Naoya Maruyama ◽

Satoshi Matsuoka

Keyword(s):

High Performance Computing ◽

High Performance ◽

Design Strategies ◽

Level Design ◽

High Level ◽

Performance Computing

Using the World Wide Web to provide a platform independent interface to high performance computing

Digest of Papers COMPCON 95 Technologies for the Information Superhighway CMPCON-95 ◽

10.1109/cmpcon.1995.512355 ◽

2002 ◽

Author(s):

D.W. Robertson ◽

W.E. Johnston

Keyword(s):

World Wide Web ◽

High Performance Computing ◽

High Performance ◽

World Wide ◽

The World ◽

Performance Computing

Evaluating high-level design strategies on FPGAs for high-performance computing

2017 27th International Conference on Field Programmable Logic and Applications (FPL) ◽

10.23919/fpl.2017.8056760 ◽

2017 ◽

Author(s):

Artur Podobas ◽

Hamid Reza Zohouri ◽

Naoya Maruyama ◽

Satoshi Matsuoka

Keyword(s):

High Performance Computing ◽

High Performance ◽

Design Strategies ◽

Level Design ◽

High Level ◽

Performance Computing

Special Issue on Automatic Application Tuning for HPC Architectures

Scientific Programming ◽

10.1155/2014/208480 ◽

2014 ◽

Vol 22 (4) ◽

pp. 259-260 ◽

Cited By ~ 1

Author(s):

Siegfried Benkner ◽

Franz Franchetti ◽

Hans Michael Gerndt ◽

Jeffrey K. Hollingsworth

Keyword(s):

High Performance ◽

Research Field ◽

Performance Tuning ◽

Full Potential ◽

Research Groups ◽

Special Issue ◽

The World ◽

And Performance ◽

Application Tuning ◽

Performance Computing

High Performance Computing architectures have become incredibly complex and exploiting their full potential is becoming more and more challenging. As a consequence, automatic performance tuning (autotuning) of HPC applications is of growing interest and many research groups around the world are currently involved. Autotuning is still a rapidly evolving research field with many different approaches being taken. This special issue features selected papers presented at the Dagstuhl seminar on “Automatic Application Tuning for HPC Architectures” in October 2013, which brought together researchers from the areas of autotuning and performance analysis in order to exchange ideas and steer future collaborations.

CONTENT OF A SPECIAL COURSE ON HIGH PERFORMANCE COMPUTING

Bulletin Series of Physics & Mathematical Sciences ◽

10.51889/2020-3.1728-7901.40 ◽

2020 ◽

Vol 71 (3) ◽

pp. 263-267

Author(s):

M. Serik ◽

G. Zh. Yerlanova

Keyword(s):

Information Technology ◽

High Performance Computing ◽

Computer Technology ◽

High Performance ◽

Practical Importance ◽

Modern Society ◽

Educational Process ◽

Dynamic Development ◽

The World ◽

Performance Computing

At present, along with the dynamic development of computer technology in the world, the most effective ways of solving problems of practical importance are being considered, and high performance computing takes the lead in this. The development of modern society is therefore closely related to the training of experienced, modern specialists in the field of information technology. This, in turn, depends on the inclusion of new courses in the curriculum and full coverage of these issues in the content of the courses taught. This article analyzes the courses on high performance computing taught at experimental bases and abroad. On this basis, it determines the topics of the special course and the content recommended for implementation in the educational process. During the training, the competencies of students in high performance computing were identified.

Scientific Grid computing

Philosophical Transactions of The Royal Society A Mathematical Physical and Engineering Sciences ◽

10.1098/rsta.2005.1632 ◽

2005 ◽

Vol 363 (1833) ◽

pp. 1707-1713 ◽

Cited By ~ 22

Author(s):

Peter V Coveney

Keyword(s):

Grid Computing ◽

High Performance ◽

World Wide ◽

Computational Grids ◽

Grid Infrastructure ◽

Theme Issue ◽

Computational Steering ◽

The World ◽

Definition Of ◽

Performance Computing

We introduce a definition of Grid computing which is adhered to throughout this Theme Issue. We compare the evolution of the World Wide Web with current aspirations for Grid computing and indicate areas that need further research and development before a generally usable Grid infrastructure becomes available. We discuss work that has been done in order to make scientific Grid computing a viable proposition, including the building of Grids, middleware developments, computational steering and visualization. We review science that has been enabled by contemporary computational Grids, and associated progress made through the widening availability of high performance computing.