Scalable Shared-Memory Multiprocessing: The Future of High-Performance Computing

Author(s):  
Daniel Lenoski
Author(s):  
Italo Epicoco ◽  
Silvia Mocavero ◽  
Andrew R Porter ◽  
Stephen M Pickles ◽  
Mike Ashworth ◽  
...  

This work describes the introduction of a second level of parallelism based on the OpenMP shared memory paradigm to NEMO, one of the most widely used ocean models in the European climate community. Although the existing parallelisation scheme in NEMO, based on the MPI paradigm, has served it well for many years, it is becoming unsuited to current high-performance computing architectures due to their increasing tendency to have fat nodes containing tens of compute cores. Three different parallel approaches for introducing OpenMP are presented, discussed and compared on several platforms. Finally we have also considered the effect on performance of the data layout employed in NEMO.


Author(s):  
Kim Grover-Haskin

Present day and projected labor demands forecast a need for minds to comprehend in algorithm in order to leverage computing developments for real world problem resolutions. This chapter focuses not so much on solutions to the preparation of the learners and the scientists, but on the future leadership that will advocate and open doors for the high performance computing community to be funded, supported, and practiced. Supercomputing's sustainable future lies in its future of leadership. Studies over the last ten years identify a shift in leadership as the Baby Boomers enter retirement. The talent pool following the Baby Boomers will shrink in numbers between 2010-2020. Women continue to be under represented in IT leadership. This chapter provides information on the talent pool for supercomputing, discusses leadership and organizational culture as influenced by gender, and explores how a mentoring community fosters leaders for the future.


Author(s):  
Domen Verber

A state-of-the-art and a possible future of High Performance Computing (HPC) are discussed. The steady advances in hardware have resulted in increasingly more powerful computers. Some HPC applications that were years ago only in the domain of supercomputers can nowadays be executed on desktop and mobile computers. Furthermore, the future of computers is in the “Internet-of-things” and cyber-physical systems. There, computers are embedded into the devices such as cars, house appliances, production lines, into our clothing, etc. They are interconnected with each other and they may cooperate. Based on that, a new kind of application emerges, which requires the HPC architectures and development techniques. The primary focus of the chapter is on different hardware architectures for HPC and some particularities of HPC programming. Some alternatives to traditional computational models are given. At the end, some replacements for semiconductor technologies of modern computers are debated.


2014 ◽  
Vol 539 ◽  
pp. 429-433
Author(s):  
Lin Feng Zhang ◽  
Lei Chen

MPI is one of the important standards in high performance computing. MPI performance is generally focused on collective communications. And FCA (Fabric Collective Accelerator) is a new method accelerating collective communications. Through high performance computing environment testing, this paper mainly analyses the result of FCA with shared memory and without share memory accelerating IBM Platform MPI, FCA's principle and integration between IBM Platform MPI and FCA. At the same time, this paper may be a good reference for high performance computing using FCA.


Sign in / Sign up

Export Citation Format

Share Document