XcalableMP PGAS Programming Language
Latest Publications


TOTAL DOCUMENTS: 10 (five years: 10)

H-INDEX: 0 (five years: 0)

Published by Springer Singapore

ISBN: 9789811576829, 9789811576836

Author(s): Miwako Tsuji, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato, Serge G. Petiton, ...

Abstract: This chapter describes the multi-SPMD (mSPMD) programming model and a set of software and libraries that support it. The mSPMD programming model has been proposed to realize scalable applications on huge, hierarchical systems. It has become evident that simple SPMD programs written in MPI or XMP, or hybrid OpenMP/MPI programs, cannot exploit post-peta- or exascale systems efficiently because of the increasing complexity of both applications and systems. The mSPMD programming model is designed to adopt multiple programming models across the different levels of a system's architecture. Instead of invoking a single parallel program on millions of processor cores, multiple SPMD programs of moderate size work together in the mSPMD programming model. XMP is supported as a component of the mSPMD programming model. Fault-tolerance features, correctness checks, and implementations of some numerical libraries in the mSPMD programming model are also presented.


Author(s): Akihiro Tabuchi, Hitoshi Murai, Masahiro Nakao, Tetsuya Odajima, Taisuke Boku

Abstract: XcalableACC (XACC) is an extension of XcalableMP for accelerated clusters. It is defined as a diagonal integration of XcalableMP and OpenACC, another directive-based language designed for programming heterogeneous CPU/accelerator systems. XACC provides features for handling distributed-memory parallelism (inherited from XMP), for offloading tasks to accelerators (inherited from OpenACC), and two additional functions: data/work mapping among multiple accelerators and direct communication between accelerators.


Author(s): Keisuke Tsugane, Taisuke Boku, Hitoshi Murai, Mitsuhisa Sato, William Tang, ...

Abstract: XcalableMP (XMP) supports a global-view model that allows programmers to define global data and map them to a set of processors, which execute computations on the distributed global data as a single logical thread. XMP also adopts the concept of a coarray for local-view programming. In this study, we port the Gyrokinetic Toroidal Code - Princeton (GTC-P), a three-dimensional gyrokinetic PIC code developed at Princeton University to study the microturbulence phenomenon in magnetically confined fusion plasmas, to XMP as an example of hybrid memory-model coding with the global-view and local-view programming models. In local-view programming, the coarray notation is simple and intuitive compared with Message Passing Interface (MPI) programming, while its performance is comparable to that of the MPI version. Because the global-view model is well suited to expressing the data parallelism of the field's grid-space data, we implement a hybrid-view version that uses the global-view model to compute the field and the local-view model to compute the movement of particles. Its performance is about 20% lower than that of the original MPI version, but the hybrid-view version allows more natural expression of both static grid-space data (in the global-view model) and dynamic particle data (in the local-view model), and it increases the readability of the code for higher productivity.


Author(s): Atsushi Kubota, Tomohiro Matsushita, Naohisa Happo

Abstract: X-ray fluorescence holography is a three-dimensional middle-range local structure analysis method that can provide three-dimensional atomic images around specific elements within a radius of a few nanometers. The atomic images are reconstructed by applying a discrete Fourier transform (DFT) to hologram data, which currently takes a long time. In this study, the DFT program is parallelized using the parallel programming language XcalableMP. The DFT process, whose input is 21 holograms of 179 × 360 points and whose output is a three-dimensional atomic image of 192³ points, is executed on a PC cluster consisting of 8 nodes of Intel Xeon X5660 processors (96 cores in total); we confirmed that the parallelized DFT runs 94 times faster than the sequential execution.


Author(s): Hitoshi Sakagami

Abstract: To make parallel computers convenient, general-purpose tools for computational scientists, a high-level, easy-to-use, and portable parallel programming paradigm is mandatory. XcalableMP, proposed by the XcalableMP Specification Working Group, is a directive-based language extension for Fortran and C that makes it easy to describe parallelization of programs for distributed-memory parallel computers. The Omni XcalableMP compiler, provided as a reference XcalableMP compiler, is currently implemented as a source-to-source translator: it converts XcalableMP programs into standard MPI programs, which can be compiled by the native Fortran compiler and executed on most parallel computers. A three-dimensional Eulerian fluid code written in Fortran is parallelized with XcalableMP using two different programming models based on the ordinary domain decomposition method, and its performance is measured on the K computer. Programs converted by the Omni XcalableMP compiler prevent native Fortran compiler optimizations and show lower performance than hand-coded MPI programs. With specific options of the native Fortran compiler, almost the same performance is eventually obtained in the case of the global-view programming model, but the degradation is not improved by any native compiler options when the code is parallelized with the local-view programming model.


Author(s): Hidetoshi Iwashita, Masahiro Nakao

Abstract: Coarray features have been implemented in the Omni XcalableMP compiler with a source-to-source translator and layered runtime libraries. Three memory allocation methods for coarrays were implemented for the GASNet and MPI-3 communication libraries and for Fujitsu's native interface. For coarray PUT/GET communication, algorithms using DMA (zero-copy) and buffering were introduced. The key techniques for achieving high performance were non-blocking PUT communication, implemented in the runtime library, and an optimization for GET communication in the translator. The fundamental performance was evaluated and analyzed using the ping-pong benchmark and a modified version of it. The MPI version of the Himeno benchmark was ported to a coarray version and modified to make full use of non-blocking PUT. In the evaluation, the non-blocking coarray version clearly outperformed both the original and the non-blocking MPI versions.


Author(s): Hitoshi Murai, Masahiro Nakao, Mitsuhisa Sato

Abstract: XcalableMP (XMP) is a directive-based language extension of Fortran and C for distributed-memory parallel computers, and can be classified as a partitioned global address space (PGAS) language. One of the remarkable characteristics of XMP is that it supports both global-view and local-view parallel programming. This chapter describes the programming model and language specification of XMP.


Author(s): Mitsuhisa Sato, Hitoshi Murai, Masahiro Nakao, Keisuke Tsugane, Tetsuya Odajima, ...

Abstract: This chapter presents XcalableMP on the Fugaku supercomputer, the Japanese flagship supercomputer developed by the FLAGSHIP 2020 project at RIKEN R-CCS. The porting and performance evaluation were done as part of this project, and XcalableMP is available to Fugaku users for improving the productivity and performance of parallel programming. The performance of XcalableMP on Fugaku is enhanced by the many-core processor and the new Tofu-D interconnect. We are now working on the next version, XcalableMP 2.0, targeting cutting-edge high-performance systems with many-core processors through multithreading and multi-tasking, integrating the PGAS model with synchronization models. We conclude this book with retrospectives and challenges for future PGAS models.


Author(s): Masahiro Nakao, Hitoshi Murai

Abstract: This chapter describes the implementation and performance evaluation of the Omni compiler, a reference compiler implementation for XcalableMP. For the performance evaluation, this chapter also presents how the HPC Challenge benchmarks, a benchmark suite for HPC parallel languages, are implemented. The results show that the performance of XMP is comparable to that of MPI in many cases.


Author(s): Masahiro Nakao

Abstract: This chapter presents mixed-language programming with XcalableMP and other programming languages, supported by linkage functions between XcalableMP and the MPI library. We also demonstrate how to call an XcalableMP program from a Python program (M. Nakao et al., Linkage of XcalableMP and Python languages for high productivity on HPC cluster system, Proceedings of Workshops of HPC Asia, no. 9, pp. 39–47, 2018).

