Toward Performance Portability of Highly Parametrizable TRSM Algorithm Using SYCL

International Workshop on OpenCL ◽

10.1145/3456669.3456694 ◽

2021 ◽

Author(s):

Thales Sabino ◽

Mehdi Goli

Keyword(s):

Performance Portability

Download Full-text

Toward a Better Performance Portability Metric

2021 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) ◽

10.1109/pdp52278.2021.00036 ◽

2021 ◽

Author(s):

Ami Marowka

Keyword(s):

Performance Portability

Download Full-text

Performance portability on EARTH: a case study across several parallel architectures

Cluster Computing ◽

10.1007/s10586-007-0011-1 ◽

2007 ◽

Vol 10 (2) ◽

pp. 115-126 ◽

Author(s):

Weirong Zhu ◽

Yanwei Niu ◽

Guang R. Gao

Keyword(s):

Parallel Architectures ◽

Performance Portability

Download Full-text

Performance Portability of Molecular Docking Miniapp On Leadership Computing Platforms

2020 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC) ◽

10.1109/p3hpc51967.2020.00009 ◽

2020 ◽

Author(s):

Mathialakan Thavappiragasam ◽

Aaron Scheinberg ◽

Wael Elwasif ◽

Oscar Hernandez ◽

Ada Sedova

Keyword(s):

Molecular Docking ◽

Performance Portability ◽

Computing Platforms

Download Full-text

Performance-Portability Results for the Non-Hydrostatic Atmosphere Dycore of E3SM at Cloud-Resolving Resolutions.

10.1002/essoar.10504848.1 ◽

2020 ◽

Author(s):

Luca Bertagna ◽

Oksana Guba ◽

Mark Taylor ◽

James Foucar ◽

Andrew Bradley ◽

...

Keyword(s):

Performance Portability

Download Full-text

Programmability and performance portability aspects of heterogeneous multi-/manycore systems

2011 Design, Automation & Test in Europe ◽

10.1109/date.2012.6176582 ◽

2012 ◽

Author(s):

C. Kessler ◽

U. Dastgeer ◽

S. Thibault ◽

R. Namyst ◽

A. Richards ◽

...

Keyword(s):

Performance Portability ◽

And Performance

Download Full-text

On the GPU-CPU Performance Portability of OpenCL for 3D Stencil Computations

2013 International Conference on Parallel and Distributed Systems ◽

10.1109/icpads.2013.23 ◽

2013 ◽

Author(s):

Huayou Su ◽

Nan Wu ◽

Mei Wen ◽

Chunyuan Zhang ◽

Xing Cai

Keyword(s):

Stencil Computations ◽

Performance Portability

Download Full-text

A lightweight approach to performance portability with targetDP

The International Journal of High Performance Computing Applications ◽

10.1177/1094342016682071 ◽

2016 ◽

Vol 32 (2) ◽

pp. 288-301

Author(s):

Alan Gray ◽

Kevin Stratford

Keyword(s):

Particle Physics ◽

Message Passing ◽

Graphics Processing Units ◽

High Performance ◽

Large Scale ◽

Message Passing Interface ◽

Graphics Processing Unit ◽

Processing Unit ◽

Performance Portability ◽

Graphics Processing

Leading high performance computing systems achieve their status through use of highly parallel devices such as NVIDIA graphics processing units or Intel Xeon Phi many-core CPUs. The concept of performance portability across such architectures, as well as traditional CPUs, is vital for the application programmer. In this paper we describe targetDP, a lightweight abstraction layer which allows grid-based applications to target data parallel hardware in a platform agnostic manner. We demonstrate the effectiveness of our pragmatic approach by presenting performance results for a complex fluid application (with which the model was co-designed), plus separate lattice quantum chromodynamics particle physics code. For each application, a single source code base is seen to achieve portable performance, as assessed within the context of the Roofline model. TargetDP can be combined with Message Passing Interface (MPI) to allow use on systems containing multiple nodes: we demonstrate this through provision of scaling results on traditional and graphics processing unit-accelerated large scale supercomputers.

Download Full-text

Performance Portability Analysis for Real-Time Simulations of Smoke Propagation Using OpenACC

Lecture Notes in Computer Science - High Performance Computing ◽

10.1007/978-3-319-67630-2_35 ◽

2017 ◽

pp. 477-495 ◽

Author(s):

Anne Küsters ◽

Sandra Wienke ◽

Lukas Arnold

Keyword(s):

Real Time ◽

Performance Portability ◽

Real Time Simulations

Download Full-text

Examining Performance Portability with Kokkos for an Ewald Sum Coulomb Solver

Parallel Processing and Applied Mathematics - Lecture Notes in Computer Science ◽

10.1007/978-3-030-43222-5_4 ◽

2020 ◽

pp. 35-45

Author(s):

Rene Halver ◽

Jan H. Meinke ◽

Godehard Sutmann

Keyword(s):

Performance Portability ◽

Download Full-text

Enhancing Performance Portability of MPI Applications through Annotation-Based Transformations

2013 42nd International Conference on Parallel Processing ◽

10.1109/icpp.2013.77 ◽

2013 ◽

Author(s):

Md. Ziaul Haque ◽

Qing Yi ◽

James Dinan ◽

Pavan Balaji

Keyword(s):

Performance Portability ◽

Mpi Applications

Download Full-text