AtTune: A Heuristic based Framework for Parallel Applications Autotuning

Mapping Intimacies ◽

10.5753/sbesc_estendido.2020.13105 ◽

2020 ◽

Author(s):

Hiago Rocha ◽

Janaina Schwarzrock ◽

Monica Pereira ◽

Lucas Schnorr ◽

Philippe Navaux ◽

...

Keyword(s):

Energy Efficiency ◽

Parallel Programming ◽

Experimental Results ◽

Parallel Applications ◽

Programming Models ◽

Data Synchronization ◽

Parallel Programming Models ◽

Efficiency Performance ◽

Several aspects limit the scalability of parallel applications, e.g., off-chip bus saturation and data synchronization. Moreover, the high cost of cooling HPC systems, which can outweigh the cost of developing the system itself, has pushed the parallel application’s execution to another level of requirements, in terms of performance and energy. In this work, we propose AtTune: a heuristic-based framework for tuning the number of processes/threads and CPU frequency to optimize the parallel applications’ execution. AtTune is transparent for the user, independent of the input size, and it optimizes for different parallel programming models. We evaluated our proposed solution considering five well-known kernels implemented in MPI and OpenMP. Experimental results on two real multi-core systems showed that AtTune improves up to 36%, 11%, and 32% the energy efficiency, performance, and Energy-Delay Product, respectively.

Download Full-text

Study of parallel programming models on computer clusters with Intel MIC coprocessors

The International Journal of High Performance Computing Applications ◽

10.1177/1094342015580864 ◽

2015 ◽

Vol 31 (4) ◽

pp. 303-315 ◽

Author(s):

Miaoqing Huang ◽

Chenggang Lai ◽

Xuan Shi ◽

Zhijun Hao ◽

Haihang You

Keyword(s):

Parallel Programming ◽

High Performance ◽

Programming Model ◽

Fixed Number ◽

Parallel Applications ◽

Programming Models ◽

Communication Overhead ◽

Computer Clusters ◽

Parallel Programming Models ◽

Coprocessors based on the Intel Many Integrated Core (MIC) Architecture have been adopted in many high-performance computer clusters. Typical parallel programming models, such as MPI and OpenMP, are supported on MIC processors to achieve the parallelism. In this work, we conduct a detailed study on the performance and scalability of the MIC processors under different programming models using the Beacon computer cluster. Our findings are as follows. (1) The native MPI programming model on the MIC processors is typically better than the offload programming model, which offloads the workload to MIC cores using OpenMP. (2) On top of the native MPI programming model, multithreading inside each MPI process can further improve the performance for parallel applications on computer clusters with MIC coprocessors. (3) Given a fixed number of MPI processes, it is a good strategy to schedule these MPI processes to as few MIC processors as possible to reduce the cross-processor communication overhead. (4) The hybrid MPI programming model, in which data processing is distributed to both MIC cores and CPU cores, can outperform the native MPI programming model.

Download Full-text

APPROACHING DEVELOPMENTS ON PARALLEL PROGRAMMING MODELS THROUGH JAVA

i-manager’s Journal on Software Engineering ◽

10.26634/jse.10.3.4900 ◽

2016 ◽

Vol 10 (3) ◽

pp. 14

Author(s):

VEERASAMY BALA DHANDAYUTHAPANI ◽

NASIRA G.M ◽

◽

Keyword(s):

Parallel Programming ◽

Programming Models ◽

Parallel Programming Models

Download Full-text

Linear programming models for measuring economy-wide energy efficiency performance

Energy Policy ◽

10.1016/j.enpol.2008.03.041 ◽

2008 ◽

Vol 36 (8) ◽

pp. 2911-2916 ◽

Author(s):

P. Zhou ◽

B.W. Ang

Keyword(s):

Energy Efficiency ◽

Linear Programming ◽

Programming Models ◽

Efficiency Performance ◽

Download Full-text

Dynamic clustering for distinct parallel programming models on NoC-based MPSoCs

Proceedings of the 4th International Workshop on Network on Chip Architectures - NoCArc '11 ◽

10.1145/2076501.2076514 ◽

2011 ◽

Author(s):

Gustavo Girão ◽

Thiago Santini ◽

Flávio R. Wagner

Keyword(s):

Parallel Programming ◽

Programming Models ◽

Dynamic Clustering ◽

Parallel Programming Models

Download Full-text

Evaluating attainable memory bandwidth of parallel programming models via BabelStream

International Journal of Computational Science and Engineering ◽

10.1504/ijcse.2017.10011352 ◽

2017 ◽

Vol 1 (1) ◽

pp. 1

Author(s):

Matt Martineau ◽

Simon McIntosh Smith ◽

James Price ◽

Tom Deakin

Keyword(s):

Parallel Programming ◽

Programming Models ◽

Memory Bandwidth ◽

Parallel Programming Models

Download Full-text

Tying Memory Management to Parallel Programming Models

Euro-Par 2006 Parallel Processing - Lecture Notes in Computer Science ◽

10.1007/11823285_69 ◽

2006 ◽

pp. 666-675

Author(s):

Ioannis E. Venetis ◽

Theodore S. Papatheodorou

Keyword(s):

Parallel Programming ◽

Memory Management ◽

Programming Models ◽

Parallel Programming Models

Download Full-text

Fifth International Workshop on High-level Parallel Programming Models and Supportive Environments HIPS 2000

Lecture Notes in Computer Science - Parallel and Distributed Processing ◽

10.1007/3-540-45591-4_34 ◽

2000 ◽

pp. 257-260

Author(s):

Martin Schulz

Keyword(s):

Parallel Programming ◽

International Workshop ◽

Programming Models ◽

Parallel Programming Models ◽

Supportive Environments ◽

Download Full-text

On the adequacy of lightweight thread approaches for high-level parallel programming models

Future Generation Computer Systems ◽

10.1016/j.future.2018.02.016 ◽

2018 ◽

Vol 84 ◽

pp. 22-31 ◽

Author(s):

Adrián Castelló ◽

Rafael Mayo ◽

Kevin Sala ◽

Vicenç Beltran ◽

Pavan Balaji ◽

...

Keyword(s):

Parallel Programming ◽

Programming Models ◽

Parallel Programming Models ◽

Download Full-text

4. Parallel Programming Models

Parallel MATLAB for Multicore and Multinode Computers ◽

10.1137/1.9780898718126.ch4 ◽

2009 ◽

pp. 55-76

Keyword(s):

Parallel Programming ◽

Programming Models ◽

Parallel Programming Models

Download Full-text

A comparison of the shared-memory parallel programming models OpenMP, OpenACC and Kokkos in the context of implicit solvers for high-order FEM

Computer Physics Communications ◽

10.1016/j.cpc.2020.107245 ◽

2020 ◽

Vol 255 ◽

pp. 107245 ◽

Author(s):

Jan Eichstädt ◽

Martin Vymazal ◽

David Moxey ◽

Joaquim Peiró

Keyword(s):

Parallel Programming ◽

Shared Memory ◽

Programming Models ◽

Implicit Solvers ◽

Parallel Programming Models

Download Full-text