parallel programming model Latest Research Papers

Many applications in scientific computing process very large sparse matrices on parallel architectures. The presented work in this paper is a part of a project where our general aim is to develop an auto-tuner system for the selection of the best matrix compression format in the context of high-performance computing. The target smart system can automatically select the best compression format for a given sparse matrix, a numerical method processing this matrix, a parallel programming model and a target architecture. Hence, this paper describes the design and implementation of the proposed concept. We consider a case study consisting of a numerical method reduced to the sparse matrix vector product (SpMV), some compression formats, the data parallel as a programming model and, a distributed multi-core platform as a target architecture. This study allows extracting a set of important novel metrics and parameters which are relative to the considered programming model. Our metrics are used as input to a machine-learning algorithm to predict the best matrix compression format. An experimental study targeting a distributed multi-core platform and processing random and real-world matrices shows that our system can improve in average up to 7% the accuracy of the machine learning.

Download Full-text

Effective On-Chip Communication for Message Passing Programs on Multi-Core Processors

Electronics ◽

10.3390/electronics10212681 ◽

2021 ◽

Vol 10 (21) ◽

pp. 2681

Author(s):

Joonmoo Huh ◽

Deokwoo Lee

Keyword(s):

Parallel Programming ◽

Shared Memory ◽

Message Passing ◽

Programming Model ◽

Multicore Architectures ◽

Worst Case ◽

High Performing ◽

Parallel Programming Model ◽

On Chip ◽

Sharing Patterns

Shared memory is the most popular parallel programming model for multi-core processors, while message passing is generally used for large distributed machines. However, as the number of cores on a chip increases, the relative merits of shared memory versus message passing change, and we argue that message passing becomes a viable, high performing, and parallel programming model. To demonstrate this hypothesis, we compare a shared memory architecture with a new message passing architecture on a suite of applications tuned for each system independently. Perhaps surprisingly, the fundamental behaviors of the applications studied in this work, when optimized for both models, are very similar to each other, and both could execute efficiently on multicore architectures despite many implementations being different from each other. Furthermore, if hardware is tuned to support message passing by supporting bulk message transfer and the elimination of unnecessary coherence overheads, and if effective support is available for global operations, then some applications would perform much better on a message passing architecture. Leveraging our insights, we design a message passing architecture that supports both memory-to-memory and cache-to-cache messaging in hardware. With the new architecture, message passing is able to outperform its shared memory counterparts on many of the applications due to the unique advantages of the message passing hardware as compared to cache coherence. In the best case, message passing achieves up to a 34% increase in speed over its shared memory counterpart, and it achieves an average 10% increase in speed. In the worst case, message passing is slowed down in two applications—CG (conjugate gradient) and FT (Fourier transform)—because it could not perform well on the unique data sharing patterns as its counterpart of shared memory. Overall, our analysis demonstrates the importance of considering message passing as a high performing and hardware-supported programming model on future multicore architectures.

Download Full-text

Parallelism exploration in sequential algorithms via animation tool

Multiagent and Grid Systems ◽

10.3233/mgs-210347 ◽

2021 ◽

Vol 17 (2) ◽

pp. 145-158

Author(s):

Ahmad Qawasmeh ◽

Salah Taamneh ◽

Ashraf H. Aljammal ◽

Nabhan Hamadneh ◽

Mustafa Banikhalaf ◽

...

Keyword(s):

Parallel Programming ◽

High Performance ◽

Programming Model ◽

Parallel Applications ◽

Sequential Algorithm ◽

Sequential Algorithms ◽

Web Based ◽

Test Study ◽

Performance Techniques ◽

Parallel Programming Model

Different high performance techniques, such as profiling, tracing, and instrumentation, have been used to tune and enhance the performance of parallel applications. However, these techniques do not show how to explore the potential of parallelism in a given application. Animating and visualizing the execution process of a sequential algorithm provide a thorough understanding of its usage and functionality. In this work, an interactive web-based educational animation tool was developed to assist users in analyzing sequential algorithms to detect parallel regions regardless of the used parallel programming model. The tool simplifies algorithms’ learning, and helps students to analyze programs efficiently. Our statistical t-test study on a sample of students showed a significant improvement in their perception of the mechanism and parallelism of applications and an increase in their willingness to learn algorithms and parallel programming.

Download Full-text

A Comparative Survey of Big Data Computing and HPC: From a Parallel Programming Model to a Cluster Architecture

International Journal of Parallel Programming ◽

10.1007/s10766-021-00717-y ◽

2021 ◽

Author(s):

Fei Yin ◽

Feng Shi

Keyword(s):

Big Data ◽

Parallel Programming ◽

Programming Model ◽

Cluster Architecture ◽

Parallel Programming Model ◽

Comparative Survey ◽

Big Data Computing

Download Full-text

Interaction with the User in the SAPFOR System

Russian Digital Libraries Journal ◽

10.26907/1562-5419-2021-24-1-157-183 ◽

2021 ◽

Vol 24 (1) ◽

pp. 157-183

Author(s):

Никита Андреевич Катаев

Keyword(s):

Parallel Programming ◽

Program Transformation ◽

Heterogeneous Computing ◽

Programming Model ◽

Parallel Programs ◽

Parallel Program ◽

Program Parallelization ◽

Parallel Programming Model ◽

The One ◽

High Level

Automation of parallel programming is important at any stage of parallel program development. These stages include profiling of the original program, program transformation, which allows us to achieve higher performance after program parallelization, and, finally, construction and optimization of the parallel program. It is also important to choose a suitable parallel programming model to express parallelism available in a program. On the one hand, the parallel programming model should be capable to map the parallel program to a variety of existing hardware resources. On the other hand, it should simplify the development of the assistant tools and it should allow the user to explore the parallel program the assistant tools generate in a semi-automatic way. The SAPFOR (System FOR Automated Parallelization) system combines various approaches to automation of parallel programming. Moreover, it allows the user to guide the parallelization if necessary. SAPFOR produces parallel programs according to the high-level DVMH parallel programming model which simplify the development of efficient parallel programs for heterogeneous computing clusters. This paper focuses on the approach to semi-automatic parallel programming, which SAPFOR implements. We discuss the architecture of the system and present the interactive subsystem which is useful to guide the SAPFOR through program parallelization. We used the interactive subsystem to parallelize programs from the NAS Parallel Benchmarks in a semi-automatic way. Finally, we compare the performance of manually written parallel programs with programs the SAPFOR system builds.

Download Full-text

Debugging Parallel Programs in DVM-System

Russian Digital Libraries Journal ◽

10.26907/1562-5419-2020-23-4-866-886 ◽

2020 ◽

Vol 23 (4) ◽

pp. 866-886

Author(s):

Vladimir Aleksandrovich Bakhtin ◽

Dmitry Aleksandrovich Zakharov ◽

Aleksandr Aleksandrovich Ermichev ◽

Victor Alekseevich Krukov

Keyword(s):

Parallel Programming ◽

Heterogeneous Computing ◽

Programming Model ◽

Parallel Programs ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Parallel Programming Model ◽

Intel Xeon

DVM-system is designed for the development of parallel programs of scientific and technical calculations in the C-DVMH and Fortran-DVMH languages. These languages use a single DVMH-model of parallel programming model and are an extension of the standard C and Fortran languages with parallelism specifications in the form of compiler directives. The DVMH model makes it possible to create efficient parallel programs for heterogeneous computing clusters, in the nodes of which accelerators, graphic processors or Intel Xeon Phi coprocessors can be used as computing devices along with universal multi-core processors. The article describes the method of debugging parallel programs in DVM-system, as well as new features of DVM-debugger.

Download Full-text

An Analysis of Haskell Parallel Programming Model in the HaLVM

Journal of Physics Conference Series ◽

10.1088/1742-6596/1566/1/012070 ◽

2020 ◽

Vol 1566 ◽

pp. 012070

Author(s):

Junseok Cheon ◽

Yeoneo Kim ◽

Taekwang Hur ◽

Sugwoo Byun ◽

Gyun Woo

Keyword(s):

Parallel Programming ◽

Programming Model ◽

Parallel Programming Model

Download Full-text

The Using of DVM-System for Developing of a Program for Calculations of the Problem of Radiation Magnetic Gas Dynamics and Research of Plasma Dynamics in the QSPA Channel

Russian Digital Libraries Journal ◽

10.26907/1562-5419-2020-23-4-594-614 ◽

2020 ◽

Vol 23 (4) ◽

pp. 594-614

Author(s):

Vladimir Aleksandrovich Bakhtin ◽

Dmitry Aleksandrovich Zakharov ◽

Andrey Nikolaevich Kozlov ◽

Veniamin Sergeevich Konovalov

Keyword(s):

Parallel Programming ◽

Gas Dynamics ◽

Heterogeneous Computing ◽

Programming Model ◽

Parallel Programs ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Plasma Dynamics ◽

Parallel Programming Model ◽

Software Code

DVM-system is designed for the development of parallel programs of scientific and technical calculations in the C-DVMH and Fortran-DVMH languages. These languages use a single DVMH-model of parallel programming model and are an extension of the standard C and Fortran languages with parallelism specifications in the form of compiler directives. The DVMH model makes it possible to create efficient parallel programs for heterogeneous computing clusters, in the nodes of which accelerators, graphic processors or Intel Xeon Phi coprocessors can be used as computing devices along with universal multi-core processors. The article describes the experience of the successful using of DVM-system to develop a parallel software code for calculating the problem of radiation magnetic gas dynamics and for research of plasma dynamics in the QSPA channel.

Download Full-text

Progress in Dvm-System

Russian Digital Libraries Journal ◽

10.26907/1562-5419-2020-23-3-247-270 ◽

2020 ◽

Vol 23 (3) ◽

pp. 247-270

Author(s):

Valery Fedorovich Aleksahin ◽

Vladimir Aleksandrovich Bakhtin ◽

Olga Fedorovna Zhukova ◽

Dmitry Aleksandrovich Zakharov ◽

Victor Alekseevich Krukov ◽

...

Keyword(s):

Parallel Programming ◽

Heterogeneous Computing ◽

Programming Model ◽

Parallel Programs ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Parallel Programming Model ◽

Intel Xeon

DVM-system is designed for the development of parallel programs of scientific and technical calculations in the C-DVMH and Fortran-DVMH languages. These languages use a single DVMH-model of parallel programming model and are an extension of the standard C and Fortran languages with parallelism specifications in the form of compiler directives. The DVMH model makes it possible to create efficient parallel programs for heterogeneous computing clusters, in the nodes of which accelerators, graphic processors or Intel Xeon Phi coprocessors can be used as computing devices along with universal multi-core processors. The article presents new features of DVM-system that have been developed recently.

Download Full-text

Map-Balance-Reduce: An improved parallel programming model for load balancing of MapReduce

Future Generation Computer Systems ◽

10.1016/j.future.2017.03.013 ◽

2020 ◽

Vol 105 ◽

pp. 993-1001 ◽

Cited By ~ 11

Author(s):

Jianjiang Li ◽

Yajun Liu ◽

Jian Pan ◽

Peng Zhang ◽

Wei Chen ◽

...

Keyword(s):

Load Balancing ◽

Parallel Programming ◽

Programming Model ◽

Parallel Programming Model

Download Full-text

parallel programming model
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Machine Learning to Design an Auto-tuning System for the Best Compressed Format Detection for Parallel Sparse Computations

Effective On-Chip Communication for Message Passing Programs on Multi-Core Processors

Parallelism exploration in sequential algorithms via animation tool

A Comparative Survey of Big Data Computing and HPC: From a Parallel Programming Model to a Cluster Architecture

Interaction with the User in the SAPFOR System

Debugging Parallel Programs in DVM-System

An Analysis of Haskell Parallel Programming Model in the HaLVM

The Using of DVM-System for Developing of a Program for Calculations of the Problem of Radiation Magnetic Gas Dynamics and Research of Plasma Dynamics in the QSPA Channel

Progress in Dvm-System

Map-Balance-Reduce: An improved parallel programming model for load balancing of MapReduce

Export Citation Format

parallel programming modelRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Machine Learning to Design an Auto-tuning System for the Best Compressed Format Detection for Parallel Sparse Computations

Effective On-Chip Communication for Message Passing Programs on Multi-Core Processors

Parallelism exploration in sequential algorithms via animation tool

A Comparative Survey of Big Data Computing and HPC: From a Parallel Programming Model to a Cluster Architecture

Interaction with the User in the SAPFOR System

Debugging Parallel Programs in DVM-System

An Analysis of Haskell Parallel Programming Model in the HaLVM

The Using of DVM-System for Developing of a Program for Calculations of the Problem of Radiation Magnetic Gas Dynamics and Research of Plasma Dynamics in the QSPA Channel

Progress in Dvm-System

Map-Balance-Reduce: An improved parallel programming model for load balancing of MapReduce

parallel programming model
Recently Published Documents