Debugging Parallel Programs in DVM-System

Vladimir Aleksandrovich Bakhtin; Dmitry Aleksandrovich Zakharov; Aleksandr Aleksandrovich Ermichev; Victor Alekseevich Krukov

doi:10.26907/1562-5419-2020-23-4-866-886

Debugging Parallel Programs in DVM-System

Russian Digital Libraries Journal ◽

10.26907/1562-5419-2020-23-4-866-886 ◽

2020 ◽

Vol 23 (4) ◽

pp. 866-886

Author(s):

Vladimir Aleksandrovich Bakhtin ◽

Dmitry Aleksandrovich Zakharov ◽

Aleksandr Aleksandrovich Ermichev ◽

Victor Alekseevich Krukov

Keyword(s):

Parallel Programming ◽

Heterogeneous Computing ◽

Programming Model ◽

Parallel Programs ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Parallel Programming Model ◽

Intel Xeon

DVM-system is designed for developing parallel programs for scientific and engineering computations in the C-DVMH and Fortran-DVMH languages. Both languages are based on a single parallel programming model, DVMH, and extend standard C and Fortran with parallelism specifications in the form of compiler directives. The DVMH model makes it possible to create efficient parallel programs for heterogeneous computing clusters whose nodes can use accelerators (graphics processors or Intel Xeon Phi coprocessors) as computing devices alongside general-purpose multi-core processors. The article describes the method for debugging parallel programs in DVM-system, as well as new features of the DVM-debugger.

Progress in DVM-System

Russian Digital Libraries Journal ◽

10.26907/1562-5419-2020-23-3-247-270 ◽

2020 ◽

Vol 23 (3) ◽

pp. 247-270

Author(s):

Valery Fedorovich Aleksahin ◽

Vladimir Aleksandrovich Bakhtin ◽

Olga Fedorovna Zhukova ◽

Dmitry Aleksandrovich Zakharov ◽

Victor Alekseevich Krukov ◽

...

Keyword(s):

Parallel Programming ◽

Heterogeneous Computing ◽

Programming Model ◽

Parallel Programs ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Parallel Programming Model ◽

Intel Xeon

DVM-system is designed for developing parallel programs for scientific and engineering computations in the C-DVMH and Fortran-DVMH languages. Both languages are based on a single parallel programming model, DVMH, and extend standard C and Fortran with parallelism specifications in the form of compiler directives. The DVMH model makes it possible to create efficient parallel programs for heterogeneous computing clusters whose nodes can use accelerators (graphics processors or Intel Xeon Phi coprocessors) as computing devices alongside general-purpose multi-core processors. The article presents recently developed features of DVM-system.

The Using of DVM-System for Developing of a Program for Calculations of the Problem of Radiation Magnetic Gas Dynamics and Research of Plasma Dynamics in the QSPA Channel

Russian Digital Libraries Journal ◽

10.26907/1562-5419-2020-23-4-594-614 ◽

2020 ◽

Vol 23 (4) ◽

pp. 594-614

Author(s):

Vladimir Aleksandrovich Bakhtin ◽

Dmitry Aleksandrovich Zakharov ◽

Andrey Nikolaevich Kozlov ◽

Veniamin Sergeevich Konovalov

Keyword(s):

Parallel Programming ◽

Gas Dynamics ◽

Heterogeneous Computing ◽

Programming Model ◽

Parallel Programs ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Plasma Dynamics ◽

Parallel Programming Model ◽

Software Code

DVM-system is designed for developing parallel programs for scientific and engineering computations in the C-DVMH and Fortran-DVMH languages. Both languages are based on a single parallel programming model, DVMH, and extend standard C and Fortran with parallelism specifications in the form of compiler directives. The DVMH model makes it possible to create efficient parallel programs for heterogeneous computing clusters whose nodes can use accelerators (graphics processors or Intel Xeon Phi coprocessors) as computing devices alongside general-purpose multi-core processors. The article describes the successful use of DVM-system to develop parallel code for solving a problem of radiation magnetic gas dynamics and for studying plasma dynamics in the QSPA channel.

Interaction with the User in the SAPFOR System

Russian Digital Libraries Journal ◽

10.26907/1562-5419-2021-24-1-157-183 ◽

2021 ◽

Vol 24 (1) ◽

pp. 157-183

Author(s):

Nikita Andreevich Kataev

Keyword(s):

Parallel Programming ◽

Program Transformation ◽

Heterogeneous Computing ◽

Programming Model ◽

Parallel Programs ◽

Parallel Program ◽

Program Parallelization ◽

Parallel Programming Model ◽

The One ◽

High Level

Automation of parallel programming is important at every stage of parallel program development. These stages include profiling the original program, transforming the program to achieve higher performance after parallelization, and, finally, constructing and optimizing the parallel program. It is also important to choose a suitable parallel programming model to express the parallelism available in a program. On the one hand, the parallel programming model should be able to map the parallel program onto a variety of existing hardware resources. On the other hand, it should simplify the development of assistant tools and allow the user to explore the parallel program that these tools generate in a semi-automatic way. The SAPFOR (System FOR Automated Parallelization) system combines various approaches to the automation of parallel programming. Moreover, it allows the user to guide the parallelization if necessary. SAPFOR produces parallel programs according to the high-level DVMH parallel programming model, which simplifies the development of efficient parallel programs for heterogeneous computing clusters. This paper focuses on the approach to semi-automatic parallel programming that SAPFOR implements. We discuss the architecture of the system and present the interactive subsystem, which is used to guide SAPFOR through program parallelization. We used the interactive subsystem to parallelize programs from the NAS Parallel Benchmarks in a semi-automatic way. Finally, we compare the performance of manually written parallel programs with that of programs built by SAPFOR.

EVALUATION OF OPENMP OPTIMIZATION IN HETEROGENEOUS COMPUTING MODE BY CODE OFFLOADING ON INTEL XEON PHI CO-PROCESSOR

International Journal of Advanced Research in Computer Science ◽

10.26483/ijarcs.v9i2.5746 ◽

2018 ◽

Vol 9 (2) ◽

pp. 460-466

Author(s):

Kajal Chauhan

Keyword(s):

Heterogeneous Computing ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Code Offloading ◽

Intel Xeon

DALIGNER Performance Evaluation on Intel Xeon Phi Architecture

10.29007/j5cs ◽

2019 ◽

Author(s):

Evaldo Costa ◽

Gabriel Silva ◽

Marcello Teixeira

Keyword(s):

DNA Sequence ◽

Parallel Programs ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Base Pairs ◽

Scalable Architectures ◽

Sequencing Technologies ◽

DNA Sequence Assembly ◽

Long Read ◽

Intel Xeon

In bioinformatics, DNA sequence assembly refers to the reconstruction of an original DNA sequence by aligning and merging fragments obtained from various sequencing methods. The main sequencing methods process thousands or even millions of these fragments, which can be short (hundreds of base pairs) or long (thousands of base pairs) read sequences. This is a computationally intensive task that usually requires parallel programs and algorithms so that it can be performed with the desired accuracy and within suitable time limits. In this paper, we evaluate the performance of the DALIGNER long-read sequence aligner on a system using the Intel Xeon Phi 7210 processor. We are looking for scalable architectures that could provide higher throughput and be applied to future sequencing technologies.

Energy and Power Characterization of Parallel Programs Running on Intel Xeon Phi

2014 43rd International Conference on Parallel Processing Workshops ◽

10.1109/icppw.2014.43 ◽

2014 ◽

Cited By ~ 5

Author(s):

Joal Wood ◽

Ziliang Zong ◽

Qijun Gu ◽

Rong Ge

Keyword(s):

Parallel Programs ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Intel Xeon

Comparison of Three Popular Parallel Programming Models on the Intel Xeon Phi

Lecture Notes in Computer Science - Euro-Par 2014: Parallel Processing Workshops ◽

10.1007/978-3-319-14313-2_27 ◽

2014 ◽

pp. 314-325 ◽

Cited By ~ 4

Author(s):

Ashkan Tousimojarad ◽

Wim Vanderbauwhede

Keyword(s):

Parallel Programming ◽

Programming Models ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Parallel Programming Models ◽

Intel Xeon

MILC Code Performance on High End CPU and GPU Supercomputer Clusters

EPJ Web of Conferences ◽

10.1051/epjconf/201817502009 ◽

2018 ◽

Vol 175 ◽

pp. 02009

Author(s):

Carleton DeTar ◽

Steven Gottlieb ◽

Ruizi Li ◽

Doug Toussaint

Keyword(s):

Conjugate Gradient ◽

Memory Hierarchy ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Code Performance ◽

Recent Developments ◽

Knights Landing ◽

Many Core ◽

Intel Xeon

With recent developments in parallel supercomputing architecture, many-core, multi-core, and GPU processors are now commonplace, resulting in more levels of parallelism, deeper memory hierarchies, and greater programming complexity. It has been necessary to adapt the MILC code to these new processors, starting with NVIDIA GPUs and, more recently, the Intel Xeon Phi processors. We report on our efforts to port and optimize our code for the Intel Knights Landing architecture. We consider the performance of the MILC code with MPI and OpenMP, and optimizations with QOPQDP and QPhiX. For the latter approach, we concentrate on the staggered conjugate gradient and gauge force. We also consider performance on recent NVIDIA GPUs using the QUDA library.

Sequential Monte Carlo based parameter estimation for structural health monitoring with an Intel Xeon Phi optimized ultrasound kernel

10.1063/1.5099739 ◽

2019 ◽

Author(s):

William C. Schneck ◽

Heather Reed ◽

Elizabeth D. Gregory ◽

Cara A. C. Leckey

Keyword(s):

Monte Carlo ◽

Parameter Estimation ◽

Structural Health Monitoring ◽

Health Monitoring ◽

Sequential Monte Carlo ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Structural Health ◽

Intel Xeon

MapReduce Parallel Programming Model: A State-of-the-Art Survey

International Journal of Parallel Programming ◽

10.1007/s10766-015-0395-0 ◽

2015 ◽

Vol 44 (4) ◽

pp. 832-866 ◽

Cited By ~ 24

Author(s):

Ren Li ◽

Haibo Hu ◽

Heng Li ◽

Yunsong Wu ◽

Jianxi Yang

Keyword(s):

Parallel Programming ◽

Programming Model ◽

State Of The Art ◽

Parallel Programming Model
