Improving parallel program execution time with message consolidation

Author(s):  
S. Shafer ◽  
K. Ghose
Author(s):  
С.Е. Киреев

На основе разработанного ранее программного комплекса для моделирования многофазных потоков в деформируемой пористой среде реализован новый параллельный программный комплекс, оптимизированный для запуска на кластере с ускорителями Intel Xeon Phi. Рассмотрены различные способы оптимизации, специфичные для данного ускорителя, и их влияние на время работы программы. Выполнено сравнение различных способов использования ускорителей в составе кластера: симметричного режима и режима offload. Получены оценки ускорения и эффективности при использовании различного числа узлов кластера. On the basis of a previously developed program for modeling the multiphase flows in finite-deformed porous media, a new parallel program optimized for clusters with Intel Xeon Phi accelerators is implemented. Several optimization techniques specific for such accelerators are considered and their effect on the program execution time is discussed. A comparison of the symmetric and offload programming models for the accelerators is performed. The parallelization speedup and efficiency are estimatedwhen using various numbers of cluster's nodes.


2003 ◽  
Vol 13 (04) ◽  
pp. 513-524 ◽  
Author(s):  
H. GAUTAMA ◽  
A. J. C. VAN GEMUND

Speculative parallelism refers to searching in parallel for a solution, such as finding a pattern in a data base, where finding the first solution terminates the whole parallel process. Different performance prediction methods are required as compared to traditional parallelism. In this paper we introduce an analytical approach to predict the execution time distribution of data-dependent parallel programs that feature N-ary and binary speculative parallel compositions. The method is based on the use of statistical moments which allows program execution time distribution to be approximated at O(1) solution complexity. Measurement results for synthetic distributions indicate an accuracy that lies in the percent range while for empirical distributions on internet search engines the prediction accuracy is acceptable, provided sufficient workload unimodality.


Sign in / Sign up

Export Citation Format

Share Document