Data Parallel Algorithms

Author(s):  
Dieter W. Heermann ◽  
Anthony N. Burkitt
1994 ◽  
Vol 22 (2) ◽  
pp. 185-201 ◽  
Author(s):  
J. Gabarro ◽  
R. Gavalda

1990 ◽  
Vol 1 (4) ◽  
pp. 486-499 ◽  
Author(s):  
C.-T. King ◽  
W.-H. Chou ◽  
L.M. Ni

Author(s):  
J. Gaber ◽  
G. Goncalves ◽  
T. Hsu ◽  
P. Lecouffe ◽  
B. Toursel

Complexity ◽  
2021 ◽  
Vol 2021 ◽  
pp. 1-12
Author(s):  
Liangfu Jiang ◽  
Haoran Yuan

In this paper, we analyse and study the interdisciplinary style of stable parallel algorithms for online computer education and real problem scenarios for STEM education. Under the guidance of the STEM concept, the project theme is designed based on the principles of project design, and the interdisciplinary knowledge points related to the theme are determined. Based on the project theme, the teaching design model of the STEM-based robotics project was constructed. The analysis of the results showed that the STEM-based middle-school robotics project not only increased students’ interest in robotics but also promoted their creativity, problem-solving, teamwork, and communication skills, as well as their learning of other subjects. In this paper, an adaptive overflow-aware loss amplification strategy is proposed, which effectively alleviates the training nonconvergence problem caused by gradient overflow in mixed accuracy training. Also, this paper demonstrates that data-parallel training under multiple machines and multiple cards should use local batch normalization, which is particularly effective in accelerating neural networks containing more batch normalization layers.


2007 ◽  
Vol 12 (1) ◽  
pp. 71-79 ◽  
Author(s):  
Alexander Jakušev

Parallel array library ParSol is an easy way to parallelize data parallel algorithms implemented in C/C++. However, in order to use all the features provided by C++ and OOP in real life applications, the efficiency of C++ code that uses ParSol library must be similar to the one of C code. Template metaprogramming is one of the ways to achieve this goal. This paper describes the details of application of this technology to parallel arrays, and presents the efficiency tests.


Sign in / Sign up

Export Citation Format

Share Document