Large-Scale Parallel Method of Moments on CPU/MIC Heterogeneous Clusters

2017 ◽  
Vol 65 (7) ◽  
pp. 3782-3787 ◽  
Author(s):  
Yan Chen ◽  
Sheng Zuo ◽  
Yu Zhang ◽  
Xunwang Zhao ◽  
Huanhuan Zhang
2017 ◽  
Vol 2017 ◽  
pp. 1-12 ◽  
Author(s):  
Shanghong Zhang ◽  
Wenda Li ◽  
Zhu Jing ◽  
Yujun Yi ◽  
Yong Zhao

Three parallel methods (OpenMP, MPI, and OpenACC) are evaluated for the computation of a two-dimensional dam-break model using the explicit finite volume method. A dam-break event in the Pangtoupao flood storage area in China is selected as a case study to demonstrate the key technologies for implementing parallel computation, and the acceleration achieved by each method is evaluated. The simulation results show that the OpenMP and MPI parallel methods achieve speedup factors of 9.8× and 5.1×, respectively, on a 32-core computer, whereas the OpenACC parallel method achieves a speedup factor of 20.7× on an NVIDIA Tesla K20c graphics card. The results show that if the memory required by the dam-break simulation does not exceed the memory capacity of a single computer, the OpenMP parallel method is a good choice. If GPU acceleration is used, the OpenACC parallel method gives the greatest acceleration. Finally, the MPI parallel method is suitable for a model that requires little data exchange and large-scale calculation. This study compares the efficiency and methodology of accelerating algorithms for a dam-break model and can also be used as a reference for selecting the best acceleration method for a similar hydrodynamic model.
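As a rough illustration of what the CPU speedup figures imply, the sketch below converts them to parallel efficiency (speedup divided by worker count). It assumes, as the abstract does not state explicitly, that both the OpenMP and MPI runs used all 32 cores; the function name `parallel_efficiency` is hypothetical. The OpenACC figure is left out because a per-core efficiency is not meaningful for the GPU run.

```python
def parallel_efficiency(speedup: float, workers: int) -> float:
    """Parallel efficiency = speedup / number of workers (1.0 is ideal)."""
    if workers <= 0:
        raise ValueError("workers must be positive")
    return speedup / workers


# Speedups reported in the abstract for the 32-core computer.
# Assumption: every run used all 32 cores.
CORES = 32
reported_speedups = {"OpenMP": 9.8, "MPI": 5.1}

for method, speedup in reported_speedups.items():
    eff = parallel_efficiency(speedup, CORES)
    print(f"{method}: speedup {speedup}x, efficiency {eff:.1%}")
```

Under that assumption, both CPU methods fall well short of ideal scaling, which is consistent with the abstract's remark that MPI suits models requiring little data exchange.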


2020 ◽  
Vol 228 ◽  
pp. 106170 ◽  
Author(s):  
Fei Yu ◽  
Yan Zeng ◽  
Z.Q. Guan ◽  
S.H. Lo

2014 ◽  
Vol 2014 ◽  
pp. 1-11 ◽  
Author(s):  
Zhongchao Lin ◽  
Yu Zhang ◽  
Shugang Jiang ◽  
Xunwang Zhao ◽  
Jingyan Mo

The parallel higher-order Method of Moments (MoM), based on the Message Passing Interface (MPI), has been successfully used to analyze the changes in the radiation patterns of a microstrip patch array antenna mounted at different positions on an airplane. A block-partitioned scheme for the large dense MoM matrix and a block-cyclic matrix distribution scheme are designed to achieve excellent load balance and high parallel efficiency. Numerical results demonstrate that the rigorous parallel Method of Moments can efficiently and accurately solve large, complex electromagnetic problems with composite structures.
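The block-cyclic distribution mentioned above can be sketched as follows. This is a minimal illustration of the generic two-dimensional block-cyclic mapping (the convention used by ScaLAPACK-style dense solvers), not the authors' implementation; the function names, the block size, and the 2 × 2 process grid are all assumptions chosen for the example.

```python
def block_cyclic_owner(i, j, block, prow, pcol):
    """Return the (process row, process column) owning matrix entry (i, j).

    The matrix is cut into block x block tiles, and the tiles are dealt
    out cyclically over a prow x pcol process grid.
    """
    return (i // block) % prow, (j // block) % pcol


def entries_per_process(n, block, prow, pcol):
    """Count how many entries of an n x n matrix each process owns."""
    counts = {(p, q): 0 for p in range(prow) for q in range(pcol)}
    for i in range(n):
        for j in range(n):
            counts[block_cyclic_owner(i, j, block, prow, pcol)] += 1
    return counts


# Hypothetical example: 8 x 8 matrix, 2 x 2 blocks, 2 x 2 process grid.
for owner, count in sorted(entries_per_process(8, 2, 2, 2).items()):
    print(owner, count)
```

Because consecutive blocks go to different processes, each process holds pieces from every region of the matrix, which keeps the work per process balanced as a factorization progresses through the matrix.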


2010 ◽  
Vol 2010.5 (0) ◽  
pp. _59079-1_-_59079-12_
Author(s):  
Hammad Mazhar ◽  
Dan Negrut ◽  
Arman Pazouki ◽  
Alessandro Tasora

2005 ◽  
Vol 2005.18 (0) ◽  
pp. 159-160
Author(s):  
Tetsuya Sakurai ◽  
Yoshihisa Kodaki ◽  
Hiroto Tadano ◽  
Yuichi Inadomi ◽  
Hiroaki Umeda ◽  
...  

2011 ◽  
Vol 58-60 ◽  
pp. 298-303
Author(s):  
Lei Yu ◽  
Zhi Yong Liu ◽  
Dong Rui Fan ◽  
Yike Ma ◽  
Feng Long Song ◽  
...  

Streaming programs are an important application of large-scale parallel processing, but the common parallel method for streaming programs is not well suited to many-core architectures. This paper proposes a novel mapping method for streaming programs based on the hardware architecture. The method uses scratch-pad memory (SPM) to transfer data, which reduces both the number of accesses to the shared cache and the cost of communication. The experimental results show the effectiveness of the method.

