CUDA Based Parallel Computation for Gauss Elimination Method

Volume 3: Structures, Safety, and Reliability ◽

10.1115/omae2018-78479 ◽

2018 ◽

Author(s):

Xiao Liu ◽

Lei Xu

Keyword(s):

Numerical Experiments ◽

Processing Unit ◽

Ocean Engineering ◽

Elimination Method ◽

Central Processing ◽

Structural Problems ◽

Device Architecture ◽

Gauss Elimination ◽

Speedup Ratio ◽

Gauss Elimination Method

The Central Processing Unit (CPU) parallel algorithm based on Computing Unified Device Architecture (CUDA) has shown great power of computing speedup ability. What performance will the new technique show in the field of structural computation? We choose the Gauss elimination method as the research object. In this study, the parallel Gauss elimination is realized in CUDA on GPU. Furthermore, we carry out two groups of numerical experiments. The first group investigates the effect of Matrix Bandwidths (MBs) and Node Numbers (NNs) on speedup ratio. The second one compares our method with the commercial software by analyzing two actual structural problems in ocean engineering.

Download Full-text

Supplementary material to "A Gauss Elimination Method for estimating locations of extrema in gridded data: Applications for Potential Field Data"

10.5194/npg-2020-21-supplement ◽

2020 ◽

Author(s):

Dung Nguyen Kim ◽

Dung Tran Tuan

Keyword(s):

Potential Field ◽

Field Data ◽

Gridded Data ◽

Elimination Method ◽

Potential Field Data ◽

Gauss Elimination ◽

Supplementary Material ◽

Gauss Elimination Method

Download Full-text

Modified Gauss Elimination Method

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2017.8250 ◽

2017 ◽

Vol V (VIII) ◽

pp. 1756-1758

Author(s):

J. Abubakkar Siddiq

Keyword(s):

Elimination Method ◽

Gauss Elimination ◽

Gauss Elimination Method

Download Full-text

CPU AND GPU (CUDA) TEMPLATE MATCHING COMPARISON / CPU IR GPU (CUDA) PALYGINIMAS VYKDANT ŠABLONŲ ATITIKTIES ALGORITMĄ

Mokslas - Lietuvos ateitis ◽

10.3846/mla.2014.16 ◽

2014 ◽

Vol 6 (2) ◽

pp. 129-133

Author(s):

Evaldas Borcovas ◽

Gintautas Daunys

Keyword(s):

Template Matching ◽

Gpu Computing ◽

Computing Time ◽

Processing Unit ◽

Compute Unified Device Architecture ◽

Central Processing ◽

Device Architecture ◽

Cuda Technology ◽

Dual Core ◽

Template Size

Image processing, computer vision or other complicated opticalinformation processing algorithms require large resources. It isoften desired to execute algorithms in real time. It is hard tofulfill such requirements with single CPU processor. NVidiaproposed CUDA technology enables programmer to use theGPU resources in the computer. Current research was madewith Intel Pentium Dual-Core T4500 2.3 GHz processor with4 GB RAM DDR3 (CPU I), NVidia GeForce GT320M CUDAcompliable graphics card (GPU I) and Intel Core I5-2500K3.3 GHz processor with 4 GB RAM DDR3 (CPU II), NVidiaGeForce GTX 560 CUDA compatible graphic card (GPU II).Additional libraries as OpenCV 2.1 and OpenCV 2.4.0 CUDAcompliable were used for the testing. Main test were made withstandard function MatchTemplate from the OpenCV libraries.The algorithm uses a main image and a template. An influenceof these factors was tested. Main image and template have beenresized and the algorithm computing time and performancein Gtpix/s have been measured. According to the informationobtained from the research GPU computing using the hardwarementioned earlier is till 24 times faster when it is processing abig amount of information. When the images are small the performanceof CPU and GPU are not significantly different. Thechoice of the template size makes influence on calculating withCPU. Difference in the computing time between the GPUs canbe explained by the number of cores which they have. Vaizdų apdorojimas, kompiuterinė rega ir kiti sudėtingi algoritmai, apdorojantys optinę informaciją, naudoja dideliusskaičiavimo išteklius. Dažnai šiuos algoritmus reikia realizuoti realiuoju laiku. Šį uždavinį išspręsti naudojant tik vienoCPU (angl. Central processing unit) pajėgumus yra sudėtinga. nVidia pasiūlyta CUDA (angl. Compute unified device architecture)technologija leidžia panaudoti GPU (angl. Graphic processing unit) išteklius. Tyrimui atlikti buvo pasirinkti du skirtingiCPU: Intel Pentium Dual-Core T4500 ir Intel Core I5 2500K, bei GPU: nVidia GeForce GT320M ir NVidia GeForce 560.Tyrime buvo panaudotos vaizdų apdorojimo bibliotekos: OpenCV 2.1 ir OpenCV 2.4. Tyrimui buvo pasirinktas šablonų atitiktiesalgoritmas. Algoritmui realizuoti reikalingas analizuojamas vaizdas ir ieškomo objekto vaizdo šablonas. Tyrimo metu buvokeičiamas vaizdo ir šablono dydis bei stebima, kaip tai veikia algoritmo vykdymo trukmę ir vykdomų operacijų skaičių persekundę. Iš gautų rezultatų galima teigti, kad apdorojant didelį duomenų kiekį GPU realizuoja algoritmą iki 24 kartų greičiaunei tik CPU. Dirbant su nedideliu duomenų kiekiu, skirtumas tarp CPU ir GPU yra minimalus. Lyginant skaičiavimus dviejuoseGPU, pastebėta, kad skaičiavimų sparta yra tiesiogiai proporcinga GPU turimų branduolių kiekiui. Mūsų tyrimo atvejuspartesniame GPU jų buvo 16 kartų daugiau, tad ir skaičiavimai vyko 16 kartų sparčiau.

Download Full-text

Taguchi and Gauss elimination method: A dual response approach for parametric optimization of CNC wire cut EDM of PRAlSiCMMC

The International Journal of Advanced Manufacturing Technology ◽

10.1007/s00170-004-2331-0 ◽

2005 ◽

Vol 28 (1-2) ◽

pp. 67-75 ◽

Cited By ~ 92

Author(s):

A. Manna ◽

B. Bhattacharyya

Keyword(s):

Parametric Optimization ◽

Elimination Method ◽

Dual Response ◽

Gauss Elimination ◽

Gauss Elimination Method

Download Full-text

A Comparison of Gauss Elimination Method for Dense Linear Systems on Hypercube and Mesh Parallel Architectures

SSRN Electronic Journal ◽

10.2139/ssrn.3170183 ◽

2018 ◽

Author(s):

Anvesha Katti

Keyword(s):

Linear Systems ◽

Parallel Architectures ◽

Elimination Method ◽

Gauss Elimination ◽

Gauss Elimination Method

Download Full-text

GPU-Accelerated Parallel FDTD on Distributed Heterogeneous Platform

International Journal of Antennas and Propagation ◽

10.1155/2014/321081 ◽

2014 ◽

Vol 2014 ◽

pp. 1-8 ◽

Cited By ~ 2

Author(s):

Ronglin Jiang ◽

Shugang Jiang ◽

Yu Zhang ◽

Ying Xu ◽

Lei Xu ◽

...

Keyword(s):

Message Passing ◽

Message Passing Interface ◽

Graphics Processing Unit ◽

Processing Unit ◽

Problem Size ◽

Central Processing ◽

Execution Speed ◽

Speedup Ratio ◽

Electromagnetic Calculations ◽

Graphics Processing

This paper introduces a (finite difference time domain) FDTD code written in Fortran and CUDA for realistic electromagnetic calculations with parallelization methods of Message Passing Interface (MPI) and Open Multiprocessing (OpenMP). Since both Central Processing Unit (CPU) and Graphics Processing Unit (GPU) resources are utilized, a faster execution speed can be reached compared to a traditional pure GPU code. In our experiments, 64 NVIDIA TESLA K20m GPUs and 64 INTEL XEON E5-2670 CPUs are used to carry out the pure CPU, pure GPU, and CPU + GPU tests. Relative to the pure CPU calculations for the same problems, the speedup ratio achieved by CPU + GPU calculations is around 14. Compared to the pure GPU calculations for the same problems, the CPU + GPU calculations have 7.6%–13.2% performance improvement. Because of the small memory size of GPUs, the FDTD problem size is usually very small. However, this code can enlarge the maximum problem size by 25% without reducing the performance of traditional pure GPU code. Finally, using this code, a microstrip antenna array with16×18elements is calculated and the radiation patterns are compared with the ones of MoM. Results show that there is a well agreement between them.

Download Full-text

ANALISIS KONSENTRASI CAMPURAN SENYAWA MENGGUNAKAN VB 2008

JURNAL ILMIAH SAINS ◽

10.35799/jis.13.1.2013.1860 ◽

2013 ◽

Vol 13 (1) ◽

pp. 15

Author(s):

Harry S.J Koleangan

Keyword(s):

Computer Program ◽

Programming Language ◽

Secondary Data ◽

Visual Basic ◽

Ethyl Benzene ◽

Elimination Method ◽

Application Program ◽

Gauss Method ◽

Gauss Elimination ◽

Gauss Elimination Method

ANALISIS KONSENTRASI CAMPURAN SENYAWA MENGGUNAKAN VB 2008 ABSTRAK Telah dibuat sebuah program aplikasi menggunakan VB 2008 yang ditujukan untuk menganalisis suatu larutan yang berisi campuran senyawa etil-benzena, o-silena, m-silena, dan p-silena. Konsentrasi dari masing-masing senyawa ini ditentukan menggunakan metode eliminasi Gauss dalam bentuk program komputer yang ditulis menggunakan bahasa pemrograman Visual Basic 2008. Penggunaan program ini terhadap suatu data sekunder, memberikan hasil konsentrasi (dalam satuan molar) sebagai berikut: etil-benzena = 0,04153, o-silena = 0,04067, m-silena = 0,02772, dan p-silena = 0,02522. Kata kunci: Metode Gauss, VB 2008 ANALYSIS OF MIXED CPMPOUND CONCENTRATION USING VB 2008 ABSTRACT A VB 2008-based application program to analyze a solution containing four different compounds, which are ethyl-benzene, o-xylene, m-xylene, and p-xylene, has been built. Concentration of each compound was then determined by using Gauss elimination method in the form of computer program written in Visual Basic 2008 programming language. Application of the program using the secondary data shows that concentrations (in molar) of each compuound are as follows: ethyl-benzene = 0,04153, o-xylena = 0,04067, m- xylena = 0,02772, and p- xylena = 0,02522. Keywords: Gauss method, VB 2008

Download Full-text

An Alternative technique to Gauss Elimination Method for Determinants: Integers Version

Journal of Advanced Research in Civil Engineering and Architecture ◽

10.33422/jarcea.2019.10.36 ◽

2019 ◽

Author(s):

Najm Obaid Salim Alghazali

Keyword(s):

Alternative Technique ◽

Elimination Method ◽

Gauss Elimination ◽

Gauss Elimination Method

Download Full-text

Transforming a time-domain electromagnetic signal to a frequency-domain electromagnetic response using Gauss elimination method

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/660/1/012059 ◽

2021 ◽

Vol 660 (1) ◽

pp. 012059

Author(s):

Cong Yang ◽

Xinxin Mao

Keyword(s):

Frequency Domain ◽

Time Domain ◽

Electromagnetic Response ◽

Electromagnetic Signal ◽

Elimination Method ◽

Time Domain Electromagnetic ◽

Gauss Elimination ◽

Gauss Elimination Method

Download Full-text

Optimization of K-Means Clustering on Graphics Processing Unit Using Compute Unified Device Architecture

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2017.6274 ◽

2017 ◽

Vol 14 (1) ◽

pp. 789-795

Author(s):

V Saveetha ◽

S Sophia

Keyword(s):

High Performance ◽

Programming Model ◽

Graphics Processing Unit ◽

Direct Access ◽

Communication Overhead ◽

Processing Unit ◽

Compute Unified Device Architecture ◽

Central Processing ◽

Device Architecture ◽

Graphics Processing

Parallel data clustering aims at using algorithms and methods to extract knowledge from fat databases in rational time using high performance architectures. The computational challenge faced by cluster analysis due to increasing capacity of data can be overcome by exploiting the power of these architectures. The recent development in parallel power of Graphics Processing Unit enables low cost high performance solutions for general purpose applications. The Compute Unified Device Architecture programming model provides application programming interface methods to handle data proficiently on Graphics Processing Unit for iterative clustering algorithms like K-Means. The existing Graphics Processing Unit based K-Means algorithms highly focus on improvising the speedup of the algorithms and fall short to handle the high time spent on transfer of data between the Central Processing Unit and Graphics Processing Unit. A competent K-Means algorithm is proposed in this paper to lessen the transfer time by introducing a novel approach to check the convergence of the algorithm and utilize the pinned memory for direct access. This algorithm outperforms the other algorithms by maximizing parallelism and utilizing the memory features. The relative speedups and the validity measure for the proposed algorithm is elevated when compared with K-Means on Graphics Processing Unit and K-Means using Flag on Graphics Processing Unit. Thus the planned approach proves that communication overhead can be reduced in K-Means clustering.

Download Full-text