Compute-unified device architecture implementation of a block-matching algorithm for multiple graphical processing unit cards

In this chapter, the aim is to discuss computational aspects of lattice-based cryptographic schemes focused on NTRU in view of the time complexity on a graphical processing unit (GPU). Polynomial multiplication algorithms, having a very important role in lattice-based cryptographic schemes, are implemented on the GPU using the compute unified device architecture (CUDA) platform. They are implemented in both serial and parallel way. Compact and efficient implementation architectures of polynomial multiplication for lattice-based cryptographic schemes are presented for the quotient ring both Zp [x]/(xn-1) and Zp [x]/(xn+1), where p is a prime number. Then, by using these implementations the NTRUEncrypt and signature scheme working over Zp [x]/(xn+1) are implemented on the GPU using CUDA platform. Implementation details are also discussed.

Download Full-text

Introduction to MOLFLOW+: New graphical processing unit-based Monte Carlo code for simulating molecular flows and for calculating angular coefficients in the compute unified device architecture environment

Journal of Vacuum Science & Technology A Vacuum Surfaces and Films ◽

10.1116/1.3153280 ◽

2009 ◽

Vol 27 (4) ◽

pp. 1017-1023 ◽

Cited By ~ 43

Author(s):

R. Kersevan ◽

J.-L. Pons

Keyword(s):

Monte Carlo ◽

Graphical Processing Unit ◽

Processing Unit ◽

Monte Carlo Code ◽

Compute Unified Device Architecture ◽

Device Architecture ◽

Graphical Processing

Download Full-text

Application of lattice Boltzmann methods for the multiphase fluid pipe flow on graphical processing unit

The Journal of Computational Multiphase Flows ◽

10.1177/1757482x17746922 ◽

2017 ◽

Vol 10 (3) ◽

pp. 109-118 ◽

Cited By ~ 1

Author(s):

Pengxin Cheng ◽

Nan Gui ◽

Xingtuan Yang ◽

JiyuanTu ◽

Shengyao Jiang

Keyword(s):

Lattice Boltzmann Method ◽

Pipe Flow ◽

Lattice Boltzmann ◽

Graphical Processing Unit ◽

Processing Unit ◽

Compute Unified Device Architecture ◽

Device Architecture ◽

Graphical Processing ◽

Multiphase Fluid ◽

Boltzmann Method

In this paper, we employ the lattice Boltzmann method implemented on compute unified device architecture-enabled graphical processing unit to investigate the multiphase fluid pipe flow. The basics of lattice Boltzmann method as well as the Shan–Chen multiphase model and the fundamentals of graphical processing unit with compute unified device architecture are thoroughly introduced. The procedure of implementation of lattice Boltzmann method on graphical processing unit and the comparison of the computing performance between graphical processing unit and CPU are presented. It is demonstrated that the graphical processing unit-based lattice Boltzmann method has remarkable advantages over CPU especially with selected appropriate parameters. The results of validation cases agree well with previous numerical results or analytical solutions. The vertical and horizontal multiphase pipe flow are simulated and discussed.

Download Full-text

Adaptive edge-based stereo block matching algorithm for a mobile Graphics Processing Unit

2017 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) ◽

10.23919/spa.2017.8166864 ◽

2017 ◽

Author(s):

Maciej Janeczek ◽

Piotr Skulimowski ◽

Mateusz Owczarek ◽

Pawel Strumillo

Keyword(s):

Graphics Processing Unit ◽

Block Matching ◽

Processing Unit ◽

Matching Algorithm ◽

Block Matching Algorithm ◽

Mobile Graphics ◽

Edge Based ◽

Graphics Processing

Download Full-text

GPU Algorithm for the Scaled Opposite-Spin (SOS) MP2 Energy Evaluation

Journal of the Mexican Chemical Society ◽

10.29356/jmcs.v61i1.129 ◽

2017 ◽

Vol 61 (1) ◽

Cited By ~ 1

Author(s):

Luis Ángel Martínez-Martínez ◽

Carlos Amador-Bedolla

Keyword(s):

Correlation Energy ◽

Energy Calculation ◽

Processing Unit ◽

Compute Unified Device Architecture ◽

Energy Evaluation ◽

Device Architecture ◽

Linear Alkanes ◽

Computationally Intensive ◽

Graphical Processing ◽

And Performance

<p>The most computationally intensive part of the SOS-MP2 algorithm for the calculation of the correlation energy [1], as executed in Q-Chem, is implemented for use in a graphical processing unit (GPU). Our approach adds new routines to the library initially developed by Aspuru-Guzik and co-workers [2], aiming at maximization of bandwidth and performance, by taking advantage of the asynchronous CPU-GPU communication capability of modern GPUs. These changes permit an almost six-fold acceleration in the correlation energy calculation of linear alkanes. This was achieved employing a NVIDIA Tesla K40C (Kepler) GPU and the Compute Unified Device Architecture (CUDA).</p>

Download Full-text

Wutai Mountain Mural Inpainting Based on Improved Block Matching Algorithm

Journal of Computer-Aided Design & Computer Graphics ◽

10.3724/sp.j.1089.2019.17102 ◽

2019 ◽

Vol 31 (1) ◽

pp. 118 ◽

Cited By ~ 1

Author(s):

Lijuan Jiao ◽

Wenjian Wang ◽

Bingjing Li ◽

Qingshan Zhao

Keyword(s):

Block Matching ◽

Matching Algorithm ◽

Block Matching Algorithm ◽

Wutai Mountain

Download Full-text

Adaptive multiple-candidate hierarchical search for block matching algorithm

Electronics Letters ◽

10.1049/el:19951116 ◽

1995 ◽

Vol 31 (19) ◽

pp. 1637-1639 ◽

Cited By ~ 16

Author(s):

Y.-L. Chan ◽

W.-C. Siu

Keyword(s):

Block Matching ◽

Matching Algorithm ◽

Block Matching Algorithm ◽

Hierarchical Search

Download Full-text

A new block-matching algorithm based on subspace and partial distance search techniques in the wavelet domain

IEEE Transactions on Consumer Electronics ◽

10.1109/30.681950 ◽

1998 ◽

Vol 44 (2) ◽

pp. 353-359 ◽

Cited By ~ 1

Author(s):

Wen-Jyi Hwang ◽

Chun-Ming Chang ◽

Yi-Chong Zeng

Keyword(s):

Block Matching ◽

Wavelet Domain ◽

Matching Algorithm ◽

Search Techniques ◽

Block Matching Algorithm

Download Full-text

Mixed-mode database miner classifier: Parallel computation of graphical processing unit mining

International Journal of Electrical Engineering Education ◽

10.1177/0020720920988494 ◽

2021 ◽

pp. 002072092098849

Author(s):

Soumya Ranjan Nayak ◽

S Sivakumar ◽

Akash Kumar Bhoi ◽

Gyoo-Soo Chae ◽

Pradeep Kumar Mallick

Keyword(s):

Credit Card ◽

Mixed Mode ◽

Processing Time ◽

Gpu Computing ◽

Graphical Processing Unit ◽

Computational Time ◽

Processing Unit ◽

Large Set ◽

Minimal Processing ◽

Graphical Processing

Graphical processing unit (GPU) has gained more popularity among researchers in the field of decision making and knowledge discovery systems. However, most of the earlier studies have GPU memory utilization, computational time, and accuracy limitations. The main contribution of this paper is to present a novel algorithm called the Mixed Mode Database Miner (MMDBM) classifier by implementing multithreading concepts on a large number of attributes. The proposed method use the quick sort algorithm in GPU parallel computing to overcome the state of the art limitations. This method applies the dynamic rule generation approach for constructing the decision tree based on the predicted rules. Moreover, the implementation results are compared with both SLIQ and MMDBM using Java and GPU with the computed acceleration ratio time using the BP dataset. The primary objective of this work is to improve the performance with less processing time. The results are also analyzed using various threads in GPU mining using eight different datasets of UCI Machine learning repository. The proposed MMDBM algorithm have been validated on these chosen eight different dataset with accuracy of 91.3% in diabetes, 89.1% in breast cancer, 96.6% in iris, 89.9% in labor, 95.4% in vote, 89.5% in credit card, 78.7% in supermarket and 78.7% in BP, and simultaneously, it also takes less computational time for given datasets. The outcome of this work will be beneficial for the research community to develop more effective multi thread based GPU solution in GPU mining to handle large set of data in minimal processing time. Therefore, this can be considered a more reliable and precise method for GPU computing.

Download Full-text