Performance comparison of software and FPGA implementation of computationally intensive algorithms

With 3-D ultrasound data now available, considerable research effort is being devoted to developing 3-D ultrasound strain elastography (USE) systems. Because 3-D motion tracking, a core component of any 3-D USE system, is computationally intensive, much of that effort aims to accelerate it. In the literature, the sum-table concept has been used in a serial computing environment to reduce the cost of computing signal correlation, which is the single most computationally intensive component of 3-D motion tracking. In this study, parallel programming on graphics processing units (GPUs) is combined with sum-tables to improve the computational efficiency of 3-D motion tracking. To our knowledge, sum-tables have not previously been used in a GPU environment for 3-D motion tracking. Our main objective is to investigate the feasibility of using the sum-table-based normalized correlation coefficient (ST-NCC) method for GPU-accelerated 3-D USE. More specifically, two implementations of the ST-NCC method, proposed by Lewis et al. and by Luo and Konofagou, are compared against each other, with the conventional method for calculating the normalized correlation coefficient (NCC) serving as the baseline. All three methods were implemented using the Compute Unified Device Architecture (CUDA; Version 9.0, Nvidia Inc., CA, USA) and tested on a professional GeForce GTX TITAN X card (Nvidia Inc., CA, USA). Using 3-D ultrasound data acquired during a tissue-mimicking phantom experiment, both displacement tracking accuracy and computational efficiency were evaluated for the three methods. Based on the data investigated, we found that on the GPU platform the Luo-Konofagou method still improves computational efficiency (by 17–46%) compared with the classic NCC method implemented on the same GPU platform. The Lewis method, however, either does not improve computational efficiency in some configurations or improves it at a lower rate (7–23%) in the GPU parallel computing environment. Comparable displacement tracking accuracy was obtained by both methods.
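The sum-table idea mentioned in the abstract can be illustrated with a minimal serial sketch. This is not the authors' CUDA implementation; it only shows, under the usual summed-area-table definition, how a precomputed 3-D table lets the sum over any rectangular window be read with eight lookups instead of a loop over the window. The function names `build_sum_table` and `window_sum` are hypothetical. In an NCC computation, tables of this kind would typically be built for the signal and its square so that the energy terms in the denominator are cheap to evaluate for every search position.

```python
def build_sum_table(vol):
    """Return table S where S[z][y][x] is the sum of vol[:z][:y][:x].

    The table is padded with one layer of zeros on each axis so that
    window sums need no boundary checks.
    """
    nz, ny, nx = len(vol), len(vol[0]), len(vol[0][0])
    S = [[[0] * (nx + 1) for _ in range(ny + 1)] for _ in range(nz + 1)]
    for z in range(nz):
        for y in range(ny):
            for x in range(nx):
                # 3-D inclusion-exclusion recurrence
                S[z + 1][y + 1][x + 1] = (
                    vol[z][y][x]
                    + S[z][y + 1][x + 1] + S[z + 1][y][x + 1] + S[z + 1][y + 1][x]
                    - S[z][y][x + 1] - S[z][y + 1][x] - S[z + 1][y][x]
                    + S[z][y][x]
                )
    return S


def window_sum(S, z0, y0, x0, z1, y1, x1):
    """Sum of vol over the half-open box [z0, z1) x [y0, y1) x [x0, x1)."""
    return (
        S[z1][y1][x1]
        - S[z0][y1][x1] - S[z1][y0][x1] - S[z1][y1][x0]
        + S[z0][y0][x1] + S[z0][y1][x0] + S[z1][y0][x0]
        - S[z0][y0][x0]
    )
```

Once the table is built, each window sum costs the same regardless of window size, which is the source of the savings in a serial setting. Whether the same trade-off pays off on a GPU is the question the study examines.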

A performance comparison study of programmable platforms: FPAA and FPGA implementation of COOK communication system

2017 European Conference on Circuit Theory and Design (ECCTD) ◽

10.1109/ecctd.2017.8093237 ◽

2017 ◽

Author(s):

Enis Gunay ◽

Kenan Altun

Keyword(s):

Communication System ◽

Performance Comparison ◽

FPGA Implementation ◽

Comparison Study ◽

A Performance

Performance Comparison of Binary LDPC Decoders and FPGA Implementation of Encoder

Bonfring International Journal of Research in Communication Engineering ◽

10.9756/bijrce.8203 ◽

2016 ◽

Vol 6 (Special Issue) ◽

pp. 65-70

Author(s):

Jayashree C. Nidagundi ◽

Dr. Siddarama R. Patil

Keyword(s):

Performance Comparison ◽

FPGA Implementation

An FPGA Implementation of the Two-Dimensional FDTD Method and Its Performance Comparison with GPGPU

IEICE Transactions on Electronics ◽

10.1587/transele.e97.c.697 ◽

2014 ◽

Vol E97.C (7) ◽

pp. 697-706 ◽

Cited By ~ 4

Author(s):

Ryota TAKASU ◽

Yoichi TOMIOKA ◽

Yutaro ISHIGAKI ◽

Ning LI ◽

Tsugimichi SHIBATA ◽

...

Keyword(s):

FDTD Method ◽

Performance Comparison ◽

FPGA Implementation ◽

Two Dimensional

Performance Comparison of Finite Field Multipliers for SM2 Algorithm based on FPGA Implementation

2020 IEEE 14th International Conference on Anti-counterfeiting, Security, and Identification (ASID) ◽

10.1109/asid50160.2020.9271714 ◽

2020 ◽

Author(s):

Munkhbaatar Chinbat ◽

Liji Wu ◽

Altantsooj Batsukh ◽

Uyangaa Khuchit ◽

Xiangmin Zhang ◽

...

Keyword(s):

Finite Field ◽

Performance Comparison ◽

FPGA Implementation

Deep Learning-Based X-Ray Baggage Hazardous Object Detection – An FPGA Implementation

Revue d'Intelligence Artificielle ◽

10.18280/ria.350510 ◽

2021 ◽

Vol 35 (5) ◽

pp. 431-435

Author(s):

Vijayakumar Ponnusamy ◽

Diwakar R. Marur ◽

Deepa Dhanaskodi ◽

Thangavel Palaniappan

Keyword(s):

Neural Network ◽

Deep Learning ◽

Image Classification ◽

Real Time ◽

Activation Function ◽

Data Representation ◽

FPGA Implementation ◽

X Ray ◽

Computationally Intensive ◽

Deep Learning Neural Network

This work proposes X-ray image classification based on a deep learning neural network. X-ray baggage scanners play an essential role in safeguarding customs, airports, and other critical landmarks and infrastructure. Current baggage scanning technology is based on X-ray attenuation: threatening objects are detected from how different objects attenuate the X-ray beams passing through them. In this paper, the deep convolutional neural network YOLO (You Only Look Once) is used to classify baggage images. Real-time classification of baggage images is essential for security scanning, but the YOLO architecture contains many computationally intensive operations. These operations are implemented on a Field Programmable Gate Array (FPGA) platform to reduce processing delay. The critical issues in such an implementation include data representation, inner-product computation, and implementation of the activation function, and resolving them is itself a significant task. The FPGA implementation results show that, with low resource occupancy, the YOLO implementation achieves a maximum accuracy of 98.9% in classifying X-ray baggage images and identifying hazardous materials. This result suggests that the proposed implementation is well suited to practical deployment for real-time baggage scanning.
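The three implementation issues named in the abstract (data representation, inner-product computation, and the activation function) can be made concrete with a small software model. The sketch below is not the paper's design. It assumes a fixed-point format with 8 fractional bits and a leaky ReLU with slope 1/8 implemented as a shift, both of which are illustrative choices, and all names (`FRAC_BITS`, `to_fixed`, `fixed_dot`, `leaky_relu_fixed`) are hypothetical.

```python
FRAC_BITS = 8  # assumed number of fractional bits (illustrative choice)


def to_fixed(x):
    """Quantize a real value to a fixed-point integer."""
    return int(round(x * (1 << FRAC_BITS)))


def to_float(q):
    """Convert a fixed-point integer back to a real value."""
    return q / (1 << FRAC_BITS)


def fixed_dot(a_q, b_q):
    """Integer multiply-accumulate, as a hardware MAC unit would do it.

    Each product carries 2 * FRAC_BITS fractional bits, so the
    accumulated sum is shifted right once at the end to rescale it.
    """
    acc = 0
    for a, b in zip(a_q, b_q):
        acc += a * b
    return acc >> FRAC_BITS


def leaky_relu_fixed(q):
    """Leaky ReLU with an assumed slope of 1/8, using a shift, not a multiply."""
    return q if q >= 0 else q >> 3
```

For example, `leaky_relu_fixed(fixed_dot(weights_q, inputs_q))` models one neuron's output using only integer adds, multiplies, and shifts, which is the kind of arithmetic that maps directly onto FPGA resources.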

FPGA Implementation of Encoder for (15, k) Binary BCH Code Using VHDL and Performance Comparison for Multiple Error Correction Control

2012 International Conference on Communication Systems and Network Technologies ◽

10.1109/csnt.2012.170 ◽

2012 ◽

Cited By ~ 7

Author(s):

Amit Kumar Panda ◽

Shahbaz Sarik ◽

Abhishek Awasthi

Keyword(s):

Error Correction ◽

Performance Comparison ◽

FPGA Implementation ◽

BCH Code ◽

And Performance ◽

Multiple Error

Performance comparison of conventional and backtracking algorithms in circuit routing

IEE Proceedings G (Electronic Circuits and Systems) ◽

10.1049/ip-g-1.1980.0051 ◽

1980 ◽

Vol 127 (6) ◽

pp. 309

Author(s):

D.J. Kinniment

Keyword(s):

Performance Comparison

Performance Comparison of Finite Field Adders for SM2 Algorithm Based on FPGA Implementation

Performance comparison of generational and steady-state asynchronous multi-objective evolutionary algorithms for computationally-intensive problems

Accelerating 3-D GPU-based Motion Tracking for Ultrasound Strain Elastography Using Sum-Tables: Analysis and Initial Results