Performance comparison of software and FPGA implementation of computationally intensive algorithms

Author(s):  
Laxmikant Bordekar ◽  
Gajanan S. Gawde
2015 ◽  
Vol 87 ◽  
pp. 47-60 ◽  
Author(s):  
Alexandru-Ciprian Zăvoianu ◽  
Edwin Lughofer ◽  
Werner Koppelstätter ◽  
Günther Weidenholzer ◽  
Wolfgang Amrhein ◽  
...  

2019 ◽  
Vol 9 (10) ◽  
pp. 1991 ◽  
Author(s):  
Bo Peng ◽  
Shasha Luo ◽  
Zhengqiu Xu ◽  
Jingfeng Jiang

Now, with the availability of 3-D ultrasound data, a lot of research efforts are being devoted to developing 3-D ultrasound strain elastography (USE) systems. Because 3-D motion tracking, a core component in any 3-D USE system, is computationally intensive, a lot of efforts are under way to accelerate 3-D motion tracking. In the literature, the concept of Sum-Table has been used in a serial computing environment to reduce the burden of computing signal correlation, which is the single most computationally intensive component in 3-D motion tracking. In this study, parallel programming using graphics processing units (GPU) is used in conjunction with the concept of Sum-Table to improve the computational efficiency of 3-D motion tracking. To our knowledge, sum-tables have not been used in a GPU environment for 3-D motion tracking. Our main objective here is to investigate the feasibility of using sum-table-based normalized correlation coefficient (ST-NCC) method for the above-mentioned GPU-accelerated 3-D USE. More specifically, two different implementations of ST-NCC methods proposed by Lewis et al. and Luo-Konofagou are compared against each other. During the performance comparison, the conventional method for calculating the normalized correlation coefficient (NCC) was used as the baseline. All three methods were implemented using compute unified device architecture (CUDA; Version 9.0, Nvidia Inc., CA, USA) and tested on a professional GeForce GTX TITAN X card (Nvidia Inc., CA, USA). Using 3-D ultrasound data acquired during a tissue-mimicking phantom experiment, both displacement tracking accuracy and computational efficiency were evaluated for the above-mentioned three different methods. Based on data investigated, we found that under the GPU platform, Lou-Konofaguo method can still improve the computational efficiency (17–46%), as compared to the classic NCC method implemented into the same GPU platform. However, the Lewis method does not improve the computational efficiency in some configuration or improves the computational efficiency at a lower rate (7–23%) under the GPU parallel computing environment. Comparable displacement tracking accuracy was obtained by both methods.


2014 ◽  
Vol E97.C (7) ◽  
pp. 697-706 ◽  
Author(s):  
Ryota TAKASU ◽  
Yoichi TOMIOKA ◽  
Yutaro ISHIGAKI ◽  
Ning LI ◽  
Tsugimichi SHIBATA ◽  
...  

2021 ◽  
Vol 35 (5) ◽  
pp. 431-435
Author(s):  
Vijayakumar Ponnusamy ◽  
Diwakar R. Marur ◽  
Deepa Dhanaskodi ◽  
Thangavel Palaniappan

This work proposes deep learning neural network-based X-ray image classification. The X-ray baggage scanning machinery plays an essential role in the safeguard of customs, airports, and other systematically very important landmarks and infrastructures. The technology at present of baggage scanning machines is designed on X-ray attenuation. The detection of threatful objects is built on how different objects attenuate the X-ray beams going through them. In this paper, the deep convolutional neural network of YOLO is utilized in classifying baggage images. Real-time performance of the baggage image classification is an essential one for security scanning. There are many computationally intensive operations in the You Only Look Once (YOLO) architecture. The computational intensive operations are implemented in the Field Programmable Gate Array (FPGA) platform to optimize process delays. The critical issues involved in those implementations include data representation, inner products computation and implementation of activation function and resolving these issues will also be a significant task. The FPGA implementation results show that with less resource occupancy, the YOLO implementation provides maximum accuracy of 98.9% in classifying X-ray baggage images and identifying hazardous materials. This result proves that the proposed implementation is best suited for practical system deployments for real-time Baggage scanning.


Sign in / Sign up

Export Citation Format

Share Document