Accelerating Spike-by-Spike Neural Networks on FPGA with Hybrid Custom Floating-Point and Logarithmic Dot-Product Approximation

Parallel Hardware for Artificial Neural Networks Using Fixed Floating Point Representation

2011 ◽

pp. 295-308

Author(s):

Nadia Nedjah ◽

Rodrigo Martins da Silva ◽

Luiza de Macedo Mourelle

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Activation Function ◽

Floating Point ◽

Weighted Sum ◽

Design Environment ◽

Silicon Area ◽

Point Representation ◽

Artificial Neural ◽

Mathematical Computation

Artificial Neural Networks (ANNs) are a well-known bio-inspired model that simulates human brain capabilities such as learning and generalization. ANNs consist of a number of interconnected processing units, each of which performs a weighted sum followed by the evaluation of a given activation function. This computation has a large impact on implementation efficiency. Existing hardware implementations of ANNs attempt to speed up the computation, but they require a silicon area so large that it is almost impossible to fit them within the resources available on state-of-the-art FPGAs. In this chapter, a hardware architecture for ANNs is devised that takes advantage of the dedicated multiply-accumulate blocks, commonly called MACs, to compute both the weighted sum and the activation function. The proposed architecture requires a reduced silicon area, since the MACs come for free as built-in cores of the FPGA. The system uses integer (fixed-point) mathematics and operates with fractions to represent real numbers. Hence, floating-point representation is not employed, and every mathematical computation in the ANN hardware is based on combinational circuitry that performs only sums and multiplications. The hardware is fast because it is massively parallel. In addition, the proposed architecture can adjust itself on the fly to the user-defined configuration of the neural network: the number of layers and the number of neurons per layer can be set with no extra hardware changes. This is a useful property in robot-like systems, since the same hardware can be reused for different tasks. The hardware also requires a software component that controls the sequence of the hardware computation and provides the inputs, weights and biases for the ANN in hardware. Thus, a co-design environment is necessary.
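
As a rough illustration of the fixed-point weighted sum described in this abstract, the following Python sketch emulates one neuron using integer arithmetic only. The Q-format width (FRAC_BITS), the function names, and the clamped-linear activation are all assumptions made for illustration; the abstract does not give the chapter's actual number format or activation circuit.

```python
# Hypothetical fixed-point neuron sketch; not the chapter's actual design.
FRAC_BITS = 8            # assumed number of fractional bits (Q-format)
SCALE = 1 << FRAC_BITS   # integer value that represents 1.0


def to_fixed(x: float) -> int:
    """Convert a real number to its assumed fixed-point integer encoding."""
    return int(round(x * SCALE))


def to_float(q: int) -> float:
    """Convert a fixed-point integer back to a real number."""
    return q / SCALE


def weighted_sum_fixed(inputs, weights, bias):
    """Accumulate sum(x * w) + bias with integer multiply-accumulate steps."""
    # Each product carries 2 * FRAC_BITS fractional bits, so the bias is
    # shifted up to the same scale before accumulation.
    acc = bias << FRAC_BITS
    for x, w in zip(inputs, weights):
        acc += x * w  # one MAC step
    return acc >> FRAC_BITS  # rescale to FRAC_BITS fractional bits


def activation_fixed(q: int) -> int:
    """Stand-in activation: clamp(0.5 + q / 4, 0, 1) using shifts and adds."""
    y = (SCALE >> 1) + (q >> 2)
    return max(0, min(SCALE, y))


def neuron_fixed(inputs, weights, bias):
    return activation_fixed(weighted_sum_fixed(inputs, weights, bias))


xs = [to_fixed(v) for v in (0.5, -0.25)]
ws = [to_fixed(v) for v in (1.0, 2.0)]
b = to_fixed(0.25)
print(to_float(weighted_sum_fixed(xs, ws, b)))  # weighted sum as a real number
print(to_float(neuron_fixed(xs, ws, b)))        # neuron output as a real number
```

Keeping the accumulator at the wider product scale until the final shift avoids discarding fractional bits after every multiplication, which mirrors the role a wide hardware accumulator would play.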

Reverse-Engineering Deep Neural Networks Using Floating-Point Timing Side-Channels

2020 57th ACM/IEEE Design Automation Conference (DAC) ◽

10.1109/dac18072.2020.9218707 ◽

2020 ◽

Cited By ~ 1

Author(s):

Cheng Gongye ◽

Yunsi Fei ◽

Thomas Wahl

Keyword(s):

Neural Networks ◽

Reverse Engineering ◽

Deep Neural Networks ◽

Floating Point ◽

Side Channels

A Mathematical Approach Towards Quantization of Floating Point Weights in Low Power Neural Networks

2020 33rd International Conference on VLSI Design and 2020 19th International Conference on Embedded Systems (VLSID) ◽

10.1109/vlsid49098.2020.00048 ◽

2020 ◽

Author(s):

Joydeep Kumar Devnath ◽

Neelam Surana ◽

Joycee Mekie

Keyword(s):

Neural Networks ◽

Low Power ◽

Floating Point ◽

Mathematical Approach

Optimized Fused Floating-Point Many-Term Dot-Product Hardware for Machine Learning Accelerators

2019 IEEE 26th Symposium on Computer Arithmetic (ARITH) ◽

10.1109/arith.2019.00021 ◽

2019 ◽

Cited By ~ 1

Author(s):

Himanshu Kaul ◽

Mark Anders ◽

Sanu Mathew ◽

Seongjong Kim ◽

Ram Krishnamurthy

Keyword(s):

Machine Learning ◽

Floating Point ◽

Dot Product

An FPGA-based floating-point processor array supporting a high-precision dot product

2006 IEEE International Conference on Field Programmable Technology ◽

10.1109/fpt.2006.270337 ◽

2006 ◽

Cited By ~ 3

Author(s):

Fritz Mayer-Lindenberg ◽

Valerij Beller

Keyword(s):

High Precision ◽

Floating Point ◽

Processor Array ◽

Dot Product

Improved Architectures for a Floating-Point Fused Dot Product Unit

2013 IEEE 21st Symposium on Computer Arithmetic ◽

10.1109/arith.2013.26 ◽

2013 ◽

Cited By ~ 4

Author(s):

Jongwook Sohn ◽

E. E. Swartzlander

Keyword(s):

Floating Point ◽

Dot Product

A high speed floating point dot product unit

2014 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT) ◽

10.1109/icicict.2014.6781299 ◽

2014 ◽

Cited By ~ 2

Author(s):

Akash Kumar Gupta ◽

Birendra Biswal

Keyword(s):

High Speed ◽

Floating Point ◽

Dot Product

Design and Implementation of Perceptron Neuron in Machine Learning for Handwritten Character Recognition

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.h6680.0891020 ◽

2020 ◽

Vol 9 (10) ◽

pp. 357-363

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Feature Extraction ◽

Field Programmable Gate Arrays ◽

Fpga Implementation ◽

Floating Point ◽

Neuron Network ◽

Gate Arrays ◽

Field Programmable ◽

Programmable Gate Arrays

Due to the exponential increase in electronic devices connected to the Internet, the amount of data they produce has grown to the same extent. To process these data, automatic learning algorithms, also known as machine learning, have become widespread. The most popular of these are neural networks. These algorithms need a great deal of resources to compute all their operations, and for that reason they have traditionally been implemented in application-specific integrated circuits. Recently, however, there has been a boom in implementations on field programmable gate arrays (FPGAs), which allow greater parallelism in the implementation of the algorithms. This paper proposes an FPGA-based implementation of a feature extraction method, applied to offline handwritten digit recognition. Classification relies on a simple two-layer multilayer perceptron (MLP). The feature extraction approach is well suited to FPGA execution because it relies on addition and subtraction operations. Features are extracted by the proposed method from handwritten digit images taken from a standard database and normalized to 40×40 pixels. Experimental results show that the proposed system achieves 85% accuracy. Overall, compared to other systems, it is simpler and more accurate. The project also describes the FPGA implementation of an IEEE-754 single-precision floating-point MAC unit, which is used to feed the weighted inputs of the neurons in artificial neural networks. Using floating-point numbers extends the representable range from very small to very large values, which is highly recommended for artificial neural networks.
The code is developed in HDL, and simulation and synthesis results are obtained using Xilinx synthesis tools. To validate the computational accuracy of the FFT, a MATLAB validation script is used to verify the HDL output against a standard reference model.
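
To make the single-precision MAC mentioned in this abstract more concrete, the Python sketch below emulates a multiply-accumulate step that rounds to IEEE-754 binary32 after each operation. This is only a software model under stated assumptions: the function names are hypothetical, the MAC is unfused (the product is rounded before the addition), and nothing here reflects the paper's actual HDL.

```python
import struct


def to_f32(x: float) -> float:
    """Round a Python float (binary64) to the nearest IEEE-754 binary32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def mac_f32(acc: float, a: float, b: float) -> float:
    """Unfused single-precision MAC: round the product, then round the sum."""
    product = to_f32(to_f32(a) * to_f32(b))
    return to_f32(to_f32(acc) + product)


def weighted_sum_f32(inputs, weights, bias=0.0):
    """Weighted sum of a neuron's inputs, accumulated one MAC step at a time."""
    acc = to_f32(bias)
    for x, w in zip(inputs, weights):
        acc = mac_f32(acc, x, w)
    return acc


def f32_fields(x: float):
    """Split a binary32 value into its (sign, biased exponent, fraction) fields."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    return bits >> 31, (bits >> 23) & 0xFF, bits & 0x7FFFFF


print(weighted_sum_f32([1.0, 2.0, 3.0], [0.5, 0.25, 0.125]))
print(f32_fields(1.0))
```

The field decoder shows the 1-bit sign, 8-bit biased exponent, and 23-bit fraction of the single-precision format, which is where the wide dynamic range mentioned in the abstract comes from.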

Static Block Floating-Point Quantization for Convolutional Neural Networks on FPGA

2019 International Conference on Field-Programmable Technology (ICFPT) ◽

10.1109/icfpt47387.2019.00012 ◽

2019 ◽

Cited By ~ 1

Author(s):

Hongxiang Fan ◽

Gang Wang ◽

Martin Ferianc ◽

Xinyu Niu ◽

Wayne Luk

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Floating Point

A Fused Floating-Point Four-Term Dot Product Unit

IEEE Transactions on Circuits and Systems I Regular Papers ◽

10.1109/tcsi.2016.2525042 ◽

2016 ◽

Vol 63 (3) ◽

pp. 370-378 ◽

Cited By ~ 6

Author(s):

Jongwook Sohn ◽

Earl E. Swartzlander

Keyword(s):

Floating Point ◽

Dot Product
