Integrated photonic FFT for photonic tensor operations towards efficient and high-speed neural networks

AbstractThe technologically-relevant task of feature extraction from data performed in deep-learning systems is routinely accomplished as repeated fast Fourier transforms (FFT) electronically in prevalent domain-specific architectures such as in graphics processing units (GPU). However, electronics systems are limited with respect to power dissipation and delay, due to wire-charging challenges related to interconnect capacitance. Here we present a silicon photonics-based architecture for convolutional neural networks that harnesses the phase property of light to perform FFTs efficiently by executing the convolution as a multiplication in the Fourier-domain. The algorithmic executing time is determined by the time-of-flight of the signal through this photonic reconfigurable passive FFT ‘filter’ circuit and is on the order of 10’s of picosecond short. A sensitivity analysis shows that this optical processor must be thermally phase stabilized corresponding to a few degrees. Furthermore, we find that for a small sample number, the obtainable number of convolutions per {time, power, and chip area) outperforms GPUs by about two orders of magnitude. Lastly, we show that, conceptually, the optical FFT and convolution-processing performance is indeed directly linked to optoelectronic device-level, and improvements in plasmonics, metamaterials or nanophotonics are fueling next generation densely interconnected intelligent photonic circuits with relevance for edge-computing 5G networks by processing tensor operations optically.

Download Full-text

Configurable Texture Unit for Convolutional Neural Networks on Graphics Processing Units

2019 IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS) ◽

10.1109/aicas.2019.8771629 ◽

2019 ◽

Author(s):

Yi-Hsiang Chen ◽

Shao-Yi Chien

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Graphics Processing Units ◽

Graphics Processing

Download Full-text

End-to-End High Speed Forward Error Correction Using Graphics Processing Units

Lecture Notes in Electrical Engineering - Mobile, Ubiquitous, and Intelligent Computing ◽

10.1007/978-3-642-40675-1_8 ◽

2014 ◽

pp. 47-53

Author(s):

Md Shohidul Islam ◽

Jong-Myon Kim

Keyword(s):

Error Correction ◽

Graphics Processing Units ◽

High Speed ◽

Forward Error Correction ◽

End To End ◽

Forward Error ◽

Graphics Processing

Download Full-text

The VOLNA-OP2 tsunami code (version 1.5)

Geoscientific Model Development ◽

10.5194/gmd-11-4621-2018 ◽

2018 ◽

Vol 11 (11) ◽

pp. 4621-4635 ◽

Cited By ~ 7

Author(s):

Istvan Z. Reguly ◽

Daniel Giles ◽

Devaraj Gopinathan ◽

Laure Quivy ◽

Joakim H. Beck ◽

...

Keyword(s):

Graphics Processing Units ◽

High Performance ◽

Shallow Water Equation ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Central Processing ◽

Domain Specific ◽

Computing Platforms ◽

Graphics Processing ◽

Intel Xeon

Abstract. In this paper, we present the VOLNA-OP2 tsunami model and implementation; a finite-volume non-linear shallow-water equation (NSWE) solver built on the OP2 domain-specific language (DSL) for unstructured mesh computations. VOLNA-OP2 is unique among tsunami solvers in its support for several high-performance computing platforms: central processing units (CPUs), the Intel Xeon Phi, and graphics processing units (GPUs). This is achieved in a way that the scientific code is kept separate from various parallel implementations, enabling easy maintainability. It has already been used in production for several years; here we discuss how it can be integrated into various workflows, such as a statistical emulator. The scalability of the code is demonstrated on three supercomputers, built with classical Xeon CPUs, the Intel Xeon Phi, and NVIDIA P100 GPUs. VOLNA-OP2 shows an ability to deliver productivity as well as performance and portability to its users across a number of platforms.

Download Full-text

Lance: efficient low-precision quantized winograd convolution for neural networks based on graphics processing units

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp40776.2020.9054562 ◽

2020 ◽

Author(s):

Guangli Li ◽

Lei Liu ◽

Xueying Wang ◽

Xiu Ma ◽

Xiaobing Feng

Keyword(s):

Neural Networks ◽

Graphics Processing Units ◽

Graphics Processing

Download Full-text

High-Speed Nonlinear Finite Element Analysis for Surgical Simulation Using Graphics Processing Units

IEEE Transactions on Medical Imaging ◽

10.1109/tmi.2007.913112 ◽

2008 ◽

Vol 27 (5) ◽

pp. 650-663 ◽

Cited By ~ 109

Author(s):

Z.A. Taylor ◽

M. Cheng ◽

S. Ourselin

Keyword(s):

Finite Element Analysis ◽

Finite Element ◽

Graphics Processing Units ◽

High Speed ◽

Surgical Simulation ◽

Nonlinear Finite Element Analysis ◽

Nonlinear Finite Element ◽

Element Analysis ◽

Graphics Processing

Download Full-text

Parallel computing with graphics processing units for high-speed Monte Carlo simulation of photon migration

Journal of Biomedical Optics ◽

10.1117/1.3041496 ◽

2008 ◽

Vol 13 (6) ◽

pp. 060504 ◽

Cited By ~ 232

Author(s):

Erik Alerstam ◽

Tomas Svensson ◽

Stefan Andersson-Engels

Keyword(s):

Monte Carlo Simulation ◽

Monte Carlo ◽

Parallel Computing ◽

Graphics Processing Units ◽

High Speed ◽

Photon Migration ◽

Graphics Processing

Download Full-text

Specific Radar Recognition Based on Characteristics of Emitted Radio Waveforms Using Convolutional Neural Networks

Sensors ◽

10.3390/s21248237 ◽

2021 ◽

Vol 21 (24) ◽

pp. 8237

Author(s):

Jan Matuszewski ◽

Dymitr Pietrow

Keyword(s):

Neural Networks ◽

Graphics Processing Units ◽

Signal Generator ◽

Digital Data ◽

Simulation Environment ◽

Noisy Environment ◽

Electronic Warfare ◽

Radar Signals ◽

Graphics Processing ◽

Near Future

With the increasing complexity of the electromagnetic environment and continuous development of radar technology we can expect a large number of modern radars using agile waveforms to appear on the battlefield in the near future. Effectively identifying these radar signals in electronic warfare systems only by relying on traditional recognition models poses a serious challenge. In response to the above problem, this paper proposes a recognition method of emitted radar signals with agile waveforms based on the convolutional neural network (CNN). These signals are measured in the electronic recognition receivers and processed into digital data, after which they undergo recognition. The implementation of this system is presented in a simulation environment with the help of a signal generator that has the ability to make changes in signal signatures earlier recognized and written in the emitter database. This article contains a description of the software’s components, learning subsystem and signal generator. The problem of teaching neural networks with the use of the graphics processing units and the way of choosing the learning coefficients are also outlined. The correctness of the CNN operation was tested using a simulation environment that verified the operation’s effectiveness in a noisy environment and in conditions where many radar signals that interfere with each other are present. The effectiveness results of the applied solutions and the possibilities of developing the method of learning and processing algorithms are presented by means of tables and appropriate figures. The experimental results demonstrate that the proposed method can effectively solve the problem of recognizing raw radar signals with agile time waveforms, and achieve correct probability of recognition at the level of 92–99%.

Download Full-text

IA Algorithm Acceleration Using GPUs

Encyclopedia of Artificial Intelligence ◽

10.4018/978-1-59904-849-9.ch129 ◽

2011 ◽

pp. 873-878 ◽

Cited By ~ 1

Author(s):

Antonio Seoane ◽

Alberto Jaspe

Keyword(s):

Artificial Intelligence ◽

Neural Networks ◽

Artificial Neural Networks ◽

Graphics Processing Units ◽

High Performance ◽

General Purpose ◽

Algorithm Acceleration ◽

Database Operations ◽

Graphics Processing ◽

Programmable Processors

Graphics Processing Units (GPUs) have been evolving very fast, turning into high performance programmable processors. Though GPUs have been designed to compute graphics algorithms, their power and flexibility makes them a very attractive platform for generalpurpose computing. In the last years they have been used to accelerate calculations in physics, computer vision, artificial intelligence, database operations, etc. (Owens, 2007). In this paper an approach to general purpose computing with GPUs is made, followed by a description of artificial intelligence algorithms based on Artificial Neural Networks (ANN) and Evolutionary Computation (EC) accelerated using GPU.

Download Full-text

Deep Learning With PyTorch

Machine Learning and Deep Learning in Real-Time Applications - Advances in Computer and Electrical Engineering ◽

10.4018/978-1-7998-3095-5.ch003 ◽

2020 ◽

pp. 61-95

Author(s):

Anmol Chaudhary ◽

Kuldeep Singh Chouhan ◽

Jyoti Gajrani ◽

Bhavna Sharma

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Machine Translation ◽

Exponential Growth ◽

Graphics Processing Units ◽

New Technologies ◽

The Internet ◽

Computational Power ◽

Practical Applications ◽

Graphics Processing

In the last decade, deep learning has seen exponential growth due to rise in computational power as a result of graphics processing units (GPUs) and a large amount of data due to the democratization of the internet and smartphones. This chapter aims to throw light on both the theoretical aspects of deep learning and its practical aspects using PyTorch. The chapter primarily discusses new technologies using deep learning and PyTorch in detail. The chapter discusses the advantages of using PyTorch compared to other deep learning libraries. The chapter discusses some of the practical applications like image classification and machine translation. The chapter also discusses the various frameworks built with the help of PyTorch. PyTorch consists of various models that increases its flexibility and accessibility to a greater extent. As a result, many frameworks built on top of PyTorch are discussed in this chapter. The authors believe that this chapter will help readers in getting a better understanding of deep learning making neural networks using PyTorch.

Download Full-text

A GPU-accelerated image reduction pipeline

Publications of the Astronomical Society of Japan ◽

10.1093/pasj/psaa091 ◽

2020 ◽

Author(s):

Masafumi Niwano ◽

Katsuhiro L Murata ◽

Ryo Adachi ◽

Sili Wang ◽

Yutaro Tachibana ◽

...

Keyword(s):

Image Processing ◽

Graphics Processing Units ◽

High Speed ◽

Emission Measure ◽

Robotic Telescope ◽

Graphics Processing ◽

High Speed Image Processing ◽

Python Package ◽

Telescope System

Abstract We developed a high-speed image reduction pipeline using Graphics Processing Units (GPUs) as hardware accelerators. Astronomers desire to detect the emission measure counterpart of gravitational-wave sources as soon as possible and to share in the systematic follow-up observation. Therefore, high-speed image processing is important. We developed a new image-reduction pipeline for our robotic telescope system, which uses a GPU via the Python package CuPy for high-speed image processing. As a result, the new pipeline has increased in processing speed by more than 40 times compared with the current one, while maintaining the same functions.

Download Full-text