Systolic Array
Recently Published Documents

TOTAL DOCUMENTS: 931 (FIVE YEARS: 78)
H-INDEX: 30 (FIVE YEARS: 3)

2021
Author(s): Geonhwa Jeong, Eric Qin, Ananda Samajdar, Christopher J. Hughes, Sreenivas Subramoney, ...

2021
Author(s): Mehdi Safarpour, Reza Inanlou, Olli Silven

An energy-efficient architecture for TPUs based on reduced-voltage operation. Errors introduced by low-voltage operation are captured and corrected using algorithm-based fault tolerance (ABFT), which makes aggressive voltage scaling possible.
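The detection scheme the abstract relies on is classical ABFT for matrix multiplication: checksum rows and columns are carried through the product, and any mismatch flags a compute error. The sketch below is a minimal software model of that idea, not the paper's hardware design; the matrix shapes and the fault-injection step are assumptions for illustration.

```python
import numpy as np

def abft_matmul(A, B):
    """Multiply A @ B with checksums that flag faulty products."""
    # Append a column-checksum row to A and a row-checksum column to B.
    A_c = np.vstack([A, A.sum(axis=0)])                  # (m+1) x k
    B_r = np.hstack([B, B.sum(axis=1, keepdims=True)])   # k x (n+1)

    C_full = A_c @ B_r                                   # (m+1) x (n+1)
    C = C_full[:-1, :-1]

    # Checksum invariants: the last row/column of the product must equal
    # the column/row sums of C. A mismatch signals a compute error.
    row_ok = np.allclose(C_full[-1, :-1], C.sum(axis=0))
    col_ok = np.allclose(C_full[:-1, -1], C.sum(axis=1))
    return C, (row_ok and col_ok)

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 3))
B = rng.standard_normal((3, 5))

C, ok = abft_matmul(A, B)
print("fault-free result passes checks:", ok)            # True

# Simulate a bit error from an undervolted multiply-accumulate:
C_bad = A @ B
C_bad[1, 2] += 0.5                                       # injected error
detected = not np.allclose(A.sum(axis=0) @ B, C_bad.sum(axis=0))
print("injected fault detected:", detected)              # True
```

The appeal for voltage scaling is that the checks cost only O(mn + nk + mk) extra work on top of the O(mnk) multiply, so the array can run at a voltage where occasional errors occur and rely on detection plus re-execution instead of worst-case guardbands.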


2021
Vol 20 (5s), pp. 1-20
Author(s): Hyungmin Cho

Depthwise convolutions are widely used in convolutional neural networks (CNNs) targeting mobile and embedded systems. Depthwise convolution layers reduce the computation load and the number of parameters compared to conventional convolution layers. Many deep neural network (DNN) accelerators adopt an architecture that exploits the high data-reuse factor of DNN computations, such as a systolic array. However, depthwise convolutions have a low data-reuse factor and under-utilize the processing elements (PEs) in systolic arrays. In this paper, we present a DNN accelerator design called RiSA, which provides a novel mechanism that boosts PE utilization for depthwise convolutions on a systolic array with minimal overhead. In addition, the PEs in systolic arrays can be used efficiently only if the data items (tensors) are arranged in the desired layout. Typical DNN accelerators provide various types of PE interconnects or additional modules to flexibly rearrange data items and manage data movement during DNN computations. RiSA provides a lightweight set of tensor management tasks within the PE array itself, eliminating the need for an additional tensor-reshaping module. Using this embedded tensor reshaping, RiSA supports various DNN models, including convolutional neural networks and natural language processing models, while maintaining high area efficiency. Compared to Eyeriss v2, RiSA improves the area and energy efficiency of MobileNet-V1 inference by 1.91× and 1.31×, respectively.
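The under-utilization claim is easy to quantify with a back-of-envelope occupancy model. The 128x128 array size and the weight-stationary mapping below are assumptions for illustration, not RiSA's actual configuration or mapping scheme.

```python
# Hypothetical 128x128 weight-stationary systolic array: rows hold one
# filter's receptive field (C_in * K * K weights), columns hold output
# channels. A tile's occupancy is the fraction of PEs holding live weights.
ROWS, COLS = 128, 128

def tile_utilization(c_in_per_filter, c_out, K):
    used = min(c_in_per_filter * K * K, ROWS) * min(c_out, COLS)
    return used / (ROWS * COLS)

# Standard 3x3 conv, 64 -> 64 channels: the weight tile fills half the array.
print(f"standard : {tile_utilization(64, 64, 3):6.2%}")   # 50.00%

# Depthwise 3x3 conv: each filter sees one input channel and feeds one
# output channel, so a tile occupies only 3 * 3 = 9 PEs.
print(f"depthwise: {tile_utilization(1, 1, 3):6.2%}")     # 0.05%
```

Under these assumptions a depthwise layer leaves more than 99% of the PEs idle, which is the gap RiSA's utilization-boosting mechanism targets.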


2021
Author(s): Jeong-Jun Lee, Jianhao Chen, Wenrui Zhang, Peng Li
