Energy-Efficient Convolutional Neural Networks via Recurrent Data Reuse

AxR-NN: Approximate Computation Reuse for Energy-Efficient Convolutional Neural Networks

Proceedings of the 2020 on Great Lakes Symposium on VLSI ◽

10.1145/3386263.3407595 ◽

2020 ◽

Author(s):

Dongning Ma ◽

Xunzhao Yin ◽

Michael Niemier ◽

X. Sharon Hu ◽

Xun Jiao

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Energy Efficient ◽

Approximate Computation

COSY: An Energy-Efficient Hardware Architecture for Deep Convolutional Neural Networks Based on Systolic Array

2017 IEEE 23rd International Conference on Parallel and Distributed Systems (ICPADS) ◽

10.1109/icpads.2017.00034 ◽

2017 ◽

Cited By ~ 2

Author(s):

Chen Xin ◽

Qiang Chen ◽

Miren Tian ◽

Mohan Ji ◽

Chenglong Zou ◽

...

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Systolic Array ◽

Energy Efficient ◽

Hardware Architecture ◽

Deep Convolutional Neural Networks

RiSA: A Reinforced Systolic Array for Depthwise Convolutions and Embedded Tensor Reshaping

ACM Transactions on Embedded Computing Systems ◽

10.1145/3476984 ◽

2021 ◽

Vol 20 (5s) ◽

pp. 1-20

Author(s):

Hyungmin Cho

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Language Processing ◽

Systolic Array ◽

Data Reuse ◽

Systolic Arrays ◽

High Data ◽

Area Efficiency ◽

High Area ◽

Accelerator Design

Depthwise convolutions are widely used in convolutional neural networks (CNNs) targeting mobile and embedded systems. Compared to conventional convolution layers, depthwise convolution layers reduce both the computational load and the number of parameters. Many deep neural network (DNN) accelerators adopt an architecture, such as a systolic array, that exploits the high data-reuse factor of DNN computations. However, depthwise convolutions have a low data-reuse factor and under-utilize the processing elements (PEs) in systolic arrays. In this paper, we present a DNN accelerator design called RiSA, which provides a novel mechanism that boosts PE utilization for depthwise convolutions on a systolic array with minimal overhead. In addition, the PEs in a systolic array can be used efficiently only if the data items (tensors) are arranged in the desired layout. Typical DNN accelerators provide various types of PE interconnects or additional modules to flexibly rearrange the data items and manage data movement during DNN computations. RiSA provides a lightweight set of tensor management tasks within the PE array itself, which eliminates the need for an additional module for tensor reshaping. Using this embedded tensor reshaping, RiSA supports various DNN models, including convolutional neural networks and natural language processing models, while maintaining high area efficiency. Compared to Eyeriss v2, RiSA improves the area and energy efficiency for MobileNet-V1 inference by 1.91× and 1.31×, respectively.
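
The abstract's two claims about depthwise layers (fewer parameters and operations, but lower data reuse) can be illustrated with a small counting sketch. This is not RiSA's mechanism or code from the paper. The helper names and the layer shape are hypothetical, chosen only for illustration, and the reuse measure used here (how many filters consume each input activation) is one simple proxy among several.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConvShape:
    """Hypothetical layer description (stride and padding folded into out_size)."""
    in_channels: int
    out_channels: int
    kernel: int    # side length K of a square K x K kernel
    out_size: int  # side length of the square output feature map


def standard_conv_cost(s: ConvShape) -> tuple[int, int]:
    """Return (parameters, multiply-accumulates) for a conventional convolution."""
    params = s.kernel ** 2 * s.in_channels * s.out_channels
    macs = params * s.out_size ** 2
    return params, macs


def depthwise_conv_cost(s: ConvShape) -> tuple[int, int]:
    """Return (parameters, MACs) for a depthwise convolution.

    Each input channel has exactly one K x K filter, so out_channels is
    ignored and the output has in_channels channels.
    """
    params = s.kernel ** 2 * s.in_channels
    macs = params * s.out_size ** 2
    return params, macs


def input_reuse_across_filters(s: ConvShape, depthwise: bool) -> int:
    """Number of filters that consume a given input activation.

    In a conventional convolution every output-channel filter reads the
    same input activation; in a depthwise convolution only one does.
    """
    return 1 if depthwise else s.out_channels


# Illustrative shape only (an assumption, not a figure from the paper).
shape = ConvShape(in_channels=32, out_channels=32, kernel=3, out_size=112)

std_params, std_macs = standard_conv_cost(shape)
dw_params, dw_macs = depthwise_conv_cost(shape)

print("standard :", std_params, "params,", std_macs, "MACs")
print("depthwise:", dw_params, "params,", dw_macs, "MACs")
print("filters sharing each input:",
      input_reuse_across_filters(shape, depthwise=False), "vs",
      input_reuse_across_filters(shape, depthwise=True))
```

Under this proxy, the same factor that shrinks the depthwise layer (the number of output channels) is also the reuse a systolic array would otherwise spread across its PEs, which is consistent with the under-utilization the abstract describes.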

TFE: Energy-efficient Transferred Filter-based Engine to Compress and Accelerate Convolutional Neural Networks

2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) ◽

10.1109/micro50266.2020.00067 ◽

2020 ◽

Author(s):

Huiyu Mo ◽

Leibo Liu ◽

Wenjing Hu ◽

Wenping Zhu ◽

Qiang Li ◽

...

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Energy Efficient

Energy-Efficient Convolutional Neural Networks with Deterministic Bit-Stream Processing

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE) ◽

10.23919/date.2019.8714937 ◽

2019 ◽

Cited By ~ 12

Author(s):

S. Rasoul Faraji ◽

M. Hassan Najafi ◽

Bingzhe Li ◽

David J. Lilja ◽

Kia Bazargan

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Energy Efficient ◽

Stream Processing ◽

Bit Stream

A Heterogeneous and Reconfigurable Embedded Architecture for Energy-Efficient Execution of Convolutional Neural Networks

Architecture of Computing Systems – ARCS 2019 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-18656-2_20 ◽

2019 ◽

pp. 267-280

Author(s):

Konstantin Lübeck ◽

Oliver Bringmann

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Energy Efficient ◽

Embedded Architecture ◽

Efficient Execution

A Configurable and Versatile Architecture for Low Power, Energy Efficient Hardware Acceleration of Convolutional Neural Networks

2019 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC) ◽

10.1109/norchip.2019.8906950 ◽

2019 ◽

Author(s):

Steinar Thune Christensen ◽

Snorre Aunet ◽

Omer Qadir

Keyword(s):

Neural Networks ◽

Low Power ◽

Convolutional Neural Networks ◽

Energy Efficient ◽

Hardware Acceleration

Energy Efficient Convolutional Neural Networks for EEG Artifact Detection

2018 IEEE Biomedical Circuits and Systems Conference (BioCAS) ◽

10.1109/biocas.2018.8584791 ◽

2018 ◽

Cited By ~ 4

Author(s):

Mohit Khatwani ◽

M. Hosseini ◽

H. Paneliya ◽

Tinoosh Mohsenin ◽

W. David Hairston ◽

...

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Energy Efficient ◽

Artifact Detection

Gabor filter assisted energy efficient fast learning Convolutional Neural Networks

2017 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED) ◽

10.1109/islped.2017.8009202 ◽

2017 ◽

Cited By ~ 24

Author(s):

Syed Shakib Sarwar ◽

Priyadarshini Panda ◽

Kaushik Roy

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Energy Efficient ◽

Gabor Filter ◽

Fast Learning

An Energy-Efficient Architecture for Binary Weight Convolutional Neural Networks

IEEE Transactions on Very Large Scale Integration (VLSI) Systems ◽

10.1109/tvlsi.2017.2767624 ◽

2018 ◽

Vol 26 (2) ◽

pp. 280-293 ◽

Cited By ~ 24

Author(s):

Yizhi Wang ◽

Jun Lin ◽

Zhongfeng Wang

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Energy Efficient ◽

Energy Efficient Architecture ◽

Binary Weight