Tolerating Defects in Low-Power Neural Network Accelerators Via Retraining-Free Weight Approximation

2021 ◽ Vol 20 (5s) ◽ pp. 1-21
Author(s):  
Fateme S. Hosseini ◽  
Fanruo Meng ◽  
Chengmo Yang ◽  
Wujie Wen ◽  
Rosario Cammarota

Hardware accelerators are essential for accommodating ever-increasing Deep Neural Network (DNN) workloads on resource-constrained embedded devices. While accelerators enable fast and energy-efficient DNN operation, their accuracy is threatened by faults in the on-chip and off-chip memories that hold millions of DNN weights. The use of emerging Non-Volatile Memories (NVM) further exposes DNN accelerators to a non-negligible rate of permanent defects caused by immature fabrication, limited endurance, and aging. To tolerate defects in NVM-based DNN accelerators, previous work either requires extra hardware redundancy or performs defect-aware retraining, both of which impose significant overhead. In contrast, this paper proposes a set of algorithms that exploit the flexibility in setting the fault-free bits of a weight memory cell to approximate weight values, thereby mitigating the defect-induced accuracy drop. These algorithms can be applied as a one-step solution when loading the weights onto embedded devices; they require only trivial hardware support and impose negligible run-time overhead. Experiments on popular DNN models show that the proposed techniques successfully boost inference accuracy even at elevated defect rates in the weight memory.
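The abstract does not spell out the approximation algorithms, but the core idea it names — choosing the fault-free bits so the stored value lands as close as possible to the intended weight — can be illustrated with a small sketch. Everything below (the function name, the brute-force search, the stuck-at fault encoding via a mask and a value word) is an assumption for illustration, not the paper's actual method:

```python
def closest_weight(target, stuck_mask, stuck_vals, bits=8):
    """Illustrative sketch (not the paper's algorithm): find the
    `bits`-bit value closest to `target` when the memory bits selected
    by `stuck_mask` are permanently stuck at the corresponding bits of
    `stuck_vals`.  Only the remaining fault-free bits are free to set.

    For small word widths a brute-force scan over all 2**bits codes is
    cheap and obviously correct.
    """
    best = None
    for v in range(1 << bits):
        # Skip codes that disagree with the stuck (defective) bits.
        if (v & stuck_mask) != (stuck_vals & stuck_mask):
            continue
        if best is None or abs(v - target) < abs(best - target):
            best = v
    return best

# Example: MSB of an 8-bit cell stuck at 1; the closest storable
# approximation of weight 100 is 128.
closest_weight(100, stuck_mask=0b1000_0000, stuck_vals=0b1000_0000)
```

A real implementation applied at weight-loading time would precompute the defect map per memory location and pick the free bits greedily (MSB first) rather than scanning all codes, but the brute-force version makes the "flexibility in the fault-free bits" idea concrete.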

Author(s):  
Hendrik Wöhrle ◽  
Mariela De Lucas Alvarez ◽  
Fabian Schlenke ◽  
Alexander Walsemann ◽  
Michael Karagounis ◽  
...  

2020 ◽ Vol 10 (4) ◽ pp. 33
Author(s):  
Pramesh Pandey ◽  
Noel Daniel Gundi ◽  
Prabal Basu ◽  
Tahmoures Shabanian ◽  
Mitchell Craig Patrick ◽  
...  

AI is evolving rapidly, and Deep Neural Network (DNN) inference accelerators are at the forefront of the specialized architectures emerging to support the immense throughput that AI computation requires. However, far more energy-efficient design paradigms are needed to realize the full potential of this evolution while curtailing energy consumption. The Near-Threshold Computing (NTC) design paradigm is a strong candidate for providing the required energy efficiency, but NTC operation is plagued by performance and reliability concerns arising from timing errors. In this paper, we dive deep into the DNN accelerator architecture to uncover unique challenges and opportunities for operation in the NTC paradigm. Through rigorous simulations of a TPU systolic array, we reveal the severity of timing errors and their impact on inference accuracy at NTC. We analyze various attributes — the data–delay relationship, delay disparity within arithmetic units, utilization patterns, hardware homogeneity, and workload characteristics — and uncover localized and global techniques to deal with timing errors at NTC.
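To make the timing-error failure mode concrete: at near-threshold voltage, the longest logic paths in a multiply-accumulate (MAC) unit — typically the high-order carry chain — can miss the clock edge, so the latched partial sum differs from the correct one in a high-order bit. The toy model below is purely an assumption-laden sketch of that behavior (the function, its parameters, and the single-bit-flip error model are all illustrative; the paper's simulations are far more detailed):

```python
import random

def mac_with_timing_error(a, b, acc, err_rate=0.0, bit=15):
    """Toy model (illustrative only) of one systolic-array MAC step at
    near-threshold voltage: with probability `err_rate`, the slow carry
    path misses the clock edge and the latched partial sum has bit
    `bit` flipped.  High-order bits are modeled as failing first
    because they sit at the end of the longest carry chain.
    """
    out = acc + a * b
    if random.random() < err_rate:
        out ^= (1 << bit)  # single-bit upset in a high-order position
    return out

# err_rate=0.0 models nominal voltage: the MAC is exact.
mac_with_timing_error(3, 4, 5, err_rate=0.0)  # 3*4 + 5 = 17
```

Because a flip in a high-order bit of a partial sum can perturb an activation by a large magnitude, even rare timing errors can visibly degrade inference accuracy — which is why the abstract's data–delay and delay-disparity analyses matter.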


Author(s):  
Hongwu Jiang ◽  
Shanshi Huang ◽  
Xiaochen Peng ◽  
Jian-Wei Su ◽  
Yen-Chi Chou ◽  
...  
