Selective Information Control and Network Compression in Multi-layered Neural Networks

Author(s): Ryotaro Kamimura
Structured Sparsification of Gated Recurrent Neural Networks
2020 ◽ Vol 34 (04) ◽ pp. 4989-4996
Author(s): Ekaterina Lobacheva ◽ Nadezhda Chirkova ◽ Alexander Markovich ◽ Dmitry Vetrov

One of the most popular approaches to neural network compression is sparsification, i.e., learning sparse weight matrices. In structured sparsification, weights are set to zero in groups corresponding to structural units, e.g., neurons. We further develop the structured sparsification approach for gated recurrent neural networks such as the Long Short-Term Memory (LSTM). Specifically, in addition to sparsifying individual weights and neurons, we propose sparsifying the preactivations of gates. This makes some gates constant and simplifies the LSTM structure. We test our approach on text classification and language modeling tasks. Our method improves the neuron-wise compression of the model on most of the tasks. We also observe that the resulting structure of gate sparsity depends on the task, and we connect the learned structures to the specifics of the particular tasks.
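To make the gate-preactivation idea concrete, below is a minimal PyTorch sketch (our own illustration, not the authors' code) of a group-lasso penalty over per-gate, per-unit weight rows of an LSTM. Driving an entire row group to zero makes that unit's gate preactivation constant (equal to its bias), which is the simplification the abstract describes; the helper name and the regularization weight `lam` are illustrative assumptions.

```python
import torch
import torch.nn as nn

def gate_group_lasso(lstm: nn.LSTM, lam: float = 1e-4) -> torch.Tensor:
    """Group-lasso penalty that pushes whole gate preactivations toward
    constants. Hypothetical helper, not the paper's implementation."""
    h = lstm.hidden_size
    # PyTorch stacks the gates as (input, forget, cell, output) along dim 0.
    w = torch.cat([lstm.weight_ih_l0, lstm.weight_hh_l0], dim=1)  # (4h, in+h)
    penalty = torch.zeros((), device=w.device)
    for g in range(4):                      # one group of rows per gate
        rows = w[g * h:(g + 1) * h]         # (h, in+h): one row per hidden unit
        # L2 norm within each unit's row, L1 sum across units -> group sparsity
        penalty = penalty + rows.norm(dim=1).sum()
    return lam * penalty

# Usage sketch: total_loss = task_loss + gate_group_lasso(model.lstm)
```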


2021 ◽ Vol 2021 ◽ pp. 1-12
Author(s): Yisu Ge ◽ Shufang Lu ◽ Fei Gao

Many current convolutional neural networks cannot meet practical application requirements because of their enormous number of parameters. To accelerate network inference, increasing attention has been paid to network compression. Network pruning is one of the simplest and most efficient ways to compress and speed up a network. In this paper, a pruning algorithm for lightweight tasks is proposed, and a pruning strategy based on feature representation is investigated. Unlike other pruning approaches, the proposed strategy is guided by the practical task and eliminates filters that are irrelevant to it. After pruning, the network is compacted to a smaller size and easily recovers its accuracy with fine-tuning. The performance of the proposed pruning algorithm is validated on widely used image datasets, and the experimental results show that the proposed algorithm is better suited to pruning filters that are irrelevant to the fine-tuning dataset.
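As a rough illustration of task-guided filter pruning (a generic sketch under our own assumptions, since the paper's exact relevance criterion is not reproduced here), one can score each filter by its feature-map response on calibration data from the target task and zero out the least relevant filters before fine-tuning:

```python
import torch
import torch.nn as nn

@torch.no_grad()
def score_filters(acts: torch.Tensor) -> torch.Tensor:
    """Score each filter by the mean absolute activation of its feature
    map on task data: a simple proxy for task relevance."""
    # acts: (batch, out_channels, H, W) outputs of a conv layer
    # on a calibration set drawn from the target task
    return acts.abs().mean(dim=(0, 2, 3))      # one score per filter

@torch.no_grad()
def prune_filters(conv: nn.Conv2d, scores: torch.Tensor, ratio: float = 0.5):
    """Zero the lowest-scoring filters; accuracy is then recovered
    with fine-tuning, as described in the abstract."""
    k = int(conv.out_channels * ratio)
    idx = scores.argsort()[:k]                 # least task-relevant filters
    conv.weight[idx] = 0.0
    if conv.bias is not None:
        conv.bias[idx] = 0.0
```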


Electronics ◽ 2020 ◽ Vol 9 (7) ◽ pp. 1059
Author(s): Wenzhe Guo ◽ Hasan Erdem Yantır ◽ Mohammed E. Fouda ◽ Ahmed M. Eltawil ◽ Khaled Nabil Salama

To solve real-time challenges, neuromorphic systems generally require deep and complex network structures. It is therefore crucial to find effective solutions that reduce network complexity and improve energy efficiency while maintaining high accuracy. To this end, we propose unsupervised pruning strategies for spiking neural networks (SNNs) that prune neurons during training by exploiting network dynamics. Neuron importance is measured by spike count, on the premise that neurons that fire more spikes contribute more to network performance. Based on this criterion, we demonstrate that pruning with an adaptive spike-count threshold is a simple and effective approach that significantly reduces network size while maintaining high classification accuracy. Online adaptive pruning shows potential for energy-efficient training because it requires fewer memory accesses and fewer weight-update computations. Furthermore, a parallel digital implementation scheme is proposed for implementing SNNs on a field-programmable gate array (FPGA). Notably, the proposed pruning strategies preserve the dense format of the weight matrices, so the implementation architecture remains the same after network compression. The adaptive pruning strategy yields a 2.3× reduction in memory size and a 2.8× improvement in energy efficiency when 400 neurons are pruned from an 800-neuron network, at a cost of 1.69% in classification accuracy. The best pruning percentage depends on the trade-off among accuracy, memory, and energy. This work therefore offers a promising solution for effective network compression and energy-efficient hardware implementation of neuromorphic systems in real-time applications.
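A minimal NumPy sketch of the spike-count idea, assuming per-neuron spike counts accumulated during training; the percentile rule below is an illustrative stand-in for the paper's adaptive threshold. Note that pruned neurons are only masked, so the weight matrix keeps its dense shape, matching the hardware-friendly property described above:

```python
import numpy as np

def spike_count_prune(weights: np.ndarray, spike_counts: np.ndarray,
                      percentile: float = 50.0):
    """Mask neurons whose accumulated spike count falls below an
    adaptive threshold. `weights` is (n_in, n_neurons); columns of
    pruned neurons are zeroed, but the dense shape is preserved."""
    threshold = np.percentile(spike_counts, percentile)  # adaptive threshold
    keep = spike_counts >= threshold                     # active neurons
    weights[:, ~keep] = 0.0      # zero incoming weights of pruned neurons
    return weights, keep

# e.g. percentile=50.0 masks roughly 400 of 800 neurons, the setting
# behind the memory/energy figures quoted above.
```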


Informatics ◽ 2021 ◽ Vol 8 (4) ◽ pp. 77
Author(s): Ali Alqahtani ◽ Xianghua Xie ◽ Mark W. Jones

Deep networks often possess a vast number of parameters, and their significant redundancy in parameterization has become a widely recognized property. This presents significant challenges and restricts many deep learning applications, motivating a focus on reducing the complexity of models while maintaining their powerful performance. In this paper, we present an overview of popular methods and review recent work on compressing and accelerating deep neural networks. We consider not only pruning methods but also quantization and low-rank factorization methods. The review also clarifies these major concepts and highlights their characteristics, advantages, and shortcomings.
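Of the three method families mentioned, low-rank factorization is perhaps the easiest to show in a few lines. Here is a small NumPy sketch (our own illustration, not taken from the review) that replaces a dense weight matrix with two thin factors via truncated SVD:

```python
import numpy as np

def low_rank_factorize(W: np.ndarray, rank: int):
    """Approximate W (m x n) by A (m x r) @ B (r x n), cutting the
    parameter count from m*n to r*(m+n) when r << min(m, n)."""
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * S[:rank]   # (m, r): left factor, scaled by singular values
    B = Vt[:rank]                # (r, n): right factor
    return A, B

# Example: a 1024x1024 layer at rank 64 shrinks from ~1.05M to ~131k
# parameters (about 8x smaller), at the cost of an approximation error
# governed by the discarded singular values.
```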

