Towards Verification-Aware Knowledge Distillation for Neural-Network Controlled Systems: Invited Paper

Author(s): Jiameng Fan, Chao Huang, Wenchao Li, Xin Chen, Qi Zhu
Electronics, 2021, Vol. 10 (14), pp. 1614
Author(s): Jonghun Jeong, Jong Sung Park, Hoeseok Yang

Recently, the need to run high-performance neural networks (NNs) has been growing even in resource-constrained embedded systems such as wearable devices. However, due to the high computational and memory requirements of NN applications, it is typically infeasible to execute them on a single device. Instead, it has been proposed to run a single NN application cooperatively across multiple devices, a so-called distributed neural network, in which the workload of one large NN application is spread over multiple tiny devices. While this approach effectively alleviates the computational burden, existing distributed NN techniques, such as MoDNN, still suffer from heavy inter-device traffic and vulnerability to communication failures. To eliminate this large communication overhead, a knowledge-distillation-based distributed NN, called Network of Neural Networks (NoNN), was proposed; it partitions the filters in the final convolutional layer of the original NN into multiple independent subsets and derives a smaller NN from each subset. However, NoNN also has limitations: the partitioning result may be unbalanced, and it considerably weakens the correlation between filters in the original NN, which may cause unacceptable accuracy degradation when communication fails. In this paper, to overcome these issues, we propose to enhance the partitioning strategy of NoNN in two aspects. First, we increase the redundancy of the filters used to derive the smaller NNs by means of averaging, improving the distributed NN's immunity to communication failures. Second, we propose a novel partitioning technique, modified from eigenvector-based partitioning, that preserves the correlation between filters as much as possible while keeping the number of filters assigned to each device consistent. Through extensive experiments with the CIFAR-100 (Canadian Institute For Advanced Research-100) dataset, we observed that the proposed approach maintains high inference accuracy on average (over 70%, a 1.53× improvement over the state-of-the-art approach), even when half of the eight devices in a distributed NN fail to deliver their partial inference results.
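The abstract does not come with reference code, so the following is a minimal, hypothetical Python sketch of what balanced eigenvector-based (spectral) partitioning of final-layer filters can look like: it builds a filter-correlation graph, takes the Fiedler vector of the graph Laplacian, and cuts the sorted filter ordering into equal-size groups so each device receives the same number of mutually correlated filters. Function and variable names are illustrative, not from the paper.

```python
import numpy as np

def balanced_spectral_partition(activations, n_parts=8):
    """Partition filters into equal-size, correlation-preserving groups.

    activations: array of shape (n_samples, n_filters), e.g. spatially
    pooled activations of the final convolutional layer.
    Returns a list of n_parts arrays of filter indices.
    """
    # Edge weights: absolute correlation between filter activations.
    weights = np.abs(np.corrcoef(activations.T))
    np.fill_diagonal(weights, 0.0)

    # Unnormalized graph Laplacian L = D - W.
    laplacian = np.diag(weights.sum(axis=1)) - weights

    # Fiedler vector: eigenvector of the second-smallest eigenvalue.
    # It embeds filters on a line so correlated filters sit close together.
    _, eigvecs = np.linalg.eigh(laplacian)
    fiedler = eigvecs[:, 1]

    # Balanced split: sort filters along the embedding and cut the
    # ordering into n_parts equal-size chunks (one per device).
    order = np.argsort(fiedler)
    return np.array_split(order, n_parts)

# Hypothetical usage: 512 samples of 64 final-layer filter activations,
# to be distributed over 8 devices.
parts = balanced_spectral_partition(np.random.randn(512, 64), n_parts=8)
```

Sorting by the Fiedler vector keeps strongly correlated filters in the same chunk, while the equal-size split enforces the balanced per-device assignment that the modified partitioning aims for.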


1994, Vol. 27 (8), pp. 605-610
Author(s): K. Kumamaru, K. Inoue, S. Nonaka, H. Ono, T. Söderström

2021
Author(s): O.V. Druzhinina, E.R. Korepanov, V.V. Belousov, O.N. Masina, A.A. Petrov

The development of tools for solving research problems using domestic software and hardware is a pressing direction. Such problems include the neural network modeling of nonlinear controlled systems. The paper provides an extended analysis of the capabilities of the Elbrus architecture and of the blocks of the built-in EML library for the mathematical modeling of nonlinear systems. A comparative analysis of the tooling and efficiency of computational experiments is performed, taking into account the use of an 8-core processor and the potential capabilities of a 16-core processor. The specifics of the EML library blocks with respect to particular types of scientific problems are considered, and the optimized software is analyzed. A design for generalized models of nonlinear systems with switching is proposed. For these generalized models, a new switching algorithm has been developed that can be adapted to the Elbrus computing platform. An algorithmic tree is constructed, and algorithmic and software tools are developed for the study of models with switching. The results of adapting the modules of the software package for modeling controlled systems to the elements of the platform are presented. The results of the computer modeling of nonlinear systems on the Elbrus 801-RS computing platform are systematized and generalized. The results can be used in creating algorithmic and software tools for research modeling problems, in the synthesis and analysis of models of controlled technical systems with switching modes of operation, and in neural network modeling and machine learning problems.
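The paper does not include source listings, but the class of models it studies, nonlinear systems with switching, can be sketched in a few lines of Python. The simulator below is a generic forward-Euler integrator for dx/dt = f_sigma(t,x)(x), not the authors' algorithm and not the EML library API; all names are illustrative.

```python
import numpy as np

def simulate_switched_system(modes, switch_rule, x0, t_end, dt=1e-3):
    """Integrate dx/dt = f_sigma(x), where the active mode sigma is
    chosen by switch_rule(t, x) at every step (forward Euler).

    modes:       list of callables f_i(x) -> dx/dt, one per mode.
    switch_rule: callable (t, x) -> index of the active mode.
    """
    steps = round(t_end / dt)
    xs = np.empty((steps + 1, len(x0)))
    xs[0] = x0
    for k in range(steps):
        t = k * dt
        sigma = switch_rule(t, xs[k])          # pick the active dynamics
        xs[k + 1] = xs[k] + dt * modes[sigma](xs[k])
    return xs

# Example: two linear modes, switched on the sign of the first state.
stable = lambda x: np.array([-x[0] + x[1], -x[1]])
rotating = lambda x: np.array([x[1], -x[0]])
rule = lambda t, x: 0 if x[0] > 0 else 1
trajectory = simulate_switched_system([stable, rotating], rule,
                                      x0=np.array([1.0, 0.0]), t_end=10.0)
```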


2019, Vol. 9 (16), pp. 3396
Author(s): Jianfeng Wu, Yongzhu Hua, Shengying Yang, Hongshuai Qin, Huibin Qin

This paper presents a new deep neural network (DNN)-based speech enhancement algorithm that integrates knowledge distilled from a traditional statistics-based method. Unlike other DNN-based methods, which usually train many different models on the same data and then average their predictions, or use a large number of noise types to enlarge the simulated noisy speech corpus, the proposed method does not train a whole ensemble of models and does not require a large amount of simulated noisy speech. It first trains a discriminator network and a generator network simultaneously using adversarial learning. Then, the discriminator and generator networks are re-trained by distilling knowledge from the statistical method, inspired by knowledge distillation in neural networks. Finally, the generator network is fine-tuned using real noisy speech. Experiments on the CHiME4 datasets demonstrate that the proposed method achieves more robust performance than the compared DNN-based method in terms of perceptual speech quality.
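The abstract does not spell out the exact loss used during re-training, so the PyTorch fragment below is only a hedged illustration of the distillation idea: the generator's output is pulled both toward the clean reference and toward the estimate produced by a traditional statistical enhancer acting as the teacher. The weighting factor alpha, the MSE formulation, and all names are assumptions, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def generator_distillation_loss(enhanced, clean, teacher_estimate, alpha=0.5):
    """Mix a supervised term with a distillation term for the generator.

    enhanced:         generator output, e.g. a (batch, freq, frames) spectrogram
    clean:            ground-truth clean-speech spectrogram
    teacher_estimate: output of a statistical enhancer (e.g. an MMSE/Wiener
                      estimate) serving as the distillation teacher
    alpha:            assumed weight of the distillation term
    """
    supervised = F.mse_loss(enhanced, clean)
    distilled = F.mse_loss(enhanced, teacher_estimate)
    return (1.0 - alpha) * supervised + alpha * distilled

# Hypothetical usage with random tensors standing in for spectrograms.
b, f, t = 4, 257, 100
loss = generator_distillation_loss(torch.randn(b, f, t),
                                   torch.randn(b, f, t),
                                   torch.randn(b, f, t))
```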

