A Novel Training Strategy for Deep Learning Model Compression Applied to Viral Classifications

AbstractDeep learning models are tools for data analysis suitable for approximating (non-linear) relationships among variables for the best prediction of an outcome. While these models can be used to answer many important questions, their utility is still harshly criticized, being extremely challenging to identify which data descriptors are the most adequate to represent a given specific phenomenon of interest. With a recent experience in the development of a deep learning model designed to detect failures in mechanical water meter devices, we have learnt that a sensible deterioration of the prediction accuracy can occur if one tries to train a deep learning model by adding specific device descriptors, based on categorical data. This can happen because of an excessive increase in the dimensions of the data, with a correspondent loss of statistical significance. After several unsuccessful experiments conducted with alternative methodologies that either permit to reduce the data space dimensionality or employ more traditional machine learning algorithms, we changed the training strategy, reconsidering that categorical data, in the light of a Pareto analysis. In essence, we used those categorical descriptors, not as an input on which to train our deep learning model, but as a tool to give a new shape to the dataset, based on the Pareto rule. With this data adjustment, we trained a more performative deep learning model able to detect defective water meter devices with a prediction accuracy in the range 87–90%, even in the presence of categorical descriptors.

Download Full-text

A Deep Learning Model Compression and Ensemble Approach for Weed Detection

10.24251/hicss.2022.138 ◽

2022 ◽

Author(s):

Martinson Ofori ◽

Omar El-Gayar ◽

Austin O'Brien ◽

Cherie Noteboom

Keyword(s):

Deep Learning ◽

Learning Model ◽

Weed Detection ◽

Ensemble Approach ◽

Model Compression ◽

Deep Learning Model

Download Full-text

Research on Model Compression for Embedded Platform through Quantization and Pruning

Journal of Physics Conference Series ◽

10.1088/1742-6596/2078/1/012047 ◽

2021 ◽

Vol 2078 (1) ◽

pp. 012047

Author(s):

Xiao Hu ◽

Hao Wen

Keyword(s):

Artificial Intelligence ◽

Deep Learning ◽

Learning Model ◽

Model Simplification ◽

Accuracy Rate ◽

Embedded Platform ◽

Remarkable Improvement ◽

Model Compression ◽

Artificial Intelligence Technology ◽

Deep Learning Model

Abstract So far, artificial intelligence has gone through decades of development. Although artificial intelligence technology is not yet mature, it has already been applied in many walks of life. With the explosion of IoT technology in 2019, artificial intelligence has ushered in a new climax. It can be said that the development of IoT technology has led to the development of artificial intelligence once again. But the traditional deep learning model is very complex and redundant. The hardware environment of IoT can not afford the time and resources cost by the model which runs on the GPU originally, so model compression without decreasing accuracy rate so much is applicable in this situation. In this paper, we experimented with using two tricks for model compression: Pruning and Quantization. By utilizing these methods, we got a remarkable improvement in model simplification while retaining a relatively close accuracy.

Download Full-text

Deep Learning Model Selection of Suboptimal Complexity

Автоматика и телемеханика ◽

10.31857/s000523100001252-1 ◽

2018 ◽

pp. 129-147

Author(s):

Oleg Bakhteev ◽

◽

Vadim Strijov ◽

Keyword(s):

Deep Learning ◽

Model Selection ◽

Learning Model ◽

Deep Learning Model ◽

Selection Of

Download Full-text

Improving Sentiment Analysis using Hybrid Deep Learning Model

Recent Advances in Computer Science and Communications ◽

10.2174/2213275912666190328200012 ◽

2020 ◽

Vol 13 (4) ◽

pp. 627-640 ◽

Cited By ~ 1

Author(s):

Avinash Chandra Pandey ◽

Dharmveer Singh Rajpoot

Keyword(s):

Neural Network ◽

Deep Learning ◽

Sentiment Analysis ◽

Classification Accuracy ◽

Short Term Memory ◽

Computational Cost ◽

Extraction Process ◽

Learning Model ◽

Sentiment Classification ◽

Deep Learning Model

Background: Sentiment analysis is a contextual mining of text which determines viewpoint of users with respect to some sentimental topics commonly present at social networking websites. Twitter is one of the social sites where people express their opinion about any topic in the form of tweets. These tweets can be examined using various sentiment classification methods to find the opinion of users. Traditional sentiment analysis methods use manually extracted features for opinion classification. The manual feature extraction process is a complicated task since it requires predefined sentiment lexicons. On the other hand, deep learning methods automatically extract relevant features from data hence; they provide better performance and richer representation competency than the traditional methods. Objective: The main aim of this paper is to enhance the sentiment classification accuracy and to reduce the computational cost. Method: To achieve the objective, a hybrid deep learning model, based on convolution neural network and bi-directional long-short term memory neural network has been introduced. Results: The proposed sentiment classification method achieves the highest accuracy for the most of the datasets. Further, from the statistical analysis efficacy of the proposed method has been validated. Conclusion: Sentiment classification accuracy can be improved by creating veracious hybrid models. Moreover, performance can also be enhanced by tuning the hyper parameters of deep leaning models.

Download Full-text