Forecasting the power consumption of a rotor spinning machine by using an adaptive squeeze and excitation convolutional neural network with imbalanced data

In a CNN (convolutional neural network) accelerator, the sparsity of activation values needs to be exploited to reduce memory traffic and power consumption. Accordingly, some research efforts have been devoted to skipping ineffectual computations (i.e., multiplications by zero). Unlike previous works, this paper points out the similarity of activation values: (1) in the same layer of a CNN model, most feature maps are either highly dense or highly sparse; (2) in the same layer of a CNN model, feature maps in different channels are often similar. Based on these two observations, we propose a block-based compression approach that utilizes both the sparsity and the similarity of activation values to further reduce the data volume. We also design an encoder, a decoder and an indexing module to support the proposed approach. The encoder translates output activations into the proposed block-based compression format, while the decoder and the indexing module align nonzero values for effectual computations. Benchmark data consistently show that, compared with previous works, the proposed approach can greatly reduce both memory traffic and power consumption.
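
The abstract does not specify the block-based compression format, so the following is only a minimal sketch of the general idea of exploiting activation sparsity. It assumes a simple bitmask scheme in which each block stores one bit per activation plus the nonzero values; the names `encode_block` and `decode_block` are hypothetical, and the similarity between channels that the paper also exploits is not modelled here.

```python
from typing import List, Tuple


def encode_block(block: List[float]) -> Tuple[List[int], List[float]]:
    """Encode one block of activations as (bitmask, nonzero values).

    Assumed scheme for illustration only: 1 marks a nonzero activation,
    0 marks a zero that is not stored.
    """
    mask = [1 if value != 0 else 0 for value in block]
    values = [value for value in block if value != 0]
    return mask, values


def decode_block(mask: List[int], values: List[float]) -> List[float]:
    """Rebuild the dense block by placing stored values where the mask is 1."""
    stored = iter(values)
    return [next(stored) if bit else 0.0 for bit in mask]


# A highly sparse block: only two of eight activations need to be stored.
block = [0.0, 1.5, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0]
mask, values = encode_block(block)
restored = decode_block(mask, values)
```

In this toy format, a sparse block costs one mask bit per activation plus the stored nonzero values, which is where the reduction in data volume would come from.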

Oversampling Based on Data Augmentation in Convolutional Neural Network for Silicon Wafer Defect Classification

Knowledge Innovation Through Intelligent Software Methodologies, Tools and Techniques - Frontiers in Artificial Intelligence and Applications ◽ 10.3233/faia200547 ◽ 2020

Author(s): Uzma Batool ◽ Mohd Ibrahim Shapiai ◽ Nordinah Ismail ◽ Hilman Fauzi ◽ Syahrizal Salleh

Keyword(s): Neural Network ◽ Deep Learning ◽ Convolutional Neural Network ◽ Silicon Wafer ◽ Data Augmentation ◽ Imbalanced Data ◽ Training Data ◽ Defect Classification ◽ Learning Method ◽ Test Set

Silicon wafer defect data collected from fabrication facilities is intrinsically imbalanced because defect types occur with different frequencies. If a model is trained on such skewed data, frequently occurring types have more influence on its classification predictions. A fair classifier for such imbalanced data requires a mechanism to deal with type imbalance in order to avoid biased results. This study proposes a convolutional neural network for wafer map defect classification that employs oversampling to address the imbalance. To give all classes equal participation in the classifier’s training, data augmentation is employed to generate more samples in the minority classes. The proposed deep learning method was evaluated on a real wafer map defect dataset, and its classification results on the test set returned a 97.91% accuracy. The results were compared with another deep-learning-based auto-encoder model, demonstrating that the proposed method is a potential approach for silicon wafer defect classification, although its robustness needs to be investigated further.
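
As a rough illustration of oversampling through data augmentation, the sketch below tops up every minority class with transformed copies of its own samples until all classes match the largest one. The choice of transforms (horizontal flip and 90-degree rotation) and the names `oversample`, `flip_horizontal` and `rotate_90` are assumptions for illustration; the abstract does not state which augmentations the authors used.

```python
import random
from collections import defaultdict
from typing import List, Tuple

Image = List[List[int]]


def flip_horizontal(img: Image) -> Image:
    """Mirror each row of the map left to right."""
    return [row[::-1] for row in img]


def rotate_90(img: Image) -> Image:
    """Rotate the map 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]


def oversample(samples: List[Tuple[Image, str]], seed: int = 0) -> List[Tuple[Image, str]]:
    """Add augmented copies to minority classes until every class
    has as many samples as the largest class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for img, label in samples:
        by_class[label].append(img)
    target = max(len(imgs) for imgs in by_class.values())

    balanced = list(samples)
    for label, imgs in by_class.items():
        for _ in range(target - len(imgs)):
            source = rng.choice(imgs)
            transform = rng.choice([flip_horizontal, rotate_90])
            balanced.append((transform(source), label))
    return balanced


# Toy data: three maps of class "a" and a single map of class "b".
toy = [([[1, 0], [0, 0]], "a")] * 3 + [([[0, 1], [0, 0]], "b")]
balanced_toy = oversample(toy)
```

Augmentation should be applied to the training split only, so that the test set keeps the original class distribution.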

Convolutional Neural Network for Imbalanced Data Classification of Silicon Wafer Defects

2020 16th IEEE International Colloquium on Signal Processing & Its Applications (CSPA) ◽ 10.1109/cspa48992.2020.9068669 ◽ 2020

Author(s): Uzma Batool ◽ Mohd Ibrahim Shapiai ◽ Hilman Fauzi ◽ Jia Xian Fong

Keyword(s): Neural Network ◽ Convolutional Neural Network ◽ Silicon Wafer ◽ Imbalanced Data ◽ Data Classification ◽ Imbalanced Data Classification

Drug-Drug Interaction Extraction via Recurrent Hybrid Convolutional Neural Networks with an Improved Focal Loss

Entropy ◽ 10.3390/e21010037 ◽ 2019 ◽ Vol 21 (1) ◽ pp. 37 ◽ Cited By ~ 13

Author(s): Xia Sun ◽ Ke Dong ◽ Long Ma ◽ Richard Sutcliffe ◽ Feijuan He ◽ ...

Keyword(s): Neural Network ◽ Convolutional Neural Network ◽ Loss Function ◽ Contextual Information ◽ Imbalanced Data ◽ Biomedical Literature ◽ Local Context ◽ Automatic Extraction ◽ Sentence Level ◽ Interaction Extraction

Drug-drug interactions (DDIs) may pose serious health risks and cause dangerous effects in a patient’s body when two or more drugs are taken at the same time or within a certain period of time. Therefore, the automatic extraction of unknown DDIs has great potential for the development of pharmaceutical agents and the safety of drug use. In this article, we propose a novel recurrent hybrid convolutional neural network (RHCNN) for DDI extraction from biomedical literature. In the embedding layer, the texts mentioning two entities are represented as a sequence of semantic embeddings and position embeddings. In particular, the complete semantic embedding is obtained by fusing a word embedding with its contextual information, which is learnt by a recurrent structure. After that, the hybrid convolutional neural network is employed to learn the sentence-level features, which consist of the local context features from consecutive words and the dependency features between separated words for DDI extraction. Last but most significantly, to make up for the defects of the traditional cross-entropy loss function when dealing with class-imbalanced data, we apply an improved focal loss function to mitigate this problem when using the DDIExtraction 2013 dataset. In our experiments, we achieve automatic DDI extraction with a micro F-score of 75.48% on the DDIExtraction 2013 dataset, outperforming the state-of-the-art approach by 2.49%.
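
The abstract does not describe how the focal loss was improved, so the sketch below shows only the standard focal loss, FL(p_t) = -alpha * (1 - p_t)^gamma * log(p_t), for a single example. The function name `focal_loss` and the default values of `gamma` and `alpha` are illustrative assumptions, not the authors' settings.

```python
import math
from typing import Sequence


def focal_loss(probs: Sequence[float], target: int,
               gamma: float = 2.0, alpha: float = 1.0) -> float:
    """Standard focal loss for one example.

    probs  : predicted class probabilities (assumed to sum to 1)
    target : index of the true class
    gamma  : focusing parameter; gamma = 0 recovers cross-entropy
    alpha  : weighting factor for the true class
    """
    # Clamp to avoid log(0) on a completely wrong prediction.
    p_t = min(max(probs[target], 1e-12), 1.0)
    return -alpha * (1.0 - p_t) ** gamma * math.log(p_t)


# A confidently correct example contributes far less than a hard one.
easy = focal_loss([0.1, 0.9], target=1)
hard = focal_loss([0.9, 0.1], target=1)
```

Because the factor (1 - p_t)^gamma shrinks the loss of well-classified examples, the abundant easy examples of the majority class dominate training less than they do under plain cross-entropy.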

Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters

Deep Learning and Neural Networks ◽ 10.4018/978-1-7998-0414-7.ch017 ◽ 2020 ◽ pp. 274-294

Author(s): Yilin Yan ◽ Min Chen ◽ Saad Sadiq ◽ Mei-Ling Shyu

Keyword(s): Neural Network ◽ Deep Learning ◽ Convolutional Neural Network ◽ Imbalanced Data ◽ Network Models ◽ Multimedia Data ◽ Neural Network Models ◽ Minority Class ◽ Imbalanced Data Classification

The classification of imbalanced datasets has recently attracted significant attention due to its implications in several real-world use cases. Classifiers developed on datasets with skewed distributions tend to favor the majority classes and are biased against the minority class. Despite extensive research interest, imbalanced data classification remains a challenge in data mining research, especially for multimedia data. Our attempt to overcome this hurdle is to develop a convolutional neural network (CNN) based deep learning solution integrated with a bootstrapping technique. Considering that convolutional neural networks are computationally expensive, especially when coupled with large training datasets, we propose to extract features from pre-trained convolutional neural network models and feed those features to another fully connected neural network. A Spark implementation shows promising performance of our model in handling big datasets with respect to feasibility and scalability.
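
The abstract does not detail the bootstrapping technique, so the following is one plausible reading, sketched under stated assumptions: each training batch is drawn with replacement so that every class contributes the same number of pre-extracted feature vectors. The function `balanced_bootstrap` and its parameters are hypothetical names, and the feature vectors stand in for outputs of a pre-trained CNN.

```python
import random
from typing import Dict, List, Sequence, Tuple


def balanced_bootstrap(features_by_class: Dict[str, Sequence[List[float]]],
                       per_class: int, seed: int = 0) -> List[Tuple[List[float], str]]:
    """Draw `per_class` feature vectors with replacement from every class,
    so minority and majority classes are equally represented in the batch."""
    rng = random.Random(seed)
    batch = []
    for label, feats in features_by_class.items():
        batch.extend((rng.choice(feats), label) for _ in range(per_class))
    rng.shuffle(batch)
    return batch


# Toy features: the "rare" class has one vector, the "common" class has four.
toy_features = {
    "common": [[0.1, 0.2], [0.2, 0.1], [0.3, 0.3], [0.0, 0.4]],
    "rare": [[0.9, 0.8]],
}
batch = balanced_bootstrap(toy_features, per_class=3)
```

Sampling with replacement lets a small minority class fill its share of each batch, at the cost of repeating the same vectors.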

Imbalanced data fault diagnosis of hydrogen sensors using deep convolutional generative adversarial network with convolutional neural network

Review of Scientific Instruments ◽ 10.1063/5.0057059 ◽ 2021 ◽ Vol 92 (9) ◽ pp. 095007

Author(s): Yongyi Sun ◽ Tingting Zhao ◽ Zhihui Zou ◽ Yinsheng Chen ◽ Hongquan Zhang

Keyword(s): Neural Network ◽ Fault Diagnosis ◽ Convolutional Neural Network ◽ Imbalanced Data ◽ Hydrogen Sensors ◽ Generative Adversarial Network ◽ Adversarial Network

POMMEL: Exploring Off-Chip Memory Energy & Power Consumption in Convolutional Neural Network Accelerators

10.1109/dsd53832.2021.00073 ◽ 2021

Author(s): Alexander Montgomerie-Corcoran ◽ Christos-Savvas Bouganis

Keyword(s): Neural Network ◽ Power Consumption ◽ Convolutional Neural Network

Deep Learning Based Computer Generated Face Identification Using Convolutional Neural Network

Applied Sciences ◽ 10.3390/app8122610 ◽ 2018 ◽ Vol 8 (12) ◽ pp. 2610 ◽ Cited By ~ 14

Author(s): L. Dang ◽ Syed Hassan ◽ Suhyeon Im ◽ Jaecheol Lee ◽ Sujin Lee ◽ ...

Keyword(s): Neural Network ◽ Convolutional Neural Network ◽ Layer Structure ◽ Imbalanced Data ◽ Generative Adversarial Networks ◽ Gradient Boosting ◽ Face Identification ◽ Large Computer ◽ Face Images ◽ Extreme Gradient Boosting

Generative adversarial networks (GANs) are an emerging class of generative models that has made impressive progress in the last few years in generating photorealistic facial images. As a result, it has become more and more difficult to differentiate between computer-generated and real face images, even for the human eye. If the generated images are used with the intent to mislead and deceive readers, they could cause severe ethical, moral, and legal issues. Moreover, it is challenging to collect a dataset for computer-generated face identification that is large enough for research purposes, because the number of realistic computer-generated images is still limited and scattered across the internet. Thus, the development of a novel decision support system for analyzing and detecting computer-generated face images produced by GANs is crucial. In this paper, we propose a customized convolutional neural network, namely CGFace, which is specifically designed for the computer-generated face detection task by customizing the number of convolutional layers, so that it performs well in detecting computer-generated face images. After that, an imbalanced framework (IF-CGFace) is created by altering CGFace’s layer structure to address the imbalanced data issue: features are extracted from CGFace layers and used to train AdaBoost and eXtreme Gradient Boosting (XGB). Next, we explain the process of generating a large computer-generated dataset based on the state-of-the-art PCGAN and BEGAN models. Then, various experiments are carried out to show that the proposed model with augmented input yields the highest accuracy at 98%. Finally, we provide comparative results by applying the proposed CNN architecture to images generated by other GAN research.

Study on intelligent anti–electricity stealing early-warning technology based on convolutional neural networks

Journal of Intelligent & Fuzzy Systems ◽ 10.3233/jifs-189621 ◽ 2021 ◽ pp. 1-7

Author(s): Nan Pan ◽ Xin Shen ◽ Xiaojue Guo ◽ Min Cao ◽ Dilin Pan

Keyword(s): Neural Network ◽ Neural Networks ◽ Power Consumption ◽ Convolutional Neural Network ◽ Convolutional Neural Networks ◽ High Speed ◽ Learning Algorithm ◽ Single Point ◽ Displacement Sensor ◽ Gradient Descent Method

In recent years, electricity stealing has persisted despite repeated prohibition, and as the methods of stealing electricity have become more intelligent and concealed, it is growing increasingly difficult to extract high-dimensional data features of power consumption. In order to solve this problem, a correlation model of power-consumption data based on convolutional neural networks (CNN) is established. First, the original user signal is preprocessed to remove the noise. Segments of a fixed length are then extracted from the user signal and labelled with the corresponding class. The segmented user signals and corresponding labels are input into the convolutional neural network for training, and the trained convolutional neural network is then used to detect and classify the test user signals. Finally, an actual electricity theft and leakage dataset is used to verify the effectiveness of this algorithm, which proves that the algorithm can effectively support anti-electricity-stealing efforts by warning of abnormal power consumption behavior.

Many line traces are left on the surfaces of the broken cable ends at cable-cutting crime scenes along high-speed railways in China. The line traces usually present nonlinear morphological features and have strong randomness. Existing image-processing and three-dimensional scanning methods are not very effective for trace comparison; therefore, a fast algorithm based on wavelet-domain features and aimed at nonlinear line traces is put forward to perform fast trace analysis and infer the criminal tools. The proposed algorithm first applies wavelet decomposition to the 1-D signals picked up by a single-point laser displacement sensor to partially reduce noise. After that, dynamic time warping is employed to perform trace feature similarity matching. Finally, a linear regression machine learning algorithm based on the gradient descent method is applied iteratively. The experimental results of comparing cutting line trace sample data demonstrate the accuracy and reliability of the proposed algorithm.
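
The similarity matching step mentioned above relies on dynamic time warping (DTW). The sketch below is a textbook DTW distance between two 1-D signals using absolute difference as the local cost; it is not the authors' implementation, and the function name `dtw_distance` is an assumption. The wavelet denoising and regression steps are omitted.

```python
from typing import Sequence


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Classic dynamic time warping distance between two 1-D signals.

    cost[i][j] holds the minimal accumulated cost of aligning the
    first i samples of `a` with the first j samples of `b`.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats its sample
                cost[i][j - 1],      # b advances, a repeats its sample
                cost[i - 1][j - 1],  # both advance together
            )
    return cost[n][m]


# A locally stretched copy of the same trace aligns at zero cost.
reference = [1.0, 2.0, 3.0]
stretched = [1.0, 2.0, 2.0, 3.0]
distance = dtw_distance(reference, stretched)
```

Because DTW allows samples to be repeated during alignment, two traces of the same shape recorded at slightly different speeds can still be matched, which a point-by-point comparison would miss.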

Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters

International Journal of Multimedia Data Engineering and Management ◽ 10.4018/ijmdem.2017010101 ◽ 2017 ◽ Vol 8 (1) ◽ pp. 1-20 ◽ Cited By ~ 15

Author(s): Yilin Yan ◽ Min Chen ◽ Saad Sadiq ◽ Mei-Ling Shyu

Keyword(s): Neural Network ◽ Deep Learning ◽ Convolutional Neural Network ◽ Imbalanced Data ◽ Network Models ◽ Multimedia Data ◽ Neural Network Models ◽ Minority Class ◽ Imbalanced Data Classification

The classification of imbalanced datasets has recently attracted significant attention due to its implications in several real-world use cases. Classifiers developed on datasets with skewed distributions tend to favor the majority classes and are biased against the minority class. Despite extensive research interest, imbalanced data classification remains a challenge in data mining research, especially for multimedia data. Our attempt to overcome this hurdle is to develop a convolutional neural network (CNN) based deep learning solution integrated with a bootstrapping technique. Considering that convolutional neural networks are computationally expensive, especially when coupled with large training datasets, we propose to extract features from pre-trained convolutional neural network models and feed those features to another fully connected neural network. A Spark implementation shows promising performance of our model in handling big datasets with respect to feasibility and scalability.
