Improving Machine Learning Diagnostic Systems with Model-Based Data Augmentation - Part A: Data Generation

Machine-learning diagnostic systems are widely used to detect abnormal conditions in electrical equipment. Training robust and accurate diagnostic systems is challenging because only small databases of abnormal-condition data are available. However, the performance of the diagnostic systems depends on the quantity and quality of the data. The training database can be augmented utilizing data augmentation techniques that generate synthetic data to improve diagnostic performance. However, existing data augmentation techniques are generic methods that do not include additional information in the synthetic data. In this paper, we develop a model-based data augmentation technique integrating computer-implementable electromechanical models. Synthetic normal- and abnormal-condition data are generated with an electromechanical model and a stochastic parameter value sampling method. The model-based data augmentation is showcased to detect an abnormal condition of a distribution transformer. First, the synthetic data are compared with the measurements to verify the synthetic data. Then, ML-based diagnostic systems are created using model-based data augmentation and are compared with state-of-the-art diagnostic systems. It is shown that using the model-based data augmentation results in an improved accuracy compared to state-of-the-art diagnostic systems. This holds especially true when only a small abnormal-condition database is available.

Download Full-text

Improving Machine Learning Diagnostic Systems with Model-Based Data Augmentation ― Part B: Application

10.1109/isgteurope52324.2021.9640050 ◽

2021 ◽

Author(s):

Jannis Nikolas Kahlen ◽

Andre Wurde ◽

Michael Andres ◽

Albert Moser

Keyword(s):

Machine Learning ◽

Data Augmentation ◽

Diagnostic Systems ◽

Model Based

Download Full-text

An efficient machine-learning model based on data augmentation for pain intensity recognition

Egyptian Informatics Journal ◽

10.1016/j.eij.2020.02.006 ◽

2020 ◽

Vol 21 (4) ◽

pp. 241-257

Author(s):

Ahmad Al-Qerem

Keyword(s):

Machine Learning ◽

Pain Intensity ◽

Data Augmentation ◽

Learning Model ◽

Model Based ◽

Machine Learning Model ◽

Efficient Machine

Download Full-text

Mind wandering as data augmentation: How mental travel supports abstraction

Behavioral and Brain Sciences ◽

10.1017/s0140525x1900311x ◽

2020 ◽

Vol 43 ◽

Author(s):

Myrthe Faber

Keyword(s):

Machine Learning ◽

Data Augmentation ◽

Mental Content ◽

Mind Wandering ◽

Theoretical Framework ◽

Important Addition

Abstract Gilead et al. state that abstraction supports mental travel, and that mental travel critically relies on abstraction. I propose an important addition to this theoretical framework, namely that mental travel might also support abstraction. Specifically, I argue that spontaneous mental travel (mind wandering), much like data augmentation in machine learning, provides variability in mental content and context necessary for abstraction.

Download Full-text

Enhancement of Image Classification through Data Augmentation using Machine Learning

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i9.220224 ◽

2018 ◽

Vol 6 (9) ◽

pp. 220-224

Author(s):

Th. S. Kumar

Keyword(s):

Machine Learning ◽

Image Classification ◽

Data Augmentation

Download Full-text

Building Damage Detection from Post-Event Aerial Imagery Using Single Shot Multibox Detector

Applied Sciences ◽

10.3390/app9061128 ◽

2019 ◽

Vol 9 (6) ◽

pp. 1128 ◽

Cited By ~ 12

Author(s):

Yundong Li ◽

Wei Hu ◽

Han Dong ◽

Xueyan Zhang

Keyword(s):

Machine Learning ◽

Data Augmentation ◽

Hurricane Sandy ◽

Training Data ◽

Aerial Images ◽

Detection Methods ◽

Single Shot ◽

Data Set ◽

Augmentation Strategies ◽

Post Disaster

Using aerial cameras, satellite remote sensing or unmanned aerial vehicles (UAV) equipped with cameras can facilitate search and rescue tasks after disasters. The traditional manual interpretation of huge aerial images is inefficient and could be replaced by machine learning-based methods combined with image processing techniques. Given the development of machine learning, researchers find that convolutional neural networks can effectively extract features from images. Some target detection methods based on deep learning, such as the single-shot multibox detector (SSD) algorithm, can achieve better results than traditional methods. However, the impressive performance of machine learning-based methods results from the numerous labeled samples. Given the complexity of post-disaster scenarios, obtaining many samples in the aftermath of disasters is difficult. To address this issue, a damaged building assessment method using SSD with pretraining and data augmentation is proposed in the current study and highlights the following aspects. (1) Objects can be detected and classified into undamaged buildings, damaged buildings, and ruins. (2) A convolution auto-encoder (CAE) that consists of VGG16 is constructed and trained using unlabeled post-disaster images. As a transfer learning strategy, the weights of the SSD model are initialized using the weights of the CAE counterpart. (3) Data augmentation strategies, such as image mirroring, rotation, Gaussian blur, and Gaussian noise processing, are utilized to augment the training data set. As a case study, aerial images of Hurricane Sandy in 2012 were maximized to validate the proposed method’s effectiveness. Experiments show that the pretraining strategy can improve of 10% in terms of overall accuracy compared with the SSD trained from scratch. These experiments also demonstrate that using data augmentation strategies can improve mAP and mF1 by 72% and 20%, respectively. Finally, the experiment is further verified by another dataset of Hurricane Irma, and it is concluded that the paper method is feasible.

Download Full-text

Data Augmentation for Machine Learning-Based Hardware Trojan Detection at Gate-Level Netlists

2021 IEEE 27th International Symposium on On-Line Testing and Robust System Design (IOLTS) ◽

10.1109/iolts52814.2021.9486713 ◽

2021 ◽

Author(s):

Kento Hasegawa ◽

Seira Hidano ◽

Kohei Nozawa ◽

Shinsaku Kiyomoto ◽

Nozomu Togawa

Keyword(s):

Machine Learning ◽

Data Augmentation ◽

Hardware Trojan ◽

Hardware Trojan Detection ◽

Trojan Detection

Download Full-text

Field-Scale Soil Moisture Retrieval Using PALSAR-2 Polarimetric Decomposition and Machine Learning

Agronomy ◽

10.3390/agronomy11010035 ◽

2020 ◽

Vol 11 (1) ◽

pp. 35

Author(s):

Xiaodong Huang ◽

Beth Ziniti ◽

Michael H. Cosh ◽

Michele Reba ◽

Jinfei Wang ◽

...

Keyword(s):

Machine Learning ◽

Soil Moisture ◽

Canopy Cover ◽

The United States ◽

Optical Data ◽

Study Region ◽

Model Based ◽

Key Indicator ◽

Ground Measurement ◽

L Band

Soil moisture is a key indicator to assess cropland drought and irrigation status as well as forecast production. Compared with the optical data which are obscured by the crop canopy cover, the Synthetic Aperture Radar (SAR) is an efficient tool to detect the surface soil moisture under the vegetation cover due to its strong penetration capability. This paper studies the soil moisture retrieval using the L-band polarimetric Phased Array-type L-band SAR 2 (PALSAR-2) data acquired over the study region in Arkansas in the United States. Both two-component model-based decomposition (SAR data alone) and machine learning (SAR + optical indices) methods are tested and compared in this paper. Validation using independent ground measurement shows that the both methods achieved a Root Mean Square Error (RMSE) of less than 10 (vol.%), while the machine learning methods outperform the model-based decomposition, achieving an RMSE of 7.70 (vol.%) and R2 of 0.60.

Download Full-text

MODES: model-based optimization on distributed embedded systems

Machine Learning ◽

10.1007/s10994-021-06014-6 ◽

2021 ◽

Author(s):

Junjie Shi ◽

Jiang Bian ◽

Jakob Richter ◽

Kuan-Hsun Chen ◽

Jörg Rahnenführer ◽

...

Keyword(s):

Machine Learning ◽

Embedded Systems ◽

Learning Model ◽

Black Box ◽

Distributed Embedded Systems ◽

Data Set ◽

Individual Model ◽

Model Based ◽

Machine Learning Model ◽

Distributed Machine Learning

AbstractThe predictive performance of a machine learning model highly depends on the corresponding hyper-parameter setting. Hence, hyper-parameter tuning is often indispensable. Normally such tuning requires the dedicated machine learning model to be trained and evaluated on centralized data to obtain a performance estimate. However, in a distributed machine learning scenario, it is not always possible to collect all the data from all nodes due to privacy concerns or storage limitations. Moreover, if data has to be transferred through low bandwidth connections it reduces the time available for tuning. Model-Based Optimization (MBO) is one state-of-the-art method for tuning hyper-parameters but the application on distributed machine learning models or federated learning lacks research. This work proposes a framework $$\textit{MODES}$$ MODES that allows to deploy MBO on resource-constrained distributed embedded systems. Each node trains an individual model based on its local data. The goal is to optimize the combined prediction accuracy. The presented framework offers two optimization modes: (1) $$\textit{MODES}$$ MODES -B considers the whole ensemble as a single black box and optimizes the hyper-parameters of each individual model jointly, and (2) $$\textit{MODES}$$ MODES -I considers all models as clones of the same black box which allows it to efficiently parallelize the optimization in a distributed setting. We evaluate $$\textit{MODES}$$ MODES by conducting experiments on the optimization for the hyper-parameters of a random forest and a multi-layer perceptron. The experimental results demonstrate that, with an improvement in terms of mean accuracy ($$\textit{MODES}$$ MODES -B), run-time efficiency ($$\textit{MODES}$$ MODES -I), and statistical stability for both modes, $$\textit{MODES}$$ MODES outperforms the baseline, i.e., carry out tuning with MBO on each node individually with its local sub-data set.

Download Full-text

MODEL-BASED DESIGN OF AM COMPONENTS TO ENABLE DECENTRALIZED DIGITAL MANUFACTURING SYSTEMS

Proceedings of the Design Society ◽

10.1017/pds.2021.474 ◽

2021 ◽

Vol 1 ◽

pp. 2127-2136

Author(s):

Olivia Borgue ◽

John Stavridis ◽

Tomas Vannucci ◽

Panagiotis Stavropoulos ◽

Harry Bikas ◽

...

Keyword(s):

Additive Manufacturing ◽

Manufacturing Systems ◽

Production Systems ◽

Data Generation ◽

Production Networks ◽

Model Based ◽

Manufacturing Technologies ◽

Am Processes ◽

Final Consumer

AbstractAdditive manufacturing (AM) is a versatile technology that could add flexibility in manufacturing processes, whether implemented alone or along other technologies. This technology enables on-demand production and decentralized production networks, as production facilities can be located around the world to manufacture products closer to the final consumer (decentralized manufacturing). However, the wide adoption of additive manufacturing technologies is hindered by the lack of experience on its implementation, the lack of repeatability among different manufacturers and a lack of integrated production systems. The later, hinders the traceability and quality assurance of printed components and limits the understanding and data generation of the AM processes and parameters. In this article, a design strategy is proposed to integrate the different phases of the development process into a model-based design platform for decentralized manufacturing. This platform is aimed at facilitating data traceability and product repeatability among different AM machines. The strategy is illustrated with a case study where a car steering knuckle is manufactured in three different facilities in Sweden and Italy.

Download Full-text