Impact of Noise on Machine Learning-based Condition Monitoring Applications: a Case Study

Author(s):  
Roberto Bodo ◽  
Matteo Bertocco ◽  
Alberto Bianchi
2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Sergio Martin del Campo Barraza ◽  
William Lindskog ◽  
Davide Badalotti ◽  
Oskar Liew ◽  
Arash Toyser

Data-based models built using machine learning solutions are becoming more prominent in the condition monitoring, maintenance, and prognostics fields. The capacity to build these models using a machine learning approach depends largely in the quality of the data. Of particular importance is the availability of labelled data, which describes the conditions that are intended to be identified. However, properly labelled data that is useful in many machine learning strategies is a scare resource. Furthermore, producing high-quality labelled data is expensive, time-consuming and a lot of times inaccurate given the uncertainty surrounding the labeling process and the annotators.  Active Learning (AL) has emerged as a semi-supervised approach that enables cost and time reductions of the labeling process. This approach has had a delayed adoption for time series classification given the difficulty to extract and present the time series information in such a way that it is easy to understand for the human annotator who incorporates the labels. This difficulty arises from the large dimensionality that many of these time series possess. This challenge is exacerbated by the cold-start problem, where the initial labelled dataset used in typical AL frameworks may not exist. Thus, the initial set of labels to be allocated to the time series samples is not available. This last challenge is particularly common on many condition monitoring applications where data samples of specific faults or problems does not exist. In this article, we present an AL framework to be used in the classification of time series from industrial process data, in particular vibration waveforms originated from condition monitoring applications. In this framework, we deal with the absence of labels to train an initial classification model by introducing a pre-clustering step. This step uses an unsupervised clustering algorithm to identify the number of labels and selects the points with a stronger group belonging as initial samples to be labelled in the active learning step. Furthermore, this framework presents two approaches to present the information to the annotator that can be via time-series imaging and automatic extraction of statistical features. Our work is motivated by the interest to facilitate the effort required for labeling time-series waveforms, while maintaining a high level of accuracy and consistency on those labels. In addition, we study the number of time-series samples that require to be labelled to achieve different levels of classification accuracy, as well as their confidence intervals. These experiments are carried out using vibration signals from a well-known rolling element bearing dataset and typical process data from a production plant.   An active learning framework that considers the conditions of the data commonly found in maintenance and condition monitoring applications while presenting the data in ways easy to interpret by human annotators can facilitate the generation reliable datasets. These datasets can, in turn, assist in the development of data-driven models that describe the many different processes that a machine undergoes.


2021 ◽  
pp. 55-76
Author(s):  
Jacopo Cavalaglio Camargo Molano ◽  
Federico Campo ◽  
Luca Capelli ◽  
Giulia Massaccesi ◽  
Davide Borghi ◽  
...  

Author(s):  
Oscar Duque-Perez ◽  
Carlos Del Pozo-Gallego ◽  
Daniel Morinigo-Sotelo ◽  
Wagner Fontes Godoy

Condition monitoring of bearings is an open issue. The use of the stator current to monitor induction motors has been validated as a very advantageous and practical way to detect several types of faults. Nevertheless, for bearing faults the use of vibrations or sound generally offers better results in the accuracy of the detection although with some disadvantages related to the sensors used for monitoring. To improve the performance of bearing monitoring, it is proposed to take advantage of more information available in the current spectra, beyond the usually employed, incorporating the amplitude of a significant number of sidebands around the first eleven harmonics, growing exponentially the number of fault signatures. This is especially interesting for inverter-fed motors. But, in turn, this leads to the problem of overfitting when applying a classifier to perform the fault diagnosis. To overcome this problem, and still exploit all the useful information available in the spectra, it is proposed to use shrinkage methods, which have been lately proposed in machine learning to solve the overfitting issue when the problem has much more variables than examples to classify. A case study with a motor is shown to prove the validity of the proposal


Energies ◽  
2019 ◽  
Vol 12 (17) ◽  
pp. 3392 ◽  
Author(s):  
Oscar Duque-Perez ◽  
Carlos Del Pozo-Gallego ◽  
Daniel Morinigo-Sotelo ◽  
Wagner Fontes Godoy

Condition monitoring of bearings is an open issue. The use of the stator current to monitor induction motors has been validated as a very advantageous and practical way to detect several types of faults. Nevertheless, for bearing faults, the use of vibrations or sound generally offers better results in the accuracy of the detection, although with some disadvantages related to the sensors used for monitoring. To improve the performance of bearing monitoring, it is proposed to take advantage of more information available in the current spectra, beyond the usually employed, incorporating the amplitude of a significant number of sidebands around the first eleven harmonics, growing exponentially the number of fault signatures. This is especially interesting for inverter-fed motors. But, in turn, this leads to the problem of overfitting when applying a classifier to perform the fault diagnosis. To overcome this problem, and still exploit all the useful information available in the spectra, it is proposed to use shrinkage methods, which have been lately proposed in machine learning to solve the overfitting issue when the problem has many more variables than examples to classify. A case study with a motor is shown to prove the validity of the proposal.


Sign in / Sign up

Export Citation Format

Share Document