Machine Learning for Wind Turbine Fault Prediction through the Combination of Datasets from Same Type Turbines

Mapping Intimacies ◽

10.21741/9781644901731-7 ◽

2022 ◽

Author(s):

C. Bosch

Keyword(s):

Machine Learning ◽

Wind Turbine ◽

Data Augmentation ◽

Learning Algorithms ◽

Wind Farms ◽

Machine Learning Algorithms ◽

Fault Prediction ◽

System Failures ◽

Wide Range ◽

Early Fault Detection

Abstract. Early fault detection in wind turbines is key to reducing both costs and uncertainty in energy generation and in the operation of these structures. The isolation of many wind farms, especially those offshore, makes scheduled maintenance very costly and often inefficient. In addition, the downtime of these structures is typically long, and a predictive solution is much needed to 1) help prepare for the maintenance procedure beforehand, for instance to avoid delays while waiting for the required resources and components to become available, and 2) avoid the possibility of more destructive system failures. Predicting failures in such complex systems requires modeling multiple components both in isolation and as a whole. Physics-based and data-based models are used for this purpose and have proven useful. Specifically, machine learning algorithms have proven to be a valuable resource for a wide range of problems in this industry; however, accurately predicting the range of faults of a particular type of wind turbine is still a challenge. In this paper, we introduce the capabilities of machine learning for wind turbine fault prediction, as well as a technique to predict different types of faults. We compare the performance of two well-established machine learning algorithms (namely K-Nearest Neighbour and Random Forest classifiers) on real wind turbine data, where they produced high levels of prediction accuracy. We also propose data augmentation methods that merge data from turbines of the same type to enhance the training of ML models when wind turbine data is scarce.
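The idea of pooling data from same-type turbines before training a classifier can be illustrated with a minimal sketch. This is not the paper's implementation: the helper names (`merge_turbine_data`, `knn_predict`), the two features, and all readings below are hypothetical, and a plain k-nearest-neighbour vote stands in for the classifiers the abstract mentions.

```python
import math
from collections import Counter


def merge_turbine_data(*datasets):
    """Pool (features, label) rows from several turbines of the same type."""
    merged = []
    for rows in datasets:
        merged.extend(rows)
    return merged


def knn_predict(train, query, k=3):
    """Majority vote among the k training rows closest to the query (Euclidean)."""
    nearest = sorted(train, key=lambda row: math.dist(row[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical readings: (gearbox temperature in C, vibration in mm/s) -> label.
turbine_a = [((60.0, 1.0), "normal"), ((62.0, 1.2), "normal"), ((85.0, 4.0), "fault")]
turbine_b = [((61.0, 1.1), "normal"), ((88.0, 4.5), "fault"), ((90.0, 5.0), "fault")]

query = (87.0, 4.2)

# Trained on one turbine only, the single fault example is outvoted.
prediction_single = knn_predict(turbine_a, query, k=3)

# Trained on the merged data, the three nearest rows are all fault examples.
merged = merge_turbine_data(turbine_a, turbine_b)
prediction_merged = knn_predict(merged, query, k=3)
```

In this toy setup the merged set changes the vote only because the second turbine happens to contribute more fault examples; it says nothing about the accuracy reported in the paper. Real sensor features would normally be scaled before computing distances.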


Evaluation of three potential machine learning algorithms for predicting the velocity and turbulence intensity of a wind turbine wake

Renewable Energy ◽

10.1016/j.renene.2021.11.097 ◽

2021 ◽

Author(s):

Shantanu Purohit ◽

E.Y.K. Ng ◽

Ijaz Fazil Syed Ahmed Kabir

Keyword(s):

Machine Learning ◽

Wind Turbine ◽

Turbulence Intensity ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Turbine Wake ◽

Wind Turbine Wake


Taxonomy of machine learning algorithms in software fault prediction using object oriented metrics

Procedia Computer Science ◽

10.1016/j.procs.2018.05.115 ◽

2018 ◽

Vol 132 ◽

pp. 993-1001 ◽

Cited By ~ 7

Author(s):

Ajmer Singh ◽

Rajesh Bhatia ◽

Anita Singhrova

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Object Oriented ◽

Machine Learning Algorithms ◽

Fault Prediction ◽

Software Fault Prediction ◽

Software Fault ◽

Object Oriented Metrics


Identification of Leukemia Subtypes from Microscopic Images Using Convolutional Neural Network

Diagnostics ◽

10.3390/diagnostics9030104 ◽

2019 ◽

Vol 9 (3) ◽

pp. 104 ◽

Cited By ~ 11

Author(s):

Ahmed ◽

Yigit ◽

Isik ◽

Alpkocak

Keyword(s):

Machine Learning ◽

Data Augmentation ◽

Nearest Neighbor ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Training Data ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Set ◽

Leukemia Data

Leukemia is a fatal cancer and has two main types: acute and chronic. Each type has two more subtypes: lymphoid and myeloid. Hence, in total, there are four subtypes of leukemia. This study proposes a new approach for diagnosis of all subtypes of leukemia from microscopic blood cell images using convolutional neural networks (CNN), which require a large training data set. Therefore, we also investigated the effects of data augmentation for synthetically increasing the number of training samples. We used two publicly available leukemia data sources: ALL-IDB and ASH Image Bank. Next, we applied seven different image transformation techniques as data augmentation. We designed a CNN architecture capable of recognizing all subtypes of leukemia. In addition, we explored other well-known machine learning algorithms such as naive Bayes, support vector machine, k-nearest neighbor, and decision tree. To evaluate our approach, we set up a set of experiments and used 5-fold cross-validation. The results showed that our CNN model achieves 88.25% and 81.74% accuracy in leukemia versus healthy and multiclass classification of all subtypes, respectively. Finally, we also showed that the CNN model performs better than the other well-known machine learning algorithms.
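The combination of 5-fold cross-validation with transformation-based augmentation can be sketched as follows. This is an illustration under assumptions, not the study's pipeline: the abstract does not list its seven transformations, so two simple flips stand in for them, the images are tiny hypothetical 2x2 grids, and augmenting only the training fold is a design choice made here, not something the abstract states.

```python
def k_fold_indices(n_samples, k=5):
    """Yield (train_idx, test_idx) pairs for k contiguous, unshuffled folds."""
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = list(range(start, start + size))
        train_idx = [i for i in range(n_samples) if i < start or i >= start + size]
        yield train_idx, test_idx
        start += size


def flip_horizontal(image):
    """Mirror each row of a 2D image stored as a list of lists."""
    return [row[::-1] for row in image]


def flip_vertical(image):
    """Reverse the row order of a 2D image stored as a list of lists."""
    return image[::-1]


def augment(images):
    """Return each original image followed by its two flipped copies."""
    out = []
    for img in images:
        out.extend([img, flip_horizontal(img), flip_vertical(img)])
    return out


# Ten hypothetical 2x2 "images" standing in for microscopic blood cell images.
images = [[[i, i + 1], [i + 2, i + 3]] for i in range(10)]

folds = list(k_fold_indices(len(images), k=5))
train_idx, test_idx = folds[0]

# Augment the training portion only, so held-out images stay untouched.
train_images = augment([images[i] for i in train_idx])
test_images = [images[i] for i in test_idx]
```

In practice the sample order would usually be shuffled (and stratified by class) before the folds are cut; that step is left out to keep the sketch short.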


Utilization of machine-learning algorithms for wind turbine site suitability modeling in Iowa, USA

Wind Energy ◽

10.1002/we.1723 ◽

2014 ◽

Vol 18 (4) ◽

pp. 713-727 ◽

Cited By ~ 3

Author(s):

Andrey N. Petrov ◽

Jordan M. Wessling

Keyword(s):

Machine Learning ◽

Wind Turbine ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Site Suitability


Object Oriented Fault Prediction Analysis Using Machine Learning Algorithms

Lecture Notes in Electrical Engineering - ICDSMLA 2019 ◽

10.1007/978-981-15-1420-3_96 ◽

2020 ◽

pp. 886-892

Author(s):

Neha ◽

Ajay Jaiswal ◽

Abhishek Tandon

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Object Oriented ◽

Machine Learning Algorithms ◽

Fault Prediction ◽

Prediction Analysis


Wind Turbine Blade Damage Detection Using Supervised Machine Learning Algorithms

Journal of Vibration and Acoustics ◽

10.1115/1.4036951 ◽

2017 ◽

Vol 139 (6) ◽

Cited By ~ 14

Author(s):

Taylor Regan ◽

Christopher Beale ◽

Murat Inalpolat

Keyword(s):

Machine Learning ◽

Damage Detection ◽

Wind Turbine ◽

Health Monitoring ◽

Learning Algorithms ◽

Turbine Blades ◽

Machine Learning Algorithms ◽

Test Rig ◽

Wind Turbine Blades ◽

Baseline Characteristics

Wind turbine blades undergo high operational loads, experience variable environmental conditions, and are susceptible to failure due to defects, fatigue, and weather-induced damage. These large-scale composite structures are fundamentally enclosed acoustic cavities and currently have limited, if any, structural health monitoring (SHM) in place. A novel acoustics-based structural sensing and health monitoring technique is developed, requiring efficient algorithms for operational damage detection of cavity structures. This paper describes the selection of a set of statistical features for acoustics-based damage detection of enclosed cavities, such as wind turbine blades, as well as a systematic approach used in the identification of competent machine learning algorithms. Logistic regression (LR) and support vector machine (SVM) methods are identified and used with optimal feature selection for decision-making via binary classification algorithms. A laboratory-scale wind turbine with hollow composite blades was built for damage detection studies. This test rig allows for testing of stationary or rotating blades, from which time and frequency domain information can be collected to establish baseline characteristics. The test rig can then be used to observe any deviations from the baseline characteristics. An external microphone attached to the tower is used to monitor blade health while the blades are internally ensonified by wireless speakers. An initial test campaign with healthy and damaged blade specimens is carried out to arrive at several conclusions on the detectability and feature extraction capabilities required for damage detection.


Machine Learning Methods for Fear Classification Based on Physiological Features

Sensors ◽

10.3390/s21134519 ◽

2021 ◽

Vol 21 (13) ◽

pp. 4519

Author(s):

Livia Petrescu ◽

Cătălin Petrescu ◽

Ana Oprea ◽

Oana Mitruț ◽

Gabriela Moise ◽

...

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Dimensionality Reduction ◽

Data Augmentation ◽

Binary Classification ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Physiological Data ◽

Gradient Boosting ◽

Support Vector

This paper focuses on the binary classification of the emotion of fear, based on the physiological data and subjective responses stored in the DEAP dataset. We performed a mapping between the discrete and dimensional emotional information considering the participants’ ratings and extracted a substantial set of 40 types of features from the physiological data. These features served as the input to various machine learning algorithms (Decision Trees, k-Nearest Neighbors, Support Vector Machine and artificial networks), accompanied by dimensionality reduction, feature selection and the tuning of the most relevant hyperparameters to boost classification accuracy. Our methodology addressed several issues: resolving dataset imbalance through data augmentation, reducing overfitting, computing various metrics in order to obtain the most reliable classification scores, and applying the Local Interpretable Model-Agnostic Explanations method to interpret and explain predictions in a human-understandable manner. The results show that fear can be predicted very well (accuracies ranging from 91.7% using Gradient Boosting Trees to 93.5% using dimensionality reduction and Support Vector Machine) by extracting the most relevant features from the physiological data and by searching for the parameters that maximize the machine learning algorithms’ classification scores.


Medium and Long-Term Precipitation Forecasting Method Based on Data Augmentation and Machine Learning Algorithms

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ◽

10.1109/jstars.2022.3140442 ◽

2022 ◽

pp. 1-1

Author(s):

Tiantian Tang ◽

Donglai Jiao ◽

Tao Chen ◽

Guan Gui

Keyword(s):

Machine Learning ◽

Data Augmentation ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Precipitation Forecasting ◽

Forecasting Method


Review of machine learning algorithms' application in pharmaceutical technology

Arhiv za farmaciju ◽

10.5937/arhfarm71-32499 ◽

2021 ◽

Vol 71 (4) ◽

pp. 302-317

Author(s):

Jelena Đuriš ◽

Ivana Kurćubić ◽

Svetlana Ibrić

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Data Science ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

Formulation Development ◽

Light Gradient ◽

Pharmaceutical Technology ◽

Wide Range

Machine learning algorithms, and artificial intelligence in general, have a wide range of applications in the field of pharmaceutical technology. From formulation development to their great potential for integration within the Quality by Design framework, these data science tools provide a better understanding of pharmaceutical formulations and their processing. Machine learning algorithms can be especially helpful in analyzing the large volume of data generated by Process Analytical Technologies. This paper provides a brief explanation of artificial neural networks, one of the most frequently used machine learning algorithms. The process of network training and testing is described and accompanied by illustrative examples of machine learning tools applied in the context of pharmaceutical formulation development and related technologies, as well as an overview of future trends. Recently published studies on more sophisticated methods, such as deep neural networks and the light gradient boosting machine algorithm, are described. The interested reader is also referred to several official documents (guidelines) that pave the way for a more structured representation of machine learning models in prospective submissions to regulatory bodies.


Domestic Cat Sound Classification Using Learned Features from Deep Neural Nets

Applied Sciences ◽

10.3390/app8101949 ◽

2018 ◽

Vol 8 (10) ◽

pp. 1949 ◽

Cited By ~ 10

Author(s):

Yagya Raj Pandeya ◽

Dongwhoon Kim ◽

Joonwhoan Lee

Keyword(s):

Machine Learning ◽

Data Augmentation ◽

Learning Algorithms ◽

Area Under The Curve ◽

Machine Learning Algorithms ◽

Neural Nets ◽

Domestic Cat ◽

Neural Net ◽

Original Dataset ◽

Learned Features

The domestic cat (Felis catus) is one of the most attractive pets in the world, and it generates mysterious kinds of sound according to its mood and situation. In this paper, we deal with the automatic classification of cat sounds using machine learning. A machine learning approach to classification requires class-labeled data, so our work starts with building a small dataset named CatSound across 10 categories. Along with the original dataset, we increase the amount of data with various audio data augmentation methods to help our classification task. In this study, we use two types of learned features from deep neural networks: one from a convolutional neural net (CNN) pre-trained on music data and applied by transfer learning, and the other from an unsupervised convolutional deep belief network (CDBN) that is trained solely on a collected set of cat sounds. In addition to conventional GAP, we propose an effective pooling method called FDAP to explore a number of meaningful features. In FDAP, the frequency dimension is roughly divided and then average pooling is applied in each division. For the classification, we used five different machine learning algorithms and an ensemble of them. We compare the classification performance with respect to the following factors: the amount of data increased by augmentation, the learned features from the pre-trained CNN or the unsupervised CDBN, conventional GAP or FDAP, and the machine learning algorithms used for the classification. As expected, the proposed FDAP features, with the larger amount of data produced by augmentation and combined with the ensemble approach, produced the best accuracy. Moreover, the learned features from both the pre-trained CNN and the unsupervised CDBN produce good results in the experiment. With the combination of all those positive factors, we obtained the best result of 91.13% in accuracy, 0.91 in f1-score, and 0.995 in area under the curve (AUC) score.
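The pooling step described in the abstract (divide the frequency dimension, then average within each division) can be contrasted with a single global average in a short sketch. It assumes GAP means global average pooling, that a feature map is a frequency-by-time grid stored as a list of rows, and that the divisions are near-equal contiguous bands; the function names and the `n_bands` parameter are hypothetical and not taken from the paper.

```python
def global_average_pool(feature_map):
    """Average every value of a frequency x time feature map into one number."""
    values = [v for row in feature_map for v in row]
    return sum(values) / len(values)


def frequency_division_average_pool(feature_map, n_bands):
    """Split the frequency axis into n_bands contiguous bands and average each."""
    n_freq = len(feature_map)
    if not 1 <= n_bands <= n_freq:
        raise ValueError("n_bands must be between 1 and the number of frequency rows")
    pooled = []
    for b in range(n_bands):
        lo = b * n_freq // n_bands          # first frequency row of this band
        hi = (b + 1) * n_freq // n_bands    # one past the last row of this band
        band = [v for row in feature_map[lo:hi] for v in row]
        pooled.append(sum(band) / len(band))
    return pooled


# Hypothetical feature map: 4 frequency rows x 2 time steps.
feature_map = [
    [1.0, 1.0],
    [3.0, 3.0],
    [5.0, 5.0],
    [7.0, 7.0],
]

gap_feature = global_average_pool(feature_map)                   # 4.0
fdap_features = frequency_division_average_pool(feature_map, 2)  # [2.0, 6.0]
```

The global average collapses the map to one value, while the banded version keeps one value per frequency band, so low-frequency and high-frequency activity remain distinguishable in the pooled feature vector.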
