A Three Layer Super Learner Ensemble with Hyperparameter Optimization to Improve the Performance of Machine Learning Model

A combination of different machine learning models to form a super learner can definitely lead to improved predictions in any domain. The super learner ensemble discussed in this study collates several machine learning models and proposes to enhance the performance by considering the final meta- model accuracy and the prediction duration. An algorithm is proposed to rate the machine learning models derived by combining the base classifiers voted with different weights. The proposed algorithm is named as Log Loss Weighted Super Learner Model (LLWSL). Based on the voted weight, the optimal model is selected and the machine learning method derived is identified. The meta- learner of the super learner uses them by tuning their hyperparameters. The execution time and the model accuracies were evaluated using two separate datasets inside LMSSLIITD extracted from the educational industry by executing the LLWSL algorithm. According to the outcome of the evaluation process, it has been noticed that there exists a significant improvement in the proposed algorithm LLWSL for use in machine learning tasks for the achievement of better performances.

Download Full-text

A Comparative Study of Machine Learning Models with Hyperparameter Optimization Algorithm for Mapping Mineral Prospectivity

Minerals ◽

10.3390/min11020159 ◽

2021 ◽

Vol 11 (2) ◽

pp. 159

Author(s):

Nan Lin ◽

Yongliang Chen ◽

Haiqi Liu ◽

Hanlin Liu

Keyword(s):

Machine Learning ◽

Swarm Intelligence ◽

Optimization Algorithm ◽

Geochemical Data ◽

Support Vector ◽

Learning Models ◽

Hyperparameter Optimization ◽

Mineral Prospectivity ◽

Swarm Intelligence Optimization ◽

Machine Learning Models

Selecting internal hyperparameters, which can be set by the automatic search algorithm, is important to improve the generalization performance of machine learning models. In this study, the geological, remote sensing and geochemical data of the Lalingzaohuo area in Qinghai province were researched. A multi-source metallogenic information spatial data set was constructed by calculating the Youden index for selecting potential evidence layers. The model for mapping mineral prospectivity of the study area was established by combining two swarm intelligence optimization algorithms, namely the bat algorithm (BA) and the firefly algorithm (FA), with different machine learning models. The receiver operating characteristic (ROC) and prediction-area (P-A) curves were used for performance evaluation and showed that the two algorithms had an obvious optimization effect. The BA and FA differentiated in improving multilayer perceptron (MLP), AdaBoost and one-class support vector machine (OCSVM) models; thus, there was no optimization algorithm that was consistently superior to the other. However, the accuracy of the machine learning models was significantly enhanced after optimizing the hyperparameters. The area under curve (AUC) values of the ROC curve of the optimized machine learning models were all higher than 0.8, indicating that the hyperparameter optimization calculation was effective. In terms of individual model improvement, the accuracy of the FA-AdaBoost model was improved the most significantly, with the AUC value increasing from 0.8173 to 0.9597 and the prediction/area (P/A) value increasing from 3.156 to 10.765, where the mineral targets predicted by the model occupied 8.63% of the study area and contained 92.86% of the known mineral deposits. The targets predicted by the improved machine learning models are consistent with the metallogenic geological characteristics, indicating that the swarm intelligence optimization algorithm combined with the machine learning model is an efficient method for mineral prospectivity mapping.

Download Full-text

Grid search in hyperparameter optimization of machine learning models for prediction of HIV/AIDS test results

International Journal of Computers and Applications ◽

10.1080/1206212x.2021.1974663 ◽

2021 ◽

pp. 1-12

Author(s):

Daniel Mesafint Belete ◽

Manjaiah D. Huchaiah

Keyword(s):

Machine Learning ◽

Grid Search ◽

Test Results ◽

Learning Models ◽

Hyperparameter Optimization ◽

Machine Learning Models ◽

Hiv Aids

Download Full-text

Comparison of machine learning models based on time domain and frequency domain features for faults diagnosis in rotating machines

MATEC Web of Conferences ◽

10.1051/matecconf/201821117009 ◽

2018 ◽

Vol 211 ◽

pp. 17009

Author(s):

Natalia Espinoza Sepulveda ◽

Jyoti Sinha

Keyword(s):

Machine Learning ◽

Frequency Domain ◽

Time Domain ◽

Intelligent Systems ◽

Learning Models ◽

Machine Vibration ◽

Vibration Data ◽

Machine Learning Model ◽

The Time Domain ◽

Machine Learning Models

The development of technologies for the maintenance industry has taken an important role to meet the demanding challenges. One of the important challenges is to predict the defects, if any, in machines as early as possible to manage the machines downtime. The vibration-based condition monitoring (VCM) is well-known for this purpose but requires the human experience and expertise. The machine learning models using the intelligent systems and pattern recognition seem to be the future avenue for machine fault detection without the human expertise. Several such studies are published in the literature. This paper is also on the machine learning model for the different machine faults classification and detection. Here the time domain and frequency domain features derived from the measured machine vibration data are used separated in the development of the machine learning models using the artificial neutral network method. The effectiveness of both the time and frequency domain features based models are compared when they are applied to an experimental rig. The paper presents the proposed machine learning models and their performance in terms of the observations and results.

Download Full-text

Automated Retraining of Machine Learning Models

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.l3322.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 445-452

Keyword(s):

Machine Learning ◽

Input Data ◽

Research Work ◽

Learning Models ◽

Machine Learning Methods ◽

Machine Learning Model ◽

Crucial Component ◽

Conventional Machine ◽

Over Time ◽

Machine Learning Models

Data is the most crucial component of a successful ML system. Once a machine learning model is developed, it gets obsolete over time due to presence of new input data being generated every second. In order to keep our predictions accurate we need to find a way to keep our models up to date. Our research work involves finding a mechanism which can retrain the model with new data automatically. This research also involves exploring the possibilities of automating machine learning processes. We started this project by training and testing our model using conventional machine learning methods. The outcome was then compared with the outcome of those experiments conducted using the AutoML methods like TPOT. This helped us in finding an efficient technique to retrain our models. These techniques can be used in areas where people do not deal with the actual working of a ML model but only require the outputs of ML processes

Download Full-text

TOPICAL ISSUES OF APPLICATION OF MACHINE LEARNING METHODS IN ECONOMY

Инновационные аспекты развития науки и техники. Сборник статей VIII Международной научно-практической конференции: сборник статей, [электронное издание сетевого распространения] / Под ред. Н.В. Емельянова. – М.: “КДУ”, “Добросвет”, 2021. – 149 с. ◽

10.31453/kdu.ru.978-5-7913-1176-4-2021-28-33 ◽

2021 ◽

Author(s):

Natalia Pavlovna Persteneva ◽

◽

Darya Dmitrievn Skryleva ◽

Keyword(s):

Machine Learning ◽

Unsupervised Learning ◽

Supervised Learning ◽

Learning Model ◽

Learning Models ◽

Learning Methods ◽

Machine Learning Methods ◽

Machine Learning Model ◽

Popular Classes ◽

Machine Learning Models

The article discusses machine learning methods. Using the example of two popular classes: supervised learning and unsupervised learning. Variants of the main types of machine learning models for each method are presented. A generalized algorithm for building any machine learning model is formed.

Download Full-text

Improving Logging Prediction on Imbalanced Datasets

International Journal of Open Source Software and Processes ◽

10.4018/ijossp.2016040103 ◽

2016 ◽

Vol 7 (2) ◽

pp. 43-71 ◽

Cited By ~ 3

Author(s):

Sangeeta Lal ◽

Neetu Sardana ◽

Ashish Sureka

Keyword(s):

Machine Learning ◽

Open Source ◽

Class Imbalance ◽

Learning Model ◽

Learning Models ◽

Class Imbalance Problem ◽

Imbalanced Datasets ◽

Imbalance Problem ◽

Machine Learning Model ◽

Machine Learning Models

Logging is an important yet tough decision for OSS developers. Machine-learning models are useful in improving several steps of OSS development, including logging. Several recent studies propose machine-learning models to predict logged code construct. The prediction performances of these models are limited due to the class-imbalance problem since the number of logged code constructs is small as compared to non-logged code constructs. No previous study analyzes the class-imbalance problem for logged code construct prediction. The authors first analyze the performances of J48, RF, and SVM classifiers for catch-blocks and if-blocks logged code constructs prediction on imbalanced datasets. Second, the authors propose LogIm, an ensemble and threshold-based machine-learning model. Third, the authors evaluate the performance of LogIm on three open-source projects. On average, LogIm model improves the performance of baseline classifiers, J48, RF, and SVM, by 7.38%, 9.24%, and 4.6% for catch-blocks, and 12.11%, 14.95%, and 19.13% for if-blocks logging prediction.

Download Full-text

Neural Networks in ECG Classification

Neural Networks in Healthcare ◽

10.4018/978-1-59140-848-2.ch004 ◽

2011 ◽

pp. 81-104 ◽

Cited By ~ 2

Author(s):

G. Camps-Valls ◽

J. F. Guerrero-Martinez

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Feature Selection ◽

Adaptive Systems ◽

Clinical Environment ◽

Learning Models ◽

Cardiac Pathology ◽

Learning Tasks ◽

Beat Detection ◽

Machine Learning Models

In this chapter, we review the vast field of application of artificial neural networks in cardiac pathology discrimination based on electrocardiographic signals. We discuss advantages and drawbacks of neural and adaptive systems in cardiovascular medicine and catch a glimpse of forthcoming developments in machine learning models for the real clinical environment. Some problems are identified in the learning tasks of beat detection, feature selection/extraction, and classification, and some proposals and suggestions are given to alleviate the problems of interpretability, overfitting, and adaptation. These have become important problems in recent years and will surely constitute the basis of some investigations in the immediate future.

Download Full-text

GPT-3 AI language tool calls for cautious optimism

Emerald Expert Briefings ◽

10.1108/oxan-db256373 ◽

2020 ◽

Keyword(s):

Machine Learning ◽

Public Funding ◽

Computational Effort ◽

Learning Models ◽

Content Type ◽

Business Applications ◽

Machine Learning Model ◽

Explainable Ai ◽

Advance Research ◽

Machine Learning Models

Significance It required arguably the single largest computational effort for a machine learning model to date, and is it capable of producing text at times indistinguishable from the work of a human author. This has generated considerable excitement about potentially transformative business applications -- and concerns about the system's weaknesses and possible misuse. Impacts Stereotypes and biases in machine learning models will become increasingly problematic as they are adopted by businesses and governments. The use of flawed AI tools that result in embarrassing failures risk cuts to public funding for AI research. Academia and industry face pressure to advance research into explainable AI, but progress is slow.

Download Full-text

Unscented Kalman Filter-Aided Long Short-Term Memory Approach for Wind Nowcasting

Aerospace ◽

10.3390/aerospace8090236 ◽

2021 ◽

Vol 8 (9) ◽

pp. 236

Author(s):

Junghyun Kim ◽

Kyuman Lee

Keyword(s):

Machine Learning ◽

Short Term Memory ◽

Unscented Kalman Filter ◽

Aviation Industry ◽

Learning Models ◽

Short Term ◽

Machine Learning Model ◽

Long Short Term Memory ◽

Lstm Network ◽

Machine Learning Models

Obtaining reliable wind information is critical for efficiently managing air traffic and airport operations. Wind forecasting has been considered one of the most challenging tasks in the aviation industry. Recently, with the advent of artificial intelligence, many machine learning techniques have been widely used to address a variety of complex phenomena in wind predictions. In this paper, we propose a hybrid framework that combines a machine learning model with Kalman filtering for a wind nowcasting problem in the aviation industry. More specifically, this study has three objectives as follows: (1) compare the performance of the machine learning models (i.e., Gaussian process, multi-layer perceptron, and long short-term memory (LSTM) network) to identify the most appropriate model for wind predictions, (2) combine the machine learning model selected in step (1) with an unscented Kalman filter (UKF) to improve the fidelity of the model, and (3) perform Monte Carlo simulations to quantify uncertainties arising from the modeling process. Results show that short-term time-series wind datasets are best predicted by the LSTM network compared to the other machine learning models and the UKF-aided LSTM (UKF-LSTM) approach outperforms the LSTM network only, especially when long-term wind forecasting needs to be considered.

Download Full-text

Hybrid Machine Learning Model for Body Fat Percentage Prediction Based on Support Vector Regression and Emotional Artificial Neural Networks

Applied Sciences ◽

10.3390/app11219797 ◽

2021 ◽

Vol 11 (21) ◽

pp. 9797

Author(s):

Solaf A. Hussain ◽

Nadire Cavus ◽

Boran Sekeroglu

Keyword(s):

Machine Learning ◽

Body Fat ◽

Support Vector ◽

Body Fat Percentage ◽

Learning Models ◽

Fat Percentage ◽

Machine Learning Model ◽

Proposed Model ◽

Hybrid Machine ◽

Machine Learning Models

Obesity or excessive body fat causes multiple health problems and diseases. However, obesity treatment and control need an accurate determination of body fat percentage (BFP). The existing methods for BFP estimation require several procedures, which reduces their cost-effectivity and generalization. Therefore, developing cost-effective models for BFP estimation is vital for obesity treatment. Machine learning models, particularly hybrid models, have a strong ability to analyze challenging data and perform predictions by combining different characteristics of the models. This study proposed a hybrid machine learning model based on support vector regression and emotional artificial neural networks (SVR-EANNs) for accurate recent BFP prediction using a primary BFP dataset. SVR was applied as a consistent attribute selection model on seven properties and measurements, using the left-out sensitivity analysis, and the regression ability of the EANN was considered in the prediction phase. The proposed model was compared to seven benchmark machine learning models. The obtained results show that the proposed hybrid model (SVR-EANN) outperformed other machine learning models by achieving superior results in the three considered evaluation metrics. Furthermore, the proposed model suggested that abdominal circumference is a significant factor in BFP prediction, while age has a minor effect.

Download Full-text