A Novel Intrusion Detection Approach Using Machine Learning Ensemble for IoT Environments

The Internet of Things (IoT) has gained significant importance due to its applicability in diverse environments. Another reason for the influence of the IoT is its use of a flexible and scalable framework. The extensive and diversified use of the IoT in the past few years has attracted cyber-criminals. They exploit the vulnerabilities of the open-source IoT framework, which arise from the absence of robust and standard security protocols, and this discourages existing and potential stakeholders. The authors propose a binary classifier approach developed from a machine learning ensemble method to filter and dump malicious traffic, preventing malicious actors from accessing the IoT network and its peripherals. The gradient boosting machine (GBM) ensemble approach is used to train the binary classifier on pre-processed recorded data packets to detect anomalies and protect IoT networks from zero-day attacks. The positive-class performance metrics of the model were an accuracy of 98.27%, a precision of 96.40%, and a recall of 95.70%. The simulation results demonstrate the effectiveness of the proposed model against cyber threats, making it suitable for critical IoT applications.
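The accuracy, precision, and recall quoted in this abstract are all derived from a binary classifier's confusion matrix. As a minimal sketch of that arithmetic, the function name `binary_metrics` is hypothetical and the counts are invented for illustration; they are not the paper's data.

```python
def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, precision, recall) for the positive class."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    # Guard against empty denominators when nothing is predicted or labelled positive.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, precision, recall


# Illustrative counts only (assumed, not taken from the study).
acc, prec, rec = binary_metrics(tp=90, fp=10, tn=880, fn=20)
```

With a small malicious class, accuracy can stay high even when recall drops, which is presumably why the abstract reports all three figures for the positive class.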

A Predictive Model for Welfare Attitudes of Cohorts: Using Gradient Boosting Machine Learning Algorithm

Social Science Review ◽

10.31502/ssri.52.2.5 ◽

2021 ◽

Vol 52 (2) ◽

pp. 91-114

Author(s):

KiHye Hong ◽

Tae Ho Eom

Keyword(s):

Machine Learning ◽

Predictive Model ◽

Learning Algorithm ◽

Gradient Boosting ◽

Machine Learning Algorithm ◽

Gradient Boosting Machine ◽

Welfare Attitudes

Data Analytics for Monitoring the Satisfactory Parameters of Airline Passengers using Machine Learning Algorithms in Python

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.c8677.019320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 1231-1235

Keyword(s):

Machine Learning ◽

Data Analytics ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

Complex Information ◽

Huge Data ◽

Gradient Boosting Machine ◽

Airline Passengers ◽

Effective Representation

Machine learning algorithms can represent results effectively, especially for Big Data, and numerous applications can produce such outcomes. Random Forest (RF), Gradient Boosting Machine (GBM), and Decision Tree (DT) algorithms in Python can give higher accuracy when classifying the various parameters of airline passengers' satisfaction levels. The complex information of airline passengers has provided a large volume of data for interpretation through different satisfaction parameters. An algorithm has to support classifying these data accurately. Some methods may provide less precision, and conventional techniques carry a risk of information cancellation and missing information. RF and GBM are therefore used to overcome the unpredictability and improve the exactness of the information provided. The aim of this study is to identify an algorithm suitable for classifying the satisfaction level of airline passengers with data analytics in Python, given the known output. Optimizing and implementing the independent variables through training and testing for accuracy on the Python platform determined the variation between the parameters and identified RF and GBM as better algorithms than the other classification algorithms.

Development of a Diabetes Mellitus Detection and Prediction Model Using Light Gradient Boosting Machine and K-Nearest Neighbour

10.36108/ujees/1202.30.0160 ◽

2021 ◽

Vol 3 (1) ◽

Author(s):

B. A Omodunbi

Keyword(s):

Diabetes Mellitus ◽

Machine Learning ◽

Hybrid Model ◽

Learning Model ◽

Experimental Result ◽

Gradient Boosting ◽

Light Gradient ◽

Machine Learning Model ◽

Gradient Boosting Machine ◽

Receiver Operating

Diabetes mellitus is a health disorder that occurs when the blood sugar level becomes extremely high because the body is unable to produce the required amount of insulin. The ailment is among the major causes of death in Nigeria and the world at large. This study was carried out to detect diabetes mellitus by developing a hybrid model that comprises two machine learning models, namely Light Gradient Boosting Machine (LGBM) and K-Nearest Neighbor (KNN). This research is aimed at developing a machine learning model for detecting the occurrence of diabetes in patients. The performance metrics employed in evaluating the findings of this study are the Receiver Operating Characteristic (ROC) curve, five-fold cross-validation, precision, and accuracy score. The proposed system had an accuracy of 91% and the area under the Receiver Operating Characteristic curve was 93%. The experimental result shows that the prediction accuracy of the hybrid model is better than traditional machine learning
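The five-fold cross-validation mentioned in this abstract partitions the data so that every sample is used for testing exactly once. The following is a minimal sketch of the index bookkeeping only; the helper name `k_fold_indices` is hypothetical, the folds are contiguous and unshuffled for simplicity, and the sample count is invented rather than taken from the study.

```python
def k_fold_indices(n_samples, n_folds):
    """Yield (train_idx, test_idx) pairs; each sample appears in exactly one test fold."""
    base, extra = divmod(n_samples, n_folds)
    start = 0
    for fold in range(n_folds):
        # The first `extra` folds take one additional sample so sizes differ by at most 1.
        size = base + (1 if fold < extra else 0)
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size


# Assumed toy size: 10 samples split into 5 folds.
folds = list(k_fold_indices(10, 5))
```

In practice the samples would usually be shuffled (and often stratified by class) before splitting, and a model would be fitted on each `train_idx` and scored on the matching `test_idx`.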

Software reuse analytics using integrated random forest and gradient boosting machine learning algorithm

Software Practice and Experience ◽

10.1002/spe.2921 ◽

2020 ◽

Author(s):

Amandeep Kaur Sandhu ◽

Ranbir Singh Batth

Keyword(s):

Machine Learning ◽

Random Forest ◽

Software Reuse ◽

Learning Algorithm ◽

Gradient Boosting ◽

Machine Learning Algorithm ◽

Gradient Boosting Machine

Multistep-Ahead Solar Radiation Forecasting Scheme Based on the Light Gradient Boosting Machine: A Case Study of Jeju Island

Remote Sensing ◽

10.3390/rs12142271 ◽

2020 ◽

Vol 12 (14) ◽

pp. 2271 ◽

Cited By ~ 2

Author(s):

Jinwoong Park ◽

Jihoon Moon ◽

Seungmin Jung ◽

Eenjun Hwang

Keyword(s):

Solar Radiation ◽

Global Solar Radiation ◽

Jeju Island ◽

Gradient Boosting ◽

Probabilistic Forecasting ◽

Training Time ◽

Light Gradient ◽

Proposed Model ◽

Gradient Boosting Machine ◽

Time Problem

Smart islands have focused on renewable energy sources, such as solar and wind, to achieve energy self-sufficiency. Because solar photovoltaic (PV) power has the advantage of less noise and easier installation than wind power, it is more flexible in selecting a location for installation. A PV power system can be operated more efficiently by predicting the amount of global solar radiation for solar power generation. Thus far, most studies have addressed day-ahead probabilistic forecasting to predict global solar radiation. However, day-ahead probabilistic forecasting has limitations in responding quickly to sudden changes in the external environment. Although multistep-ahead (MSA) forecasting can be used for this purpose, traditional machine learning models are unsuitable because of the substantial training time. In this paper, we propose an accurate MSA global solar radiation forecasting model based on the light gradient boosting machine (LightGBM), which can handle the training-time problem and provide higher prediction performance compared to other boosting methods. To demonstrate the validity of the proposed model, we conducted a global solar radiation prediction for two regions on Jeju Island, the largest island in South Korea. The experiment results demonstrated that the proposed model can achieve better predictive performance than the tree-based ensemble and deep learning methods.

Prediction of Acute Kidney Injury after Liver Transplantation: Machine Learning Approaches vs. Logistic Regression Model

Journal of Clinical Medicine ◽

10.3390/jcm7110428 ◽

2018 ◽

Vol 7 (11) ◽

pp. 428 ◽

Cited By ~ 31

Author(s):

Hyung-Chul Lee ◽

Soo Yoon ◽

Seong-Mi Yang ◽

Won Kim ◽

Ho-Geol Ryu ◽

...

Keyword(s):

Machine Learning ◽

Liver Transplantation ◽

Acute Kidney Injury ◽

Logistic Regression ◽

Regression Analysis ◽

Logistic Regression Analysis ◽

Kidney Injury ◽

Gradient Boosting ◽

Learning Approaches ◽

Gradient Boosting Machine

Acute kidney injury (AKI) after liver transplantation has been reported to be associated with increased mortality. Recently, machine learning approaches were reported to have better predictive ability than the classic statistical analysis. We compared the performance of machine learning approaches with that of logistic regression analysis to predict AKI after liver transplantation. We reviewed 1211 patients, and preoperative and intraoperative anesthesia and surgery-related variables were obtained. The primary outcome was postoperative AKI defined by acute kidney injury network criteria. The following machine learning techniques were used: decision tree, random forest, gradient boosting machine, support vector machine, naïve Bayes, multilayer perceptron, and deep belief networks. These techniques were compared with logistic regression analysis regarding the area under the receiver-operating characteristic curve (AUROC). AKI developed in 365 patients (30.1%). The performance in terms of AUROC was best in gradient boosting machine among all analyses to predict AKI of all stages (0.90, 95% confidence interval [CI] 0.86–0.93) or stage 2 or 3 AKI. The AUROC of logistic regression analysis was 0.61 (95% CI 0.56–0.66). Decision tree and random forest techniques showed moderate performance (AUROC 0.86 and 0.85, respectively). The AUROC of the support vector machine, naïve Bayes, neural network, and deep belief network was smaller than that of the other models. In our comparison of seven machine learning approaches with logistic regression analysis, the gradient boosting machine showed the best performance with the highest AUROC. An internet-based risk estimator was developed based on our model of gradient boosting. However, prospective studies are required to validate our results.
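The AUROC used to rank the models in this abstract can be read as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. A minimal sketch of that pairwise definition follows; the function name `auroc` is hypothetical and the labels and scores are toy values, not the study's data.

```python
def auroc(labels, scores):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy example (assumed values): 3 of the 4 positive/negative pairs are ordered correctly.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
auc = auroc(labels, scores)
```

This pairwise form is quadratic in the number of cases, so it suits small illustrations; library implementations typically use a sort-based computation instead.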

Noise Prediction Using Machine Learning with Measurements Analysis

Applied Sciences ◽

10.3390/app10186619 ◽

2020 ◽

Vol 10 (18) ◽

pp. 6619

Author(s):

Po-Jiun Wen ◽

Chihpin Huang

Keyword(s):

Machine Learning ◽

Noise Exposure ◽

Learning Model ◽

Training Data ◽

Coefficient Of Determination ◽

Gradient Boosting ◽

Noise Prediction ◽

Time Duration ◽

Proposed Model ◽

The Impact

Noise prediction using machine learning is a specialized area of study that has recently received increased attention. This is particularly true in workplaces with noise pollution, which increases noise exposure for general laborers. This study attempts to analyze the noise equivalent level (Leq) at the National Synchrotron Radiation Research Center (NSRRC) facility and establish a machine learning model for noise prediction. This study utilized the gradient boosting model (GBM) as the learning model, in which past noise measurement records and many other features are integrated when the proposed model makes a prediction. This study analyzed the time duration and frequency of the collected Leq and also investigated the impact of training data selection. The results presented in this paper indicate that the proposed prediction model works well for almost all noise sensors and frequencies. Moreover, the model performed especially well for sensor 8 (125 Hz), which was determined to be a serious noise zone in past noise measurements. The results also show that the root-mean-square error (RMSE) of the predicted harmful noise was less than 1 dBA and the coefficient of determination (R²) value was greater than 0.7. That is, the proposed method showed favorable noise prediction performance in the working field. This positive result shows the ability of the proposed approach in noise prediction, thus providing a notification to the laborer to prevent long-term exposure. In addition, the proposed model accurately predicts future noise pollution, which is essential for laborers in high-noise environments. This would help keep employees healthy by letting them avoid positions with harmful noise and by preventing people from working in that environment.
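The two regression metrics reported in this abstract, RMSE and the coefficient of determination, can be computed directly from measured and predicted values. The sketch below assumes made-up noise levels in dBA purely for illustration; the function names are hypothetical and nothing here reproduces the study's measurements.

```python
import math


def rmse(y_true, y_pred):
    """Root-mean-square error between measured and predicted values."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - (residual sum of squares / total sum of squares)."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Invented noise levels in dBA (assumption for illustration only).
measured = [60.0, 62.0, 64.0, 66.0]
predicted = [60.5, 61.5, 64.5, 65.5]
error = rmse(measured, predicted)
fit = r_squared(measured, predicted)
```

RMSE is expressed in the same unit as the target (here dBA), while the coefficient of determination is unitless, so the two thresholds quoted in the abstract describe different aspects of the fit.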

Prediction of probable backorder scenarios in the supply chain using Distributed Random Forest and Gradient Boosting Machine learning techniques

Journal of Big Data ◽

10.1186/s40537-020-00345-2 ◽

2020 ◽

Vol 7 (1) ◽

Cited By ~ 1

Author(s):

Samiul Islam ◽

Saman Hassanzadeh Amin

Keyword(s):

Machine Learning ◽

Supply Chain ◽

Random Forest ◽

Machine Learning Techniques ◽

Gradient Boosting ◽

Learning Techniques ◽

Gradient Boosting Machine

Supervised Machine-learning Predictive Analytics for Prediction of Postinduction Hypotension

Anesthesiology ◽

10.1097/aln.0000000000002374 ◽

2018 ◽

Vol 129 (4) ◽

pp. 675-688 ◽

Cited By ~ 45

Author(s):

Samir Kendale ◽

Prathamesh Kulkarni ◽

Andrew D. Rosenberg ◽

Jing Wang

Keyword(s):

Machine Learning ◽

Receiver Operating Characteristic Curve ◽

Operating Characteristic ◽

Predictive Analytics ◽

Characteristic Curve ◽

Supervised Machine Learning ◽

Gradient Boosting ◽

Machine Learning Methods ◽

Gradient Boosting Machine ◽

Operating Characteristic Curve

Background: Hypotension is a risk factor for adverse perioperative outcomes. Machine-learning methods allow large amounts of data for development of robust predictive analytics. The authors hypothesized that machine-learning methods can provide prediction for the risk of postinduction hypotension. Methods: Data was extracted from the electronic health record of a single quaternary care center from November 2015 to May 2016 for patients over age 12 that underwent general anesthesia, without procedure exclusions. Multiple supervised machine-learning classification techniques were attempted, with postinduction hypotension (mean arterial pressure less than 55 mmHg within 10 min of induction by any measurement) as primary outcome, and preoperative medications, medical comorbidities, induction medications, and intraoperative vital signs as features. Discrimination was assessed using cross-validated area under the receiver operating characteristic curve. The best performing model was tuned and final performance assessed using split-set validation. Results: Out of 13,323 cases, 1,185 (8.9%) experienced postinduction hypotension. Area under the receiver operating characteristic curve using logistic regression was 0.71 (95% CI, 0.70 to 0.72), support vector machines was 0.63 (95% CI, 0.58 to 0.60), naive Bayes was 0.69 (95% CI, 0.67 to 0.69), k-nearest neighbor was 0.64 (95% CI, 0.63 to 0.65), linear discriminant analysis was 0.72 (95% CI, 0.71 to 0.73), random forest was 0.74 (95% CI, 0.73 to 0.75), neural nets 0.71 (95% CI, 0.69 to 0.71), and gradient boosting machine 0.76 (95% CI, 0.75 to 0.77). Test set area for the gradient boosting machine was 0.74 (95% CI, 0.72 to 0.77). Conclusions: The success of this technique in predicting postinduction hypotension demonstrates feasibility of machine-learning models for predictive analytics in the field of anesthesiology, with performance dependent on model selection and appropriate tuning.

Accurate Prediction of Antibody Resistance in Clinical HIV-1 Isolates

10.1101/364828 ◽

2018 ◽

Cited By ~ 1

Author(s):

Reda Rawi ◽

Raghvendra Mall ◽

Chen-Hsiang Shen ◽

Nicole A. Doria-Rose ◽

S. Katie Farney ◽

...

Keyword(s):

Machine Learning ◽

Clinical Trials ◽

Neutralizing Antibodies ◽

Gradient Boosting ◽

Clinical Settings ◽

Gradient Boosting Machine ◽

Resistant Strains ◽

Antibody Resistance ◽

HIV-1

Broadly neutralizing antibodies (bNAbs) targeting the HIV-1 envelope glycoprotein (Env) have promising utility in prevention and treatment of HIV-1 infection with several undergoing clinical trials. Due to high sequence diversity and mutation rate of HIV-1, viral isolates are often resistant to particular bNAbs. Resistant strains are commonly identified by time-consuming and expensive in vitro neutralization experiments. Here, we developed machine learning-based classifiers that accurately predict resistance of HIV-1 strains to 33 neutralizing antibodies. Notably, our classifiers achieved an overall prediction accuracy of 96% for 212 clinical isolates from patients enrolled in four different clinical trials. Moreover, use of the tree-based machine learning method gradient boosting machine enabled us to identify critical epitope features that distinguish between antibody resistance and sensitivity. The availability of an in silico antibody resistance predictor will facilitate informed decisions of antibody usage in clinical settings.
