DDoS Attack Classification on Cloud Environment Using Machine Learning Techniques with Different Feature Selection Methods

doi:10.30534/ijatcse/2020/60952020

The Effectiveness of Feature Selection Method in Solar Power Prediction

Journal of Renewable Energy ◽

10.1155/2013/952613 ◽

2013 ◽

Vol 2013 ◽

pp. 1-9 ◽

Cited By ~ 4

Author(s):

Md Rahat Hossain ◽

Amanullah Maung Than Oo ◽

A. B. M. Shawkat Ali

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Prediction Accuracy ◽

Solar Power ◽

Feature Subset Selection ◽

Machine Learning Techniques ◽

Support Vector ◽

Selection Methods ◽

Power Prediction ◽

Learning Techniques

This paper empirically shows that the effect of applying selected feature subsets on machine learning techniques significantly improves the accuracy for solar power prediction. Experiments are performed using five well-known wrapper feature selection methods to obtain the solar power prediction accuracy of machine learning techniques with selected feature subsets. For all the experiments, the machine learning techniques, namely, least median square (LMS), multilayer perceptron (MLP), and support vector machine (SVM), are used. Afterwards, these results are compared with the solar power prediction accuracy of those same machine leaning techniques (i.e., LMS, MLP, and SVM) but without applying feature selection methods (WAFS). Experiments are carried out using reliable and real life historical meteorological data. The comparison between the results clearly shows that LMS, MLP, and SVM provide better prediction accuracy (i.e., reduced MAE and MASE) with selected feature subsets than without selected feature subsets. Experimental results of this paper facilitate to make a concrete verdict that providing more attention and effort towards the feature subset selection aspect (e.g., selected feature subsets on prediction accuracy which is investigated in this paper) can significantly contribute to improve the accuracy of solar power prediction.

Download Full-text

A comprehensive study of different feature selection methods and machine-learning techniques for SODAR structure classification

Modeling Earth Systems and Environment ◽

10.1007/s40808-020-00872-0 ◽

2020 ◽

Author(s):

Nishant Kumar ◽

Kirti Soni ◽

Ravinder Agarwal

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Machine Learning Techniques ◽

Selection Methods ◽

Learning Techniques ◽

Comprehensive Study

Download Full-text

Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques

PLoS ONE ◽

10.1371/journal.pone.0254976 ◽

2021 ◽

Vol 16 (7) ◽

pp. e0254976

Author(s):

Keyvan Karami ◽

Mahboubeh Akbari ◽

Mohammad-Taher Moradi ◽

Bijan Soleymani ◽

Hossein Fallahi

Keyword(s):

Machine Learning ◽

Data Mining ◽

Acute Myeloid Leukemia ◽

Feature Selection ◽

Predictive Value ◽

Myeloid Leukemia ◽

Machine Learning Techniques ◽

Selection Methods ◽

Learning Techniques ◽

Acute Myeloid

This paper identifies prognosis factors for survival in patients with acute myeloid leukemia (AML) using machine learning techniques. We have integrated machine learning with feature selection methods and have compared their performances to identify the most suitable factors in assessing the survival of AML patients. Here, six data mining algorithms including Decision Tree, Random Forrest, Logistic Regression, Naive Bayes, W-Bayes Net, and Gradient Boosted Tree (GBT) are employed for the detection model and implemented using the common data mining tool RapidMiner and open-source R package. To improve the predictive ability of our model, a set of features were selected by employing multiple feature selection methods. The accuracy of classification was obtained using 10-fold cross-validation for the various combinations of the feature selection methods and machine learning algorithms. The performance of the models was assessed by various measurement indexes including accuracy, kappa, sensitivity, specificity, positive predictive value, negative predictive value, and area under the ROC curve (AUC). Our results showed that GBT with an accuracy of 85.17%, AUC of 0.930, and the feature selection via the Relief algorithm has the best performance in predicting the survival rate of AML patients.

Download Full-text

Framework to Analyze Malicious Behaviour in Cloud Environment using Machine Learning Techniques

2021 International Conference on Computer Communication and Informatics (ICCCI) ◽

10.1109/iccci50826.2021.9402671 ◽

2021 ◽

Author(s):

Pranay Jha ◽

Ashok Sharma

Keyword(s):

Machine Learning ◽

Machine Learning Techniques ◽

Cloud Environment ◽

Learning Techniques

Download Full-text

FSDroid:- A feature selection technique to detect malware from Android using Machine Learning Techniques

Multimedia Tools and Applications ◽

10.1007/s11042-020-10367-w ◽

2021 ◽

Author(s):

Arvind Mahindru ◽

A.L. Sangal

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Machine Learning Techniques ◽

Feature Selection Technique ◽

Selection Technique ◽

Learning Techniques

Download Full-text

Evolutionary Machine Learning for Classification with Incomplete Data

10.26686/wgtn.17072123 ◽

2021 ◽

Author(s):

◽

Cao Truong Tran

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Genetic Programming ◽

Incomplete Data ◽

Missing Values ◽

Machine Learning Techniques ◽

Feature Construction ◽

Classification Algorithms ◽

Learning Techniques ◽

Effectiveness And Efficiency

<p>Classification is a major task in machine learning and data mining. Many real-world datasets suffer from the unavoidable issue of missing values. Classification with incomplete data has to be carefully handled because inadequate treatment of missing values will cause large classification errors. Existing most researchers working on classification with incomplete data focused on improving the effectiveness, but did not adequately address the issue of the efficiency of applying the classifiers to classify unseen instances, which is much more important than the act of creating classifiers. A common approach to classification with incomplete data is to use imputation methods to replace missing values with plausible values before building classifiers and classifying unseen instances. This approach provides complete data which can be then used by any classification algorithm, but sophisticated imputation methods are usually computationally intensive, especially for the application process of classification. Another approach to classification with incomplete data is to build a classifier that can directly work with missing values. This approach does not require time for estimating missing values, but it often generates inaccurate and complex classifiers when faced with numerous missing values. A recent approach to classification with incomplete data which also avoids estimating missing values is to build a set of classifiers which then is used to select applicable classifiers for classifying unseen instances. However, this approach is also often inaccurate and takes a long time to find applicable classifiers when faced with numerous missing values. The overall goal of the thesis is to simultaneously improve the effectiveness and efficiency of classification with incomplete data by using evolutionary machine learning techniques for feature selection, clustering, ensemble learning, feature construction and constructing classifiers. The thesis develops approaches for improving imputation for classification with incomplete data by integrating clustering and feature selection with imputation. The approaches improve both the effectiveness and the efficiency of using imputation for classification with incomplete data. The thesis develops wrapper-based feature selection methods to improve input space for classification algorithms that are able to work directly with incomplete data. The methods not only improve the classification accuracy, but also reduce the complexity of classifiers able to work directly with incomplete data. The thesis develops a feature construction method to improve input space for classification algorithms with incomplete data by proposing interval genetic programming-genetic programming with a set of interval functions. The method improves the classification accuracy and reduces the complexity of classifiers. The thesis develops an ensemble approach to classification with incomplete data by integrating imputation, feature selection, and ensemble learning. The results show that the approach is more accurate, and faster than previous common methods for classification with incomplete data. The thesis develops interval genetic programming to directly evolve classifiers for incomplete data. The results show that classifiers generated by interval genetic programming can be more effective and efficient than classifiers generated the combination of imputation and traditional genetic programming. Interval genetic programming is also more effective than common classification algorithms able to work directly with incomplete data. In summary, the thesis develops a range of approaches for simultaneously improving the effectiveness and efficiency of classification with incomplete data by using a range of evolutionary machine learning techniques.</p>

Download Full-text

Automated Soil Residue Levels Detecting Device With IoT Interface

Advances in Data Mining and Database Management - Challenges and Applications of Data Analytics in Social Perspectives ◽

10.4018/978-1-7998-2566-1.ch007 ◽

2021 ◽

pp. 123-135

Author(s):

Adiraju Prasanth Rao ◽

K. Sudheer Reddy ◽

Sathiyamoorthi V.

Keyword(s):

Machine Learning ◽

Machine Learning Techniques ◽

The Novel ◽

Cloud Environment ◽

Application System ◽

Learning Techniques ◽

Social And Economic Development ◽

Residue Levels ◽

Soil Residue ◽

Agricultural Farm

Cloud computing and internet of things (IoT) are playing a crucial role in the present era of technological, social, and economic development. The novel models where cloud and IoT are integrated together are foreseen as disruptive and enable a number of application scenarios. The e-smart is an application system designed by leveraging cloud, IoT, and several other technology frameworks that are deployed on the agricultural farm to collect the data from the farm fields. The application extracts and collects the information about the residue levels of soil and crop details and the same data will be hosted in the cloud environment. The proposed e-smart application system is to analyze, integrate, and correlate datasets and produce decision-oriented reports to the farmer by using several machine learning techniques.

Download Full-text

Analysis of Machine Learning Techniques for Lightweight DDoS Attack Detection on IoT Networks

Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering - Forthcoming Networks and Sustainability in the IoT Era ◽

10.1007/978-3-030-69431-9_8 ◽

2021 ◽

pp. 96-110

Author(s):

Eric McCullough ◽

Razib Iqbal ◽

Ajay Katangur

Keyword(s):

Machine Learning ◽

Attack Detection ◽

Machine Learning Techniques ◽

Ddos Attack ◽

Learning Techniques ◽

Ddos Attack Detection

Download Full-text

Flow-Based Detection and Mitigation of Low-Rate DDOS Attack in SDN Environment Using Machine Learning Techniques

10.1007/978-981-16-2919-8_18 ◽

2021 ◽

pp. 193-205

Author(s):

K. Muthamil Sudar ◽

P. Deepalakshmi

Keyword(s):

Machine Learning ◽

Machine Learning Techniques ◽

Ddos Attack ◽

Learning Techniques ◽

Low Rate

Download Full-text

Non-Intrusive Load Monitoring of Residential Water-Heating Circuit Using Ensemble Machine Learning Techniques

Inventions ◽

10.3390/inventions5040057 ◽

2020 ◽

Vol 5 (4) ◽

pp. 57

Author(s):

Attique Ur Rehman ◽

Tek Tjing Lie ◽

Brice Vallès ◽

Shafiqur Rahman Tito

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Machine Learning Techniques ◽

Learning Models ◽

Water Heating ◽

Energy Monitoring ◽

Non Invasive ◽

Ensemble Machine Learning ◽

Learning Techniques ◽

Load Monitoring

The recent advancement in computational capabilities and deployment of smart meters have caused non-intrusive load monitoring to revive itself as one of the promising techniques of energy monitoring. Toward effective energy monitoring, this paper presents a non-invasive load inference approach assisted by feature selection and ensemble machine learning techniques. For evaluation and validation purposes of the proposed approach, one of the major residential load elements having solid potential toward energy efficiency applications, i.e., water heating, is considered. Moreover, to realize the real-life deployment, digital simulations are carried out on low-sampling real-world load measurements: New Zealand GREEN Grid Database. For said purposes, MATLAB and Python (Scikit-Learn) are used as simulation tools. The employed learning models, i.e., standalone and ensemble, are trained on a single household’s load data and later tested rigorously on a set of diverse households’ load data, to validate the generalization capability of the employed models. This paper presents a comprehensive performance evaluation of the presented approach in the context of event detection, feature selection, and learning models. Based on the presented study and corresponding analysis of the results, it is concluded that the proposed approach generalizes well to the unseen testing data and yields promising results in terms of non-invasive load inference.

Download Full-text