FSDroid:- A feature selection technique to detect malware from Android using Machine Learning Techniques

Evolutionary Machine Learning for Classification with Incomplete Data

10.26686/wgtn.17072123 ◽

2021 ◽

Author(s):


Cao Truong Tran

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Genetic Programming ◽

Incomplete Data ◽

Missing Values ◽

Machine Learning Techniques ◽

Feature Construction ◽

Classification Algorithms ◽

Learning Techniques ◽

Effectiveness And Efficiency

Classification is a major task in machine learning and data mining. Many real-world datasets suffer from the unavoidable issue of missing values. Classification with incomplete data has to be handled carefully because inadequate treatment of missing values causes large classification errors. Most existing research on classification with incomplete data has focused on improving effectiveness, but has not adequately addressed the efficiency of applying classifiers to unseen instances, which is much more important than the efficiency of creating classifiers. A common approach to classification with incomplete data is to use imputation methods to replace missing values with plausible values before building classifiers and classifying unseen instances. This approach provides complete data that can then be used by any classification algorithm, but sophisticated imputation methods are usually computationally intensive, especially during the application process of classification. Another approach is to build a classifier that can work directly with missing values. This approach does not require time for estimating missing values, but it often generates inaccurate and complex classifiers when faced with numerous missing values. A recent approach, which also avoids estimating missing values, is to build a set of classifiers from which applicable classifiers are selected for classifying unseen instances. However, this approach is also often inaccurate and takes a long time to find applicable classifiers when faced with numerous missing values. The overall goal of the thesis is to simultaneously improve the effectiveness and efficiency of classification with incomplete data by using evolutionary machine learning techniques for feature selection, clustering, ensemble learning, feature construction and constructing classifiers.
The thesis develops approaches for improving imputation for classification with incomplete data by integrating clustering and feature selection with imputation. The approaches improve both the effectiveness and the efficiency of using imputation for classification with incomplete data. The thesis develops wrapper-based feature selection methods to improve the input space for classification algorithms that are able to work directly with incomplete data. The methods not only improve the classification accuracy, but also reduce the complexity of classifiers able to work directly with incomplete data. The thesis develops a feature construction method to improve the input space for classification algorithms with incomplete data by proposing interval genetic programming, that is, genetic programming with a set of interval functions. The method improves the classification accuracy and reduces the complexity of classifiers. The thesis develops an ensemble approach to classification with incomplete data by integrating imputation, feature selection, and ensemble learning. The results show that the approach is more accurate and faster than previous common methods for classification with incomplete data. The thesis develops interval genetic programming to directly evolve classifiers for incomplete data. The results show that classifiers generated by interval genetic programming can be more effective and efficient than classifiers generated by the combination of imputation and traditional genetic programming. Interval genetic programming is also more effective than common classification algorithms able to work directly with incomplete data. In summary, the thesis develops a range of approaches for simultaneously improving the effectiveness and efficiency of classification with incomplete data by using a range of evolutionary machine learning techniques.
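The imputation approach described in this abstract can be illustrated with a minimal sketch. It uses plain mean imputation, which is an assumption made here for brevity and is not the clustering-based or feature-selection-based imputation the thesis develops. The function names `fit_feature_means` and `impute` and the toy data are hypothetical.

```python
from statistics import mean

def fit_feature_means(rows):
    """Compute one mean per feature from the observed (non-None) training values."""
    n_features = len(rows[0])
    means = []
    for j in range(n_features):
        observed = [row[j] for row in rows if row[j] is not None]
        # Fall back to 0.0 if a feature has no observed value at all (assumption).
        means.append(mean(observed) if observed else 0.0)
    return means

def impute(row, means):
    """Replace each missing value (None) with the stored training mean."""
    return [means[j] if value is None else value for j, value in enumerate(row)]

# Toy training data; None marks a missing value.
train = [
    [1.0, None, 3.0],
    [2.0, 4.0, None],
    [3.0, 6.0, 9.0],
]

means = fit_feature_means(train)                 # done once, at training time
completed = [impute(row, means) for row in train]
unseen = impute([None, 5.0, None], means)        # cheap lookup at application time
```

The split between fitting and applying mirrors the efficiency concern raised in the abstract: whatever the imputation method costs, the part that runs on every unseen instance is what dominates in deployment.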


Non-Intrusive Load Monitoring of Residential Water-Heating Circuit Using Ensemble Machine Learning Techniques

Inventions ◽

10.3390/inventions5040057 ◽

2020 ◽

Vol 5 (4) ◽

pp. 57

Author(s):

Attique Ur Rehman ◽

Tek Tjing Lie ◽

Brice Vallès ◽

Shafiqur Rahman Tito

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Machine Learning Techniques ◽

Learning Models ◽

Water Heating ◽

Energy Monitoring ◽

Non Invasive ◽

Ensemble Machine Learning ◽

Learning Techniques ◽

Load Monitoring

Recent advances in computational capabilities and the deployment of smart meters have revived non-intrusive load monitoring as one of the promising techniques of energy monitoring. Toward effective energy monitoring, this paper presents a non-invasive load inference approach assisted by feature selection and ensemble machine learning techniques. To evaluate and validate the proposed approach, water heating is considered, since it is one of the major residential load elements with solid potential for energy efficiency applications. Moreover, to reflect real-life deployment, digital simulations are carried out on low-sampling-rate real-world load measurements from the New Zealand GREEN Grid Database. MATLAB and Python (Scikit-Learn) are used as simulation tools. The employed learning models, i.e., standalone and ensemble, are trained on a single household’s load data and later tested rigorously on a set of diverse households’ load data to validate their generalization capability. This paper presents a comprehensive performance evaluation of the presented approach in the context of event detection, feature selection, and learning models. Based on the presented study and corresponding analysis of the results, it is concluded that the proposed approach generalizes well to the unseen testing data and yields promising results in terms of non-invasive load inference.


Combining Correlation-Based Feature and Machine Learning for Sensory Evaluation of Saigon Beer

International Journal of Knowledge and Systems Science ◽

10.4018/ijkss.2020040104 ◽

2020 ◽

Vol 11 (2) ◽

pp. 71-85

Author(s):

Nhat-Vinh Lu ◽

Trong-Nhan Vuong ◽

Duy-Tai Dinh

Keyword(s):

Machine Learning ◽

Sensory Evaluation ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Methods ◽

Feature Selection Technique ◽

Machine Learning Methods ◽

Learning Techniques ◽

Correlation Based Feature Selection ◽

Positive Results

Sensory evaluation plays an important role in the food and consumer goods industry. In recent years, the application of machine learning techniques to support food sensory evaluation has become popular. Many different machine learning methods have been applied and produced positive results in this field. In this article, the authors propose a new method to support sensory evaluation on multiple criteria based on the use of a correlation-based feature selection technique, combined with machine learning methods such as linear regression, multilayer perceptron, support vector machine, and random forest. Experimental results are based on considering the correlation between physicochemical components and sensory factors on the Saigon beer dataset.
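The abstract does not say which variant of correlation-based feature selection is used. As a rough sketch of the general idea, the code below applies the commonly cited CFS merit heuristic, k·r_cf / sqrt(k + k(k−1)·r_ff), with a greedy forward search. The feature names `x1` to `x3`, the toy values, and the function names are all hypothetical and are not taken from the Saigon beer dataset.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0

def merit(subset, features, target):
    """CFS merit: rewards feature-target correlation, penalizes feature-feature correlation."""
    k = len(subset)
    r_cf = sum(abs(pearson(features[f], target)) for f in subset) / k
    pairs = [(a, b) for i, a in enumerate(subset) for b in subset[i + 1:]]
    r_ff = (sum(abs(pearson(features[a], features[b])) for a, b in pairs) / len(pairs)
            if pairs else 0.0)
    return k * r_cf / sqrt(k + k * (k - 1) * r_ff)

def cfs_forward(features, target):
    """Greedy forward search: add the feature that most improves merit, stop when none does."""
    selected, best = [], 0.0
    remaining = list(features)
    while remaining:
        score, name = max((merit(selected + [f], features, target), f) for f in remaining)
        if score <= best:
            break
        selected.append(name)
        remaining.remove(name)
        best = score
    return selected

# Toy data: x1 tracks the target, x2 is a noisier near-duplicate of x1, x3 is unrelated.
target = [1, 2, 3, 4, 5]
features = {
    "x1": [1, 2, 3, 4, 5],
    "x2": [1, 2, 3, 5, 4],
    "x3": [2, 1, 2, 1, 2],
}
selected = cfs_forward(features, target)
```

On this toy data the search keeps only `x1`: adding the redundant `x2` lowers the merit because the two features are highly correlated with each other. The selected columns would then be passed to a learner such as the regression or random forest models mentioned in the abstract.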


Feature selection for an automated ancient Tamil script classification system using machine learning techniques

2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET) ◽

10.1109/icammaet.2017.8186731 ◽

2017 ◽

Cited By ~ 2

Author(s):

T S Suganya ◽

S Murugavalli

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Classification System ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Selection For ◽

Tamil Script


Feature Selection and Classification of Leukemia Cancer Using Machine Learning Techniques

Machine Learning Research ◽

10.11648/j.mlr.20200502.11 ◽

2020 ◽

Vol 5 (2) ◽

pp. 18

Author(s):

Md. Alamgir Sarder ◽

Md. Maniruzzaman ◽

Benojir Ahammed

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Machine Learning Techniques ◽

Learning Techniques


A Survey on Diagnosis and Analysis of Diabetic Retinopathy using Feature Selection

International Journal of Scientific Research in Science Engineering and Technology ◽

10.32628/ijsrset207132 ◽

2020 ◽

pp. 170-176

Author(s):

Amalu Michael ◽

Deepa S S

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Diabetic Retinopathy ◽

High Ratio ◽

Training Data ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Diabetic Eye Disease ◽

Selection Mechanisms ◽

Feature Selection Techniques

Diabetic retinopathy (DR) is one of the common forms of diabetic eye disease. DR occurs due to a high ratio of glucose in the blood, which causes alterations in the retinal vessels. Machine learning (ML) is a broad multidisciplinary field that has its roots in statistics, algebra, data processing, and information analytics. ML is used to discover patterns in medical data and provides an efficient way to predict diseases. It is an application of artificial intelligence that collects information from training data. Several machine learning techniques are used for the diagnosis of diabetic retinopathy. This paper surveys such techniques along with various feature selection mechanisms. It provides a basic categorization of feature selection techniques and discusses their use.


A Static Feature Selection-based Android Malware Detection Using Machine Learning Techniques

2020 International Conference on Smart Electronics and Communication (ICOSEC) ◽

10.1109/icosec49089.2020.9215355 ◽

2020 ◽

Author(s):

Aviral Sangal ◽

Harsh Kumar Verma

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Malware Detection ◽

Machine Learning Techniques ◽

Android Malware ◽

Android Malware Detection ◽

Learning Techniques ◽

Static Feature


Performance Analysis of Microarray Data Classification using Machine Learning Techniques

International Journal of Knowledge Discovery in Bioinformatics ◽

10.4018/ijkdb.2015070104 ◽

2015 ◽

Vol 5 (2) ◽

pp. 43-54

Author(s):

Subhendu Kumar Pani ◽

Bikram Kesari Ratha ◽

Ajay Kumar Mishra

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Microarray Data ◽

Predictive Accuracy ◽

Machine Learning Techniques ◽

Learning Approaches ◽

Data Mining Technique ◽

Single Experiment ◽

Learning Techniques ◽

Microarray Datasets

DNA microarray technology permits simultaneous monitoring and measurement of thousands of gene expression activation levels in a single experiment. Data mining techniques such as classification are extensively used on microarray data for medical diagnosis and gene analysis. However, the high dimensionality of the data affects the performance of classification and prediction. Consequently, a key issue in microarray data analysis is feature selection and dimensionality reduction to achieve better classification and predictive accuracy. There are several machine learning approaches available for feature selection. In this study, the authors use Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) for feature selection and evaluate the performance of several popular classifiers on a set of microarray datasets. Experimental results show that feature selection affects the performance.
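The abstract gives no details of the genetic algorithm, so the following is only a generic sketch of GA-based wrapper feature selection. Each individual is a bit mask over the features, and fitness is the leave-one-out accuracy of a 1-nearest-neighbour classifier, which is a stand-in chosen here and not one of the paper's classifiers. The function names, the toy data, and all parameter values (`pop_size`, `generations`, `p_mut`, `seed`) are hypothetical.

```python
import random

def loo_1nn_accuracy(X, y, mask):
    """Leave-one-out accuracy of 1-NN using only the features where mask is 1."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    correct = 0
    for i, xi in enumerate(X):
        nearest = min(
            (k for k in range(len(X)) if k != i),
            key=lambda k: sum((xi[j] - X[k][j]) ** 2 for j in cols),
        )
        correct += y[nearest] == y[i]
    return correct / len(X)

def ga_select(X, y, pop_size=12, generations=15, p_mut=0.1, seed=0):
    """Evolve feature masks with truncation selection, one-point crossover, bit-flip mutation."""
    rng = random.Random(seed)
    n = len(X[0])

    def fitness(mask):
        return loo_1nn_accuracy(X, y, mask)

    pop = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[: pop_size // 2]      # the better half survives unchanged (elitism)
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)       # one-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ (rng.random() < p_mut) for bit in child]  # bit-flip mutation
            children.append(child)
        pop = parents + children
    return max(pop, key=fitness)

# Toy data: only the first feature separates the two classes; the rest is noise.
X = [
    [0.0, 5, 1, 7], [0.1, 1, 9, 2], [0.2, 8, 3, 6],
    [1.0, 4, 2, 1], [1.1, 9, 8, 7], [1.2, 2, 4, 3],
]
y = [0, 0, 0, 1, 1, 1]

best_mask = ga_select(X, y)
best_accuracy = loo_1nn_accuracy(X, y, best_mask)
```

Real microarray data has thousands of features and few samples, so a practical version would cache fitness values and use a proper cross-validation scheme. A PSO variant would keep the same fitness function and replace the crossover and mutation step with velocity updates over the mask.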


An efficient feature selection method for classification in health care systems using machine learning techniques

2011 3rd International Conference on Electronics Computer Technology ◽

10.1109/icectech.2011.5941891 ◽

2011 ◽

Cited By ~ 7

Author(s):

K Selvakuberan ◽

D Kayathiri ◽

B Harini ◽

M Indra Devi

Keyword(s):

Machine Learning ◽

Health Care ◽

Feature Selection ◽

Health Care Systems ◽

Feature Selection Method ◽

Selection Method ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Care Systems


Human Activity Classification Using Machine Learning Techniques with Feature Selection

Communications in Computer and Information Science - Advanced Informatics for Computing Research ◽

10.1007/978-981-16-3660-8_35 ◽

2021 ◽

pp. 371-380

Author(s):

P. Maneesha ◽

Nagadeepa Choppakatla

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Human Activity ◽

Machine Learning Techniques ◽

Activity Classification ◽

Learning Techniques
