Comprehensive Non-Intrusive Load Monitoring Process: Device Event Detection, Device Feature Extraction and Device Identification Using KNN, Random Forest and Decision Tree

A multi-agent architecture for a Non-Intrusive Load Monitoring (NILM) solution is presented and evaluated. The underlying rationale for such an architecture is that each agent (load event detection, feature extraction, and classification) outperforms others of the same type in particular scenarios; hence, by combining the expertise of these agents, the system presents an improved performance. Known NILM algorithms, as well as new algorithms, proposed by the authors, were individually evaluated and compared. The proposed architecture considers a NILM system composed of Load Monitoring Modules (LMM) that report to a Center of Operations, required in larger facilities. For the purposed of evaluating and comparing performance, five load event detect agents, five feature extraction agents, and five classification agents were studied so that the best combinations of agents could be implemented in LMMs. To evaluate the proposed system, the COOLL and the LIT-Dataset were used. Performance improvements were detected in all scenarios, with power-ON and power-OFF detection improving up to 13%, while classification accuracy improved up to 9.4%.

Download Full-text

Heart Disease Prediction Using Decision Tree and Random Forest Classification Techniques

Applications of Big Data in Large- and Small-Scale Systems - Advances in Data Mining and Database Management ◽

10.4018/978-1-7998-6673-2.ch015 ◽

2021 ◽

pp. 234-259

Author(s):

Nitika Kapoor ◽

Parminder Singh

Keyword(s):

Feature Extraction ◽

Heart Disease ◽

Random Forest ◽

Decision Tree ◽

Random Forest Classifier ◽

Disease Prediction ◽

Decision Tree Classifier ◽

Hybrid Classifier ◽

Forest Classification ◽

Tree Classifier

Data mining is the approach which can extract useful information from the data. The prediction analysis is the approach which can predict future possibilities based on the current information. The authors propose a hybrid classifier to carry out the heart disease prediction. The hybrid classifier is combination of random forest and decision tree classifier. Moreover, the heart disease prediction technique has three steps, which are data pre-processing, feature extraction, and classification. In this research, random forest classifier is applied for the feature extraction and decision tree classifier is applied for the generation of prediction results. However, random forest classifier will extract the information and decision tree will generate final classifier result. The authors show the results of proposed model using the Python platform. Moreover, the results are compared with support vector machine (SVM) and k-nearest neighbour classifier (KNN).

Download Full-text

Heart Disease Prediction Method using Hybrid Classifier

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.i1109.0789s419 ◽

2019 ◽

Vol 8 (9S4) ◽

pp. 57-61

Keyword(s):

Feature Extraction ◽

Heart Disease ◽

Random Forest ◽

Decision Tree ◽

Research Work ◽

Random Forest Classifier ◽

Disease Prediction ◽

Decision Tree Classifier ◽

Hybrid Classifier ◽

Tree Classifier

The data mining is the approach which can extract useful information from the data. The following research work that has been described is related to the heart disease prediction. The prediction analysis is the approach which can predict future possibilities based on the current information. For the heart disease prediction the classifier that is designed in this research work is hybrid classifier. The hybrid classifier is combination of random forest and decision tree classifier. Moreover, the heart disease prediction technique has three steps which are data pre-processing, feature extraction and classification. In this paper, random forest classifier is applied for the feature extraction and decision tree classifier is applied for the generation of prediction results. However, random forest classifier will extract the information and decision tree will generate final classifier result. We have proposed a hybrid model that has been implemented in python. Moreover, the results are compared with Support Vector Machine (SVM) and K-Nearest Neighbor classifier (KNN).

Download Full-text

SAAE-DNN: Deep Learning Method on Intrusion Detection

Symmetry ◽

10.3390/sym12101695 ◽

2020 ◽

Vol 12 (10) ◽

pp. 1695

Author(s):

Chaofei Tang ◽

Nurbol Luktarhan ◽

Yuxin Zhao

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Random Forest ◽

Intrusion Detection ◽

Decision Tree ◽

Binary Classification ◽

Attention Mechanism ◽

Detection Methods ◽

Detection Accuracy ◽

Multi Classification

Intrusion detection system (IDS) plays a significant role in preventing network attacks and plays a vital role in the field of national security. At present, the existing intrusion detection methods are generally based on traditional machine learning models, such as random forest and decision tree, but they rely heavily on artificial feature extraction and have relatively low accuracy. To solve the problems of feature extraction and low detection accuracy in intrusion detection, an intrusion detection model SAAE-DNN, based on stacked autoencoder (SAE), attention mechanism and deep neural network (DNN), is proposed. The SAE represents data with a latent layer, and the attention mechanism enables the network to obtain the key features of intrusion detection. The trained SAAE encoder can not only automatically extract features, but also initialize the weights of DNN potential layers to improve the detection accuracy of DNN. We evaluate the performance of SAAE-DNN in binary-classification and multi-classification on an NSL-KDD dataset. The SAAE-DNN model can detect normally and attack symmetrically, with an accuracy of 87.74% and 82.14% (binary-classification and multi-classification), which is higher than that of machine learning methods such as random forest and decision tree. The experimental results show that the model has a better performance than other comparison methods.

Download Full-text

Prediction of Breast Cancer using Decision tree and Random Forest Algorithm

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i2.226229 ◽

2018 ◽

Vol 6 (2) ◽

pp. 226-229

Author(s):

N.Sridevi . ◽

◽

S.Anitha . ◽

Keyword(s):

Breast Cancer ◽

Random Forest ◽

Decision Tree ◽

Random Forest Algorithm

Download Full-text

Gray-Level Co-occurrence Matrix and Random Forest Based Off-line Odia Handwritten Character Recognition

Recent Patents on Engineering ◽

10.2174/1872212112666180601085544 ◽

2019 ◽

Vol 13 (2) ◽

pp. 136-141 ◽

Cited By ~ 2

Author(s):

Abhisek Sethy ◽

Prashanta Kumar Patra ◽

Deepak Ranjan Nayak

Keyword(s):

Feature Extraction ◽

Random Forest ◽

Character Recognition ◽

Recognition Rate ◽

Discrete Wavelet ◽

Gray Level ◽

Handwritten Character Recognition ◽

Handwritten Character ◽

Wide Range ◽

Occurrence Matrix

Background: In the past decades, handwritten character recognition has received considerable attention from researchers across the globe because of its wide range of applications in daily life. From the literature, it has been observed that there is limited study on various handwritten Indian scripts and Odia is one of them. We revised some of the patents relating to handwritten character recognition. Methods: This paper deals with the development of an automatic recognition system for offline handwritten Odia character recognition. In this case, prior to feature extraction from images, preprocessing has been done on the character images. For feature extraction, first the gray level co-occurrence matrix (GLCM) is computed from all the sub-bands of two-dimensional discrete wavelet transform (2D DWT) and thereafter, feature descriptors such as energy, entropy, correlation, homogeneity, and contrast are calculated from GLCMs which are termed as the primary feature vector. In order to further reduce the feature space and generate more relevant features, principal component analysis (PCA) has been employed. Because of the several salient features of random forest (RF) and K- nearest neighbor (K-NN), they have become a significant choice in pattern classification tasks and therefore, both RF and K-NN are separately applied in this study for segregation of character images. Results: All the experiments were performed on a system having specification as windows 8, 64-bit operating system, and Intel (R) i7 – 4770 CPU @ 3.40 GHz. Simulations were conducted through Matlab2014a on a standard database named as NIT Rourkela Odia Database. Conclusion: The proposed system has been validated on a standard database. The simulation results based on 10-fold cross-validation scenario demonstrate that the proposed system earns better accuracy than the existing methods while requiring least number of features. The recognition rate using RF and K-NN classifier is found to be 94.6% and 96.4% respectively.

Download Full-text

Document Preprocessing with TF-IDF to Improve the Polarity Classification Performance of Unstructured Sentiment Analysis

Kinetik Game Technology Information System Computer Network Computing Electronics and Control ◽

10.22219/kinetik.v5i3.1066 ◽

2020 ◽

pp. 235-242

Author(s):

Farrikh Alzami ◽

Erika Devi Udayanti ◽

Dwi Puji Prabowo ◽

Rama Aria Megantara

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Random Forest ◽

Sentiment Analysis ◽

Classification Performance ◽

Document Preparation ◽

Learning Models ◽

Polarity Classification ◽

Negative Sentiment ◽

Machine Learning Models

Sentiment analysis in terms of polarity classification is very important in everyday life, with the existence of polarity, many people can find out whether the respected document has positive or negative sentiment so that it can help in choosing and making decisions. Sentiment analysis usually done manually. Therefore, an automatic sentiment analysis classification process is needed. However, it is rare to find studies that discuss extraction features and which learning models are suitable for unstructured sentiment analysis types with the Amazon food review case. This research explores some extraction features such as Word Bags, TF-IDF, Word2Vector, as well as a combination of TF-IDF and Word2Vector with several machine learning models such as Random Forest, SVM, KNN and Naïve Bayes to find out a combination of feature extraction and learning models that can help add variety to the analysis of polarity sentiments. By assisting with document preparation such as html tags and punctuation and special characters, using snowball stemming, TF-IDF results obtained with SVM are suitable for obtaining a polarity classification in unstructured sentiment analysis for the case of Amazon food review with a performance result of 87,3 percent.

Download Full-text

Non-Intrusive Load Monitoring Based on Feature Extraction of Change-point and XGBoost Classifier

2020 IEEE 4th Conference on Energy Internet and Energy System Integration (EI2) ◽

10.1109/ei250167.2020.9347014 ◽

2020 ◽

Author(s):

Zhuo Chen ◽

Junxingxu Chen ◽

Xianyong Xu ◽

Shuangjian Peng ◽

Jian Xiao ◽

...

Keyword(s):

Feature Extraction ◽

Change Point ◽

Load Monitoring

Download Full-text

Machine Learning in Aging: An Example of Developing Prediction Models for Serious Fall Injury in Older Adults

Innovation in Aging ◽

10.1093/geroni/igaa057.859 ◽

2020 ◽

Vol 4 (Supplement_1) ◽

pp. 268-269

Author(s):

Jaime Speiser ◽

Kathryn Callahan ◽

Jason Fanning ◽

Thomas Gill ◽

Anne Newman ◽

...

Keyword(s):

Machine Learning ◽

Older Adults ◽

Random Forest ◽

Decision Tree ◽

Prediction Models ◽

Receiver Operating Curve ◽

Learning Methods ◽

Life Study ◽

Fall Injury ◽

Machine Learning Methods

Abstract Advances in computational algorithms and the availability of large datasets with clinically relevant characteristics provide an opportunity to develop machine learning prediction models to aid in diagnosis, prognosis, and treatment of older adults. Some studies have employed machine learning methods for prediction modeling, but skepticism of these methods remains due to lack of reproducibility and difficulty understanding the complex algorithms behind models. We aim to provide an overview of two common machine learning methods: decision tree and random forest. We focus on these methods because they provide a high degree of interpretability. We discuss the underlying algorithms of decision tree and random forest methods and present a tutorial for developing prediction models for serious fall injury using data from the Lifestyle Interventions and Independence for Elders (LIFE) study. Decision tree is a machine learning method that produces a model resembling a flow chart. Random forest consists of a collection of many decision trees whose results are aggregated. In the tutorial example, we discuss evaluation metrics and interpretation for these models. Illustrated in data from the LIFE study, prediction models for serious fall injury were moderate at best (area under the receiver operating curve of 0.54 for decision tree and 0.66 for random forest). Machine learning methods may offer improved performance compared to traditional models for modeling outcomes in aging, but their use should be justified and output should be carefully described. Models should be assessed by clinical experts to ensure compatibility with clinical practice.

Download Full-text

Robot Perceptual Classification Method Based on Mixed Features of Decision Tree and Random Forest

2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE) ◽

10.1109/icbaie52039.2021.9389973 ◽

2021 ◽

Author(s):

Yifan Song ◽

Jiankai Zuo ◽

Jiehong Wu ◽

Zeyuan Liu ◽

Ziheng Li

Keyword(s):

Random Forest ◽

Decision Tree ◽

Classification Method ◽

Perceptual Classification ◽

Mixed Features

Download Full-text