K-nearest neighbor and naïve Bayes based diagnostic analytic of harmonic source identification

2021 ◽  
Vol 9 (6) ◽  
pp. 2650-2657
Author(s):  
Mohd Hatta Jopri ◽  
Mohd Ruddin Ab Ghani ◽  
Abdul Rahim Abdullah ◽  
Mustafa Manap ◽  
Tole Sutikno ◽  
...  

This paper compares two machine learning (ML) algorithms, k-nearest neighbor (KNN) and naïve Bayes (NB), for identifying and diagnosing harmonic sources in a power system. The proposed method uses a single-point measurement; the measured signals are analyzed with the S-transform and reduced to voltage and current parameters. The voltage and current features estimated from the time-frequency representation (TFR) of the S-transform serve as the inputs to the ML algorithms. Four significant cases of harmonic source location are considered, with harmonic voltage (HV) and harmonic current (HC) source-type loads used in the diagnosis. To identify the better algorithm, the accuracy, precision, specificity, sensitivity, and F-measure of each are calculated. The proposed methodology is tested and verified on the IEEE 4-bus test feeder, and each ML algorithm is executed 10 times to guard against overfitted results.
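The five performance measures named in this abstract can all be derived from the counts of a confusion matrix. The following is a minimal sketch using the standard textbook definitions; the function name `binary_metrics` and the example counts are hypothetical and are not taken from the paper.

```python
def binary_metrics(tp, fp, tn, fn):
    """Compute common classification measures from confusion-matrix counts.

    tp/fp/tn/fn are the true-positive, false-positive, true-negative and
    false-negative counts for one class treated as "positive".
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0  # also called recall
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    denom = precision + sensitivity
    # F-measure is the harmonic mean of precision and sensitivity.
    f_measure = 2 * precision * sensitivity / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f_measure": f_measure,
    }


# Hypothetical counts, chosen only to illustrate the arithmetic.
scores = binary_metrics(tp=40, fp=10, tn=45, fn=5)
for name, value in scores.items():
    print(f"{name}: {value:.4f}")
```

For a multi-class problem such as several source-location cases, one common choice is to compute these measures per class (one-vs-rest) and average them; whether the paper does so is not stated in the abstract.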

Author(s):  
Mohd Hatta Jopri ◽  
Abdul Rahim Abdullah ◽  
Jingwei Too ◽  
Tole Sutikno ◽  
Srete Nikolovski ◽  
...  

A harmonic source diagnostic analytic is vital for identifying the location and type of harmonic sources in a power system. This paper compares two machine learning (ML) algorithms, support vector machine (SVM) and naïve Bayes (NB). The voltage and current features used as ML inputs are extracted from the time-frequency representation (TFR) of the S-transform. Several unique cases of harmonic source location are considered, with harmonic voltage and harmonic current source-type loads used in the diagnosis. To identify the better algorithm, the accuracy, specificity, sensitivity, and F-measure of the proposed method are calculated. The proposed methodology is tested and verified on the IEEE 4-bus test feeder, and each ML algorithm is executed 10 times on different partitions to guard against overfitted results.
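Running each algorithm 10 times on different partitions can be sketched as repeated random hold-out splitting. The helper names, the 30% test fraction, and the seeding scheme below are assumptions made for illustration; the abstract does not say how the partitions were drawn.

```python
import random


def repeated_holdout(n_samples, n_runs=10, test_fraction=0.3, base_seed=0):
    """Yield (train_idx, test_idx) for n_runs independent random partitions."""
    n_test = max(1, int(round(n_samples * test_fraction)))
    for run in range(n_runs):
        # A different seed per run gives a different shuffle, hence a new partition.
        rng = random.Random(base_seed + run)
        indices = list(range(n_samples))
        rng.shuffle(indices)
        yield indices[n_test:], indices[:n_test]


def mean_score(evaluate, n_samples, **kwargs):
    """Average the score returned by evaluate(train_idx, test_idx) over all runs."""
    scores = [evaluate(tr, te) for tr, te in repeated_holdout(n_samples, **kwargs)]
    return sum(scores) / len(scores)


# Ten partitions of a hypothetical 20-sample feature set.
splits = list(repeated_holdout(n_samples=20))
print(len(splits), "partitions; test size per run:", len(splits[0][1]))
```

Averaging a score over the runs, rather than reporting a single split, reduces the chance that one favourable partition flatters a classifier.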


2021 ◽  
Vol 10 (1) ◽  
pp. 171-178
Author(s):  
Mohd Hatta Jopri ◽  
Abdul Rahim Abdullah ◽  
Mustafa Manap ◽  
M. Badril Nor Shah ◽  
Tole Sutikno ◽  
...  

Diagnostic analytics of harmonic sources is a crucial research area because it identifies and diagnoses harmonic sources in the power system. This paper compares two machine learning (ML) algorithms, linear discriminant analysis (LDA) and k-nearest neighbor (KNN), for identifying and diagnosing harmonic sources. Voltage and current features estimated from the time-frequency representation (TFR) of the S-transform are used as the ML inputs. Several unique cases of harmonic source location are considered, with harmonic voltage (HV) and harmonic current (HC) source-type loads used in the diagnosis. To identify the better algorithm, each is executed 10 times to guard against overfitted results, and the accuracy, precision, geometric mean, specificity, sensitivity, and F-measure are calculated.
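For readers unfamiliar with KNN, the classifier assigns a query sample the majority label among its k closest training samples. The sketch below is a generic illustration with Euclidean distance; the two-feature samples and the "HV"/"HC" labels are hypothetical stand-ins and are not the paper's data or feature set.

```python
from collections import Counter
from math import dist


def knn_predict(train_X, train_y, query, k=3):
    """Classify query by majority vote among its k nearest training points."""
    # Rank training points by Euclidean distance to the query.
    neighbors = sorted(zip(train_X, train_y), key=lambda pair: dist(pair[0], query))
    votes = Counter(label for _, label in neighbors[:k])
    return votes.most_common(1)[0][0]


# Hypothetical samples: (voltage feature, current feature) with a class label.
train_X = [(0.10, 0.20), (0.20, 0.10), (0.15, 0.25),
           (0.90, 0.80), (0.80, 0.90), (0.85, 0.75)]
train_y = ["HV", "HV", "HV", "HC", "HC", "HC"]

print(knn_predict(train_X, train_y, (0.2, 0.2)))
print(knn_predict(train_X, train_y, (0.8, 0.8)))
```

The choice of k and of the distance metric affects the result; k=3 here is arbitrary.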


Author(s):  
M. H Jopri ◽  
MR Ab Ghani ◽  
A.R Abdullah ◽  
Tole Sutikno ◽  
M Manap ◽  
...  

Diagnosing the type of harmonic source is a vital research area because it identifies the types of harmonic source present in the power system. This paper compares two machine learning (ML) algorithms, naïve Bayes (NB) and linear discriminant analysis (LDA), for identifying and diagnosing harmonic sources. The ML inputs are the voltage and current feature sets estimated from the time-frequency representation (TFR) of the S-transform. Four specific cases of harmonic source location are considered, with harmonic voltage (HV) and harmonic current (HC) source-type loads used in the diagnosis. The proposed methodology is tested and verified on the IEEE 4-bus test feeder, and K-fold cross-validation is used for performance evaluation to prevent overfitting. To identify the better algorithm, the accuracy, precision, geometric mean, F-measure, sensitivity, and specificity are measured.
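K-fold cross-validation splits the data into K folds and uses each fold once as the test set while training on the rest. A minimal sketch of the index bookkeeping follows; the function name and the choice of K=5 are assumptions, since the abstract does not give the value of K.

```python
def k_fold_indices(n_samples, k=5):
    """Return a list of (train_idx, test_idx) pairs, one per fold.

    Indices are taken in order; in practice the data are usually shuffled first.
    """
    indices = list(range(n_samples))
    # Fold sizes differ by at most one when n_samples is not divisible by k.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        folds.append((train_idx, test_idx))
        start += size
    return folds


folds = k_fold_indices(n_samples=10, k=5)
for train_idx, test_idx in folds:
    print("test:", test_idx, "train size:", len(train_idx))
```

Every sample appears in exactly one test fold, so the averaged score uses all of the data for evaluation without ever testing on a sample the model was trained on.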


Data mining usually refers to discovering specific patterns in, or analyzing, a large dataset. Classification is an efficient data mining technique in which the classes that data are assigned to are predefined from existing datasets. Classifying medical records by symptoms with a computerized method, and storing the predicted information digitally, is of great importance in diagnosing various diseases. This paper concentrates on finding the algorithm with the highest accuracy so that a cost-effective algorithm can be identified. The classification algorithms are compared on how accurately they find the correct records according to the diagnosis report and on their execution rate, which indicates how quickly the records are classified. The algorithms used in this study are the Naïve Bayes classifier, the C4.5 tree classifier, and K-Nearest Neighbor (KNN), compared to determine which is best suited to classifying any kind of medical dataset. The Breast Cancer, Iris, and Hypothyroid datasets are used to determine which of the three algorithms classifies the datasets with the highest accuracy in finding the records of patients with particular health problems. The experimental results, presented as tables and graphs, show the performance and importance of the Naïve Bayes, C4.5, and K-Nearest Neighbor algorithms. On these results, C4.5 performs considerably better than Naïve Bayes and K-Nearest Neighbor.


Author(s):  
Rajni Rajni ◽  
Amandeep Amandeep

Diabetes is a major concern all over the world, and its prevalence is increasing at a fast pace. People can avoid diabetes at an early stage without any test. The goal of this paper is to predict, at an early stage, the probability that a person is at risk of diabetes, which could have a great impact on quality of life. The datasets are Pima Indians diabetes and Cleveland coronary illness and consist of 768 records. Although a number of solutions are available for extracting information from huge datasets and predicting the possibility of diabetes, their accuracy is far from satisfactory. To achieve the highest accuracy, the zero-probability problem generally faced by naïve Bayes analysis needs to be addressed suitably. The proposed framework, RB-Bayes, aims to extract the required information with high accuracy while surviving the zero-probability problem, and its accuracy is compared with that of other methods such as Support Vector Machine, Naive Bayes, and K Nearest Neighbor. The mean is calculated to handle missing data, and the probability is calculated for yes (positive) and no (negative); the higher of the two decides the value for the tuple. It is mostly used in text classification. The outcomes on the Pima Indian diabetes dataset demonstrate that the proposed methodology improves precision compared with other supervised techniques. The accuracy of the proposed methodology on the large dataset is 72.9%.
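The zero-probability problem arises when a feature value never occurs with a class in the training data, so its estimated likelihood is 0 and the whole naïve Bayes product collapses to 0. The abstract does not describe how RB-Bayes handles this, so the sketch below shows additive (Laplace) smoothing instead, a standard remedy swapped in purely for illustration. The function name and the sample values are hypothetical.

```python
from collections import Counter


def smoothed_likelihood(feature_values, value, n_categories, alpha=1.0):
    """Estimate P(value | class) with additive (Laplace) smoothing.

    feature_values: values of one categorical feature observed for one class.
    n_categories:   number of distinct values the feature can take.
    alpha:          pseudo-count added to every category (1.0 = Laplace).
    """
    counts = Counter(feature_values)
    return (counts[value] + alpha) / (len(feature_values) + alpha * n_categories)


# Hypothetical categorical feature observed for the "yes" class.
observed = ["high", "high", "medium", "high"]

# Unsmoothed estimate: "low" was never seen, so the likelihood is exactly 0.
p_low_raw = observed.count("low") / len(observed)

# Smoothed estimate: (0 + 1) / (4 + 1 * 3), which is small but non-zero.
p_low = smoothed_likelihood(observed, "low", n_categories=3)

print(p_low_raw, p_low)
```

With smoothing, the likelihoods over all three categories still sum to 1, so the estimates remain a valid probability distribution.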


2019 ◽  
Vol 4 (1) ◽  
Author(s):  
Deny Haryadi ◽  
Rila Mandala

The price of palm oil can rise, fall, or stay the same each day because of the factors that influence it, such as the prices of other vegetable oils (soybean oil and canola oil), the world crude oil price, and the real exchange rate of the US dollar against the currencies of producer countries (rupiah, ringgit, and Canadian dollar) or of consumer countries (rupee). A reasonably accurate palm oil price prediction is therefore needed so that investors can earn returns in line with their plans. The aim of this study is to compare the accuracy, precision, and recall produced by the Naïve Bayes, Support Vector Machine, and K-Nearest Neighbor algorithms on the problem of predicting palm oil prices for investment. Based on the tests carried out in this study, the Support Vector Machine algorithm achieved the highest accuracy, precision, and recall compared with the Naïve Bayes and K-Nearest Neighbor algorithms. The highest accuracy in this study was 82.46%, with the highest precision at 86% and the highest recall at 89.06%.


2010 ◽  
Vol 5 (2) ◽  
pp. 133-137 ◽  
Author(s):  
Mohammed J. Islam ◽  
Q. M. Jonathan Wu ◽  
Majid Ahmadi ◽  
Maher A. SidAhmed
