Feature Selection on K-Nearest Neighbor Algorithm Using Similarity Measure

Author(s):  
Ratih Puspadini ◽  
Herman Mawengkang ◽  
Syahril Efendi
2020 ◽  
Vol 4 (2) ◽  
pp. 39-47
Author(s):  
Junta Zeniarja ◽  
Anisatawalanita Ukhifahdhina ◽  
Abu Salam

Heart is one of the essential organs that assume a significant part in the human body. However, heart can also cause diseases that affect the death. World Health Organization (WHO) data from 2012 showed that all deaths from cardiovascular disease (vascular) 7.4 million (42.3%) were caused by heart disease. Increased cases of heart disease require a step as an early prevention and prevention efforts by making early diagnosis of heart disease. In this research will be done early diagnosis of heart disease by using data mining process in the form of classification. The algorithm used is K-Nearest Neighbor algorithm with Forward Selection method. The K-Nearest Neighbor algorithm is used for classification in order to obtain a decision result from the diagnosis of heart disease, while the forward selection is used as a feature selection whose purpose is to increase the accuracy value. Forward selection works by removing some attributes that are irrelevant to the classification process. In this research the result of accuracy of heart disease diagnosis with K-Nearest Neighbor algorithm is 73,44%, while result of K-Nearest Neighbor algorithm accuracy with feature selection method 78,66%. It is clear that the incorporation of the K-Nearest Neighbor algorithm with the forward selection method has improved the accuracy result. Keywords - K-Nearest Neighbor, Classification, Heart Disease, Forward Selection, Data Mining


2018 ◽  
Author(s):  
I Wayan Agus Surya Darma

Balinese character recognition is a technique to recognize feature or pattern of Balinese character. Feature of Balinese character is generated through feature extraction process. This research using handwritten Balinese character. Feature extraction is a process to obtain the feature of character. In this research, feature extraction process generated semantic and direction feature of handwritten Balinese character. Recognition is using K-Nearest Neighbor algorithm to recognize 81 handwritten Balinese character. The feature of Balinese character images tester are compared with reference features. Result of the recognition system with K=3 and reference=10 is achieved a success rate of 97,53%.


2021 ◽  
Vol 11 (15) ◽  
pp. 7132
Author(s):  
Jianfeng Xi ◽  
Shiqing Wang ◽  
Tongqiang Ding ◽  
Jian Tian ◽  
Hui Shao ◽  
...  

Whether in developing or developed countries, traffic accidents caused by freight vehicles are responsible for more than 10% of deaths of all traffic accidents. Fatigue driving is one of the main causes of freight vehicle accidents. Existing fatigue driving studies mostly use vehicle operating data from experiments or simulation data, exposing certain drawbacks in the validity and reliability of the models used. This study collected a large quantity of real driving data to extract sample data under different fatigue degrees. The parameters of vehicle operating data were selected based on significant driver fatigue degrees. The k-nearest neighbor algorithm was used to establish the detection model of fatigue driving behaviors, taking into account influence of the number of training samples and other parameters in the accuracy of fatigue driving behavior detection. With the collected operating data of 50 freight vehicles in the past month, the fatigue driving behavior detection models based on the k-nearest neighbor algorithm and the commonly used BP neural network proposed in this paper were tested, respectively. The analysis results showed that the accuracy of both models are 75.9%, but the fatigue driving detection model based on the k-nearest neighbor algorithm is more reliable.


Sign in / Sign up

Export Citation Format

Share Document