Study of distance metrics on k - nearest neighbor algorithm for star categorization

Abstract: Classification of stars is essential for investigating their characteristics and behavior. Performing classifications manually is error-prone and time-consuming. Machine learning provides a computerized solution for handling huge volumes of data with minimal human input. k-Nearest Neighbor (kNN) is one of the simplest supervised learning approaches in machine learning. This paper studies and analyzes the performance of the kNN algorithm on the star dataset. We analyzed the accuracy of the kNN algorithm across various distance metrics and a range of k values. Minkowski, Euclidean, Manhattan, Chebyshev, Cosine, Jaccard, and Hamming distances were applied to kNN classifiers for different k values. Cosine distance was observed to work better than the other distance metrics for star categorization.
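As a rough illustration of how the choice of distance metric can change a kNN prediction, the following minimal sketch implements a majority-vote kNN in plain Python with four of the metrics named in the abstract. The toy points, labels, and function names are hypothetical; they are not taken from the paper's star dataset, and the data are contrived so that cosine distance disagrees with the other metrics.

```python
import math
from collections import Counter

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

def cosine_distance(a, b):
    # 1 - cosine similarity; assumes neither vector is all zeros
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)

def knn_predict(train, query, k, metric):
    """train is a list of (features, label) pairs; returns the majority label."""
    neighbors = sorted(train, key=lambda item: metric(item[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]

# Hypothetical toy data: class "A" lies along the diagonal, far from the origin;
# class "B" lies near the horizontal axis, close to the origin.
train = [
    ((2.0, 2.0), "A"), ((3.0, 3.0), "A"), ((4.0, 4.0), "A"),
    ((0.5, 0.0), "B"), ((1.0, 0.1), "B"), ((1.5, 0.0), "B"),
]
query = (0.6, 0.5)  # points in the direction of "A" but sits near "B"

for name, metric in [("euclidean", euclidean), ("manhattan", manhattan),
                     ("chebyshev", chebyshev), ("cosine", cosine_distance)]:
    print(name, knn_predict(train, query, 3, metric))
```

Cosine distance compares only the direction of the feature vectors, so it can rank neighbors very differently from magnitude-sensitive metrics; whether that helps depends on the dataset.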


Application of Machine Learning Approaches for the Design and Study of Anticancer Drugs

Current Drug Targets ◽

10.2174/1389450119666180809122244 ◽

2019 ◽

Vol 20 (5) ◽

pp. 488-500 ◽

Cited By ~ 6

Author(s):

Yan Hu ◽

Yi Lu ◽

Shuo Wang ◽

Mengying Zhang ◽

Xiaosheng Qu ◽

...

Keyword(s):

Machine Learning ◽

Drug Design ◽

Anticancer Drugs ◽

Nearest Neighbor ◽

Cost Effective ◽

Support Vector ◽

Learning Approaches ◽

K Nearest Neighbor ◽

Activity Prediction ◽

Linear Discriminant

Background: Globally, the numbers of cancer patients and cancer deaths continue to increase yearly, and cancer has therefore become one of the world's leading causes of morbidity and mortality. In recent years, the study of anticancer drugs has become one of the most popular medical topics. Objective: In this review, in order to study the application of machine learning in predicting anticancer drug activity, several machine learning approaches were selected, namely Linear Discriminant Analysis (LDA), Principal Component Analysis (PCA), Support Vector Machine (SVM), Random Forest (RF), k-Nearest Neighbor (kNN), and Naïve Bayes (NB), and examples of their applications in anticancer drug design are listed. Results: Machine learning contributes substantially to anticancer drug design and helps researchers save time and cost. However, it can only be an assisting tool for drug design. Conclusion: This paper introduces the application of machine learning approaches in anticancer drug design. Many examples of successful identification and prediction in the area of anticancer drug activity are discussed, and anticancer drug research is still actively progressing. Moreover, the merits of some web servers related to anticancer drugs are mentioned.


Identification and Classification of Technical Lignins by means of Principle Component Analysis and k‐Nearest Neighbor Algorithm

Chemistry–Methods ◽

10.1002/cmtd.202100065 ◽

2021 ◽

Vol 1 (8) ◽

pp. 352-353

Author(s):

Friedrich Fink ◽

Franziska Emmerling ◽

Jana Falkenhagen

Keyword(s):

Principle Component Analysis ◽

Nearest Neighbor ◽

Component Analysis ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Principle Component ◽

K Nearest Neighbor Algorithm ◽

Technical Lignins


High-Speed and Accurate Meat Composition Imaging by Mechanically-Flexible Electrical Impedance Tomography With k-Nearest Neighbor and Fuzzy k-Means Machine Learning Approaches

IEEE Access ◽

10.1109/access.2021.3064315 ◽

2021 ◽

Vol 9 ◽

pp. 38792-38801

Author(s):

P. N. Darma ◽

M. Takei

Keyword(s):

Machine Learning ◽

Electrical Impedance Tomography ◽

High Speed ◽

Electrical Impedance ◽

Nearest Neighbor ◽

Learning Approaches ◽

K Nearest Neighbor ◽

Impedance Tomography ◽

Meat Composition


Classification of News Articles for Learning Using the K-Nearest Neighbor Algorithm

10.1109/icet53279.2021.9575101 ◽

2021 ◽

Author(s):

Utomo Pujianto ◽

Harits ar Rosyid ◽

Muhammad Khoirul Anam

Keyword(s):

Nearest Neighbor ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

K Nearest Neighbor Algorithm


The Classification of Skateboarding Tricks : A Transfer Learning and Machine Learning Approach

Mekatronika ◽

10.15282/mekatronika.v2i2.6683 ◽

2020 ◽

Vol 2 (2) ◽

pp. 1-12

Author(s):

Muhammad Nur Aiman Shapiee ◽

Muhammad Ar Rahim Ibrahim ◽

Muhammad Amirul Abdullah ◽

Rabiu Muazu Musa ◽

Noor Azuan Abu Osman ◽

...

Keyword(s):

Machine Learning ◽

Classification Accuracy ◽

Nearest Neighbor ◽

Olympic Games ◽

Learning Approach ◽

K Nearest Neighbor ◽

Test Dataset ◽

Machine Learning Approach ◽

Competitive Games

Skateboarding has reached new heights, particularly with its first appearance at the now-delayed Tokyo Summer Olympic Games. Given the scale of such competitive games, advanced and innovative assessment approaches have increasingly gained attention from relevant stakeholders, particularly in the interest of more objective-based evaluation. This study aims to classify skateboarding tricks, specifically Frontside 180, Kickflip, Ollie, Nollie Front Shove-it, and Pop Shove-it, by integrating image processing, Transfer Learning (TL) for feature extraction, and a traditional Machine Learning (ML) classifier. A male skateboarder performed the five types of trick repeatedly, and a YI Action camera captured the movement from a distance of 1.26 m. Features were then engineered and extracted from the image dataset by means of three TL models and subsequently classified using a k-Nearest Neighbor (k-NN) classifier. The initial experiments showed that MobileNet, NASNetMobile, and NASNetLarge coupled with optimized k-NN classifiers attain a classification accuracy (CA) of 95%, 92% and 90%, respectively, on the test dataset. In addition, the robustness evaluation showed that the MobileNet+k-NN pipeline is more robust, as it provided a better average CA than the other pipelines. The study demonstrates that the proposed approach can classify skateboard tricks adequately and could, in the long run, support judges in giving more objective-based decisions.


Business Intelligence using the K-Nearest Neighbor Algorithm to Analyze Customer Behavior in Online Crowdfunding Systems

E3S Web of Conferences ◽

10.1051/e3sconf/202020216005 ◽

2020 ◽

Vol 202 ◽

pp. 16005

Author(s):

Chashif Syadzali ◽

Suryono Suryono ◽

Jatmiko Endro Suseno

Keyword(s):

Business Intelligence ◽

Nearest Neighbor ◽

Customer Behavior ◽

Training Data ◽

Business Strategies ◽

Intelligence Analysis ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

K Nearest Neighbor Algorithm

Customer behavior classification can assist companies in conducting business intelligence analysis. Data mining techniques can classify customer behavior using the K-Nearest Neighbor algorithm based on the customer's life cycle, which consists of four categories: prospect, responder, active, and former. The data used for classification include age, gender, number of donations, donation retention, and number of user visits. Classifying 2,114 records yields the following share for each customer category: active 1.18%, prospect 8.99%, responder 4.26%, and former 85.57%. Testing system accuracy over a range from K = 1 to K = 20 gives the highest accuracy, 94.3731%, at K = 4. The resulting classification of user behavior can serve as a business intelligence analysis that helps companies determine business strategies by identifying the optimal target market.
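The sweep over K described above can be sketched as follows: evaluate a kNN classifier on a held-out set for each K from 1 to 20 and keep the K with the highest accuracy. This is only a minimal illustration in plain Python; the two-feature toy records, the labels, and the helper names are hypothetical and do not reproduce the study's data or its reported 94.3731% figure.

```python
from collections import Counter

def knn_predict(train, query, k):
    """Majority vote among the k nearest training records (squared Euclidean)."""
    neighbors = sorted(
        train,
        key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], query)),
    )[:k]
    return Counter(label for _, label in neighbors).most_common(1)[0][0]

def accuracy(train, test, k):
    correct = sum(knn_predict(train, x, k) == y for x, y in test)
    return correct / len(test)

def best_k(train, test, k_values):
    scores = {k: accuracy(train, test, k) for k in k_values}
    # Highest accuracy wins; ties go to the smaller k.
    best = max(scores, key=lambda k: (scores[k], -k))
    return best, scores

# Hypothetical records: (number of donations, number of visits) -> category.
train = [((i % 3, i // 3), "former") for i in range(12)]
train += [((10 + i % 3, 10 + i // 3), "active") for i in range(12)]
train.append(((1, 1.2), "active"))  # one noisy record inside the "former" cluster

test = [((1, 1.15), "former"), ((11, 11), "active"),
        ((0, 3), "former"), ((12, 10), "active")]

best, scores = best_k(train, test, range(1, 21))
print("best K:", best, "accuracy:", scores[best])
```

On this toy data a single noisy neighbor misleads very small K values, which is one reason the best K is usually found empirically rather than fixed in advance.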


Classification of Lower Back Pain Using K-Nearest Neighbor Algorithm

2018 6th International Conference on Cyber and IT Service Management (CITSM) ◽

10.1109/citsm.2018.8674361 ◽

2018 ◽

Cited By ~ 1

Author(s):

Green Arther Sandag ◽

Natalia Elisabet Tedry ◽

Steven Lolong

Keyword(s):

Back Pain ◽

Lower Back Pain ◽

Nearest Neighbor ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Lower Back ◽

K Nearest Neighbor Algorithm


Identification and Classification of Technical Lignins by means of Principle Component Analysis and k‐Nearest Neighbor Algorithm

Chemistry–Methods ◽

10.1002/cmtd.202100028 ◽

2021 ◽

Author(s):

Friedrich Fink ◽

Franziska Emmerling ◽

Jana Falkenhagen

Keyword(s):

Principle Component Analysis ◽

Nearest Neighbor ◽

Component Analysis ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Principle Component ◽

K Nearest Neighbor Algorithm ◽

Technical Lignins


A Robust Physical Exercise Recognition System Using Machine Learning Approach

10.46603/ejcee.v1i1.13 ◽

2020 ◽

Vol 1 (1) ◽

pp. 17-21

Author(s):

Steve Oscar ◽

Mohammed Nazim Uddin

Keyword(s):

Physical Exercise ◽

Nearest Neighbor ◽

Principal Component ◽

Recognition System ◽

Accelerometer Data ◽

K Nearest Neighbor ◽

Machine Learning Approach ◽

Principal Component Analysis Algorithm ◽

K Nearest Neighbor Algorithm

Modern life is becoming more closely linked to our devices, and work is being done in a more regulated way. As life becomes more complicated, it is becoming challenging to keep track of human health and fitness, leading to unexpected illnesses and diseases. Moreover, a lack of activity monitoring and corresponding reminders is preventing the adoption of a healthier lifestyle. This research provides a practical approach for identifying human activity using accelerometer data obtained from wearable devices. The model automatically finds patterns among 33 different physical exercises, such as running, rowing, cycling, and jogging, and correctly identifies them. The principal component analysis algorithm was applied to the statistical features to make the system more robust. Classification of the physical exercises was performed on the reduced features using WEKA. An overall accuracy of 85.51% was obtained using the K-Nearest Neighbor algorithm with 10-fold cross-validation, while Random Forest reached 84% accuracy. The accuracy obtained was better than that of previous models and could help recognition systems monitor user activity more precisely.
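To make the evaluation protocol concrete, here is a minimal sketch of 10-fold cross-validation around a kNN classifier, written in plain Python rather than WEKA. The PCA step is omitted, and the two-feature toy samples, the exercise labels, and the function names are hypothetical stand-ins, not the study's accelerometer features.

```python
from collections import Counter

def k_fold_indices(n_samples, n_folds):
    """Yield (train_idx, test_idx); fold f tests every sample with i % n_folds == f."""
    for fold in range(n_folds):
        test_idx = [i for i in range(n_samples) if i % n_folds == fold]
        train_idx = [i for i in range(n_samples) if i % n_folds != fold]
        yield train_idx, test_idx

def knn_predict(train_X, train_y, query, k):
    order = sorted(
        range(len(train_X)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], query)),
    )
    return Counter(train_y[i] for i in order[:k]).most_common(1)[0][0]

def cross_val_accuracy(X, y, n_folds=10, k=3):
    """Fraction of samples classified correctly when each is held out exactly once."""
    correct = 0
    for train_idx, test_idx in k_fold_indices(len(X), n_folds):
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        correct += sum(
            knn_predict(train_X, train_y, X[i], k) == y[i] for i in test_idx
        )
    return correct / len(X)

# Hypothetical statistical features per recording: (mean, standard deviation).
X = [(0.1 * i, 1.0) for i in range(20)] + [(0.1 * i, 5.0) for i in range(20)]
y = ["jogging"] * 20 + ["rowing"] * 20

print("10-fold accuracy:", cross_val_accuracy(X, y))
```

Every sample is used for testing exactly once and for training in the other nine folds, so the reported accuracy is an average over all held-out predictions.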


PREDICTION OF CORONARY ARTERY DISEASE BASED ON ENSEMBLE LEARNING APPROACHES AND CO-EXPRESSED OBSERVATIONS

Journal of Mechanics in Medicine and Biology ◽

10.1142/s0219519416400108 ◽

2016 ◽

Vol 16 (01) ◽

pp. 1640010 ◽

Cited By ~ 3

Author(s):

YING-TSANG LO ◽

HAMIDO FUJITA ◽

TUN-WEN PAI

Keyword(s):

Machine Learning ◽

Coronary Artery Disease ◽

Coronary Artery ◽

Nearest Neighbor ◽

Prediction Method ◽

Medical Decision ◽

Learning Approaches ◽

K Nearest Neighbor ◽

Artery Disease ◽

Voting Mechanism

Background: Coronary artery disease (CAD) is one of the most representative cardiovascular diseases. Early and accurate prediction of CAD based on physiological measurements can reduce the risk of heart attack through medicine therapy, healthy diet, and regular physical activity. Methods: Four heart disease datasets from the UC Irvine Machine Learning Repository were combined and re-examined to remove incomplete entries, and a total of 822 cases were utilized in this study. Seven machine learning methods, including Naïve Bayes, artificial neural networks (ANNs), sequential minimal optimization (SMO), k-nearest neighbor (KNN), AdaBoost, J48, and random forest, were adopted to analyze the collected datasets for CAD prediction. By combining co-expressed observations and an ensemble voting mechanism, we designed and evaluated a new medical decision classifier for CAD prediction. The TOPSIS (Technique for Order Preference by Similarity to an Ideal Solution) algorithm was applied to determine the best prediction method for CAD diagnosis. Results: Features of systolic blood pressure, cholesterol, heart rate, and ST depression are considered to be the most significant differences between patients with and without CADs. We show that the prediction capability of seven machine learning classifiers can be enhanced by integrating combinations of observed co-expressed features. Finally, compared to the use of any single classifier, the proposed voting mechanism achieved optimal performance according to TOPSIS.
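The abstract does not spell out the voting rule, so the following is only a sketch of the simplest variant, plurality voting over the per-sample predictions of several classifiers. The classifier outputs, labels, and the `majority_vote` helper are hypothetical, and the co-expressed feature selection described in the paper is not modeled here.

```python
from collections import Counter

def majority_vote(predictions):
    """Combine per-classifier label lists (all the same length) sample by sample."""
    combined = []
    for sample_votes in zip(*predictions):
        # Assumed rule: the most frequent label among the classifiers wins.
        combined.append(Counter(sample_votes).most_common(1)[0][0])
    return combined

# Hypothetical outputs of three classifiers for the same four patients.
naive_bayes   = ["CAD", "healthy", "CAD", "healthy"]
knn           = ["CAD", "CAD", "healthy", "healthy"]
random_forest = ["healthy", "CAD", "CAD", "healthy"]

print(majority_vote([naive_bayes, knn, random_forest]))
```

Using an odd number of classifiers on a two-class problem avoids ties; with an even number, a tie-breaking rule would have to be chosen explicitly.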
