Optimization for overfitting problems in spam email classification based on parameter adjusting

2021 ◽  
Author(s):  
Yusi Wei ◽  
Douhao Ma ◽  
Juncheng Dong
Keyword(s):  
2020 ◽  
Vol 4 (2) ◽  
pp. 377-383
Author(s):  
Eko Laksono ◽  
Achmad Basuki ◽  
Fitra Bachtiar

There are many cases of email abuse that have the potential to harm others. This email abuse is commonly known as spam, which contains advertisements, phishing scams, and even malware. This study purpose to know the classification of email spam with ham using the KNN method as an effort to reduce the amount of spam. KNN can classify spam or ham in an email by checking it using a different K value approach. The results of the classification evaluation using confusion matrix resulted in the KNN method with a value of K = 1 having the highest accuracy value of 91.4%. From the results of the study, it is known that the optimization of the K value in KNN using frequency distribution clustering can produce high accuracy of 100%, while k-means clustering produces an accuracy of 99%. So based on the results of the existing accuracy values, the frequency distribution clustering and k-means clustering can be used to optimize the K-optimal value of the KNN in the classification of existing spam emails.


2018 ◽  
Vol 9 (4) ◽  
pp. 3259-3269 ◽  
Author(s):  
Eman M. Bahgat ◽  
Sherine Rady ◽  
Walaa Gad ◽  
Ibrahim F. Moawad

2012 ◽  
Vol 56 (3) ◽  
pp. 741-751 ◽  
Author(s):  
Juan Carlos Gomez ◽  
Marie-Francine Moens
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document