Email Spam

2020 ◽

Vol 4 (2) ◽

pp. 377-383

Author(s):

Eko Laksono ◽

Achmad Basuki ◽

Fitra Bachtiar

Keyword(s):

Frequency Distribution ◽

Confusion Matrix ◽

High Accuracy ◽

K Value ◽

A Value ◽

Optimal Value ◽

Classification Evaluation ◽

Email Spam ◽

Email Classification

There are many cases of email abuse that have the potential to harm others. This email abuse is commonly known as spam, which contains advertisements, phishing scams, and even malware. This study purpose to know the classification of email spam with ham using the KNN method as an effort to reduce the amount of spam. KNN can classify spam or ham in an email by checking it using a different K value approach. The results of the classification evaluation using confusion matrix resulted in the KNN method with a value of K = 1 having the highest accuracy value of 91.4%. From the results of the study, it is known that the optimization of the K value in KNN using frequency distribution clustering can produce high accuracy of 100%, while k-means clustering produces an accuracy of 99%. So based on the results of the existing accuracy values, the frequency distribution clustering and k-means clustering can be used to optimize the K-optimal value of the KNN in the classification of existing spam emails.

Download Full-text

A Comprehensive Review on Email Spam Classification using Machine Learning Algorithms

2021 International Conference on Information Networking (ICOIN) ◽

10.1109/icoin50884.2021.9334020 ◽

2021 ◽

Author(s):

Mansoor RAZA ◽

Nathali Dilshani Jayasinghe ◽

Muhana Magboul Ali Muslam

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Comprehensive Review ◽

Email Spam

Download Full-text

Comparative Analysis of Detection of Email Spam With the Aid of Machine Learning Approaches

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/1022/1/012113 ◽

2021 ◽

Vol 1022 ◽

pp. 012113

Author(s):

Mangena Venu Madhavan ◽

Sagar Pande ◽

Pooja Umekar ◽

Tushar Mahore ◽

Dhiraj Kalyankar

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Learning Approaches ◽

Email Spam

Download Full-text

Enhancement of email spam detection using improved deep learning algorithms for cyber security

Journal of Computer Security ◽

10.3233/jcs-200111 ◽

2021 ◽

pp. 1-34

Author(s):

Kadam Vikas Samarthrao ◽

Vandana M. Rohokale

Keyword(s):

Feature Selection ◽

Deep Learning ◽

Visual Features ◽

Spam Detection ◽

Learning Approaches ◽

Learning Technique ◽

Text Features ◽

Optimal Feature Selection ◽

Optimal Feature ◽

Email Spam

Email has sustained to be an essential part of our lives and as a means for better communication on the internet. The challenge pertains to the spam emails residing a large amount of space and bandwidth. The defect of state-of-the-art spam filtering methods like misclassification of genuine emails as spam (false positives) is the rising challenge to the internet world. Depending on the classification techniques, literature provides various algorithms for the classification of email spam. This paper tactics to develop a novel spam detection model for improved cybersecurity. The proposed model involves several phases like dataset acquisition, feature extraction, optimal feature selection, and detection. Initially, the benchmark dataset of email is collected that involves both text and image datasets. Next, the feature extraction is performed using two sets of features like text features and visual features. In the text features, Term Frequency-Inverse Document Frequency (TF-IDF) is extracted. For the visual features, color correlogram and Gray-Level Co-occurrence Matrix (GLCM) are determined. Since the length of the extracted feature vector seems to the long, the optimal feature selection process is done. The optimal feature selection is performed by a new meta-heuristic algorithm called Fitness Oriented Levy Improvement-based Dragonfly Algorithm (FLI-DA). Once the optimal features are selected, the detection is performed by the hybrid learning technique that is composed of two deep learning approaches named Recurrent Neural Network (RNN) and Convolutional Neural Network (CNN). For improving the performance of existing deep learning approaches, the number of hidden neurons of RNN and CNN is optimized by the same FLI-DA. Finally, the optimized hybrid learning technique having CNN and RNN classifies the data into spam and ham. The experimental outcomes show the ability of the proposed method to perform the spam email classification based on improved deep learning.

Download Full-text

Spam classification: a comparative analysis of different boosted decision tree approaches

Journal of Systems and Information Technology ◽

10.1108/jsit-11-2017-0105 ◽

2018 ◽

Vol 20 (3) ◽

pp. 298-105 ◽

Cited By ~ 4

Author(s):

Shrawan Kumar Trivedi ◽

Prabin Kumar Panigrahi

Keyword(s):

Decision Tree ◽

False Positive ◽

False Positive Rate ◽

False Negative ◽

The Body ◽

Content Type ◽

Performance Accuracy ◽

Tree Classifier ◽

Boosted Decision Tree ◽

Email Spam

PurposeEmail spam classification is now becoming a challenging area in the domain of text classification. Precise and robust classifiers are not only judged by classification accuracy but also by sensitivity (correctly classified legitimate emails) and specificity (correctly classified unsolicited emails) towards the accurate classification, captured by both false positive and false negative rates. This paper aims to present a comparative study between various decision tree classifiers (such as AD tree, decision stump and REP tree) with/without different boosting algorithms (bagging, boosting with re-sample and AdaBoost).Design/methodology/approachArtificial intelligence and text mining approaches have been incorporated in this study. Each decision tree classifier in this study is tested on informative words/features selected from the two publically available data sets (SpamAssassin and LingSpam) using a greedy step-wise feature search method.FindingsOutcomes of this study show that without boosting, the REP tree provides high performance accuracy with the AD tree ranking as the second-best performer. Decision stump is found to be the under-performing classifier of this study. However, with boosting, the combination of REP tree and AdaBoost compares favourably with other classification models. If the metrics false positive rate and performance accuracy are taken together, AD tree and REP tree with AdaBoost were both found to carry out an effective classification task. Greedy stepwise has proven its worth in this study by selecting a subset of valuable features to identify the correct class of emails.Research limitations/implicationsThis research is focussed on the classification of those email spams that are written in the English language only. The proposed models work with content (words/features) of email data that is mostly found in the body of the mail. Image spam has not been included in this study. Other messages such as short message service or multi-media messaging service were not included in this study.Practical implicationsIn this research, a boosted decision tree approach has been proposed and used to classify email spam and ham files; this is found to be a highly effective approach in comparison with other state-of-the-art modes used in other studies. This classifier may be tested for different applications and may provide new insights for developers and researchers.Originality/valueA comparison of decision tree classifiers with/without ensemble has been presented for spam classification.

Download Full-text

Header Based Email Spam Detection Framework Using Support Vector Machine (SVM) Technique

Advances in Intelligent Systems and Computing - Recent Advances on Soft Computing and Data Mining ◽

10.1007/978-3-030-36056-6_6 ◽

2019 ◽

pp. 57-65 ◽

Cited By ~ 1

Author(s):

Siti Aqilah Khamis ◽

Cik Feresa Mohd Foozy ◽

Mohd Firdaus Ab Aziz ◽

Nordiana Rahim

Keyword(s):

Support Vector Machine ◽

Support Vector ◽

Spam Detection ◽

Email Spam

Download Full-text

Email Spam Detection Using Combination of Particle Swarm Optimization and Artificial Neural Network and Support Vector Machine

International Journal of Modern Education and Computer Science ◽

10.5815/ijmecs.2016.07.08 ◽

2016 ◽

Vol 8 (7) ◽

pp. 68-74 ◽

Cited By ~ 6

Author(s):

Mohammad Zavvar ◽

◽

Meysam Rezaei ◽

Shole Garavand

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Support Vector Machine ◽

Particle Swarm Optimization ◽

Particle Swarm ◽

Support Vector ◽

Spam Detection ◽

Swarm Optimization ◽

Artificial Neural ◽

Email Spam

Download Full-text

Enhanced Bio-Inspired Algorithms for Detecting and Filtering Spam

Global Implications of Emerging Technology Trends - Advances in IT Standards and Standardization Research ◽

10.4018/978-1-5225-4944-4.ch011 ◽

2018 ◽

pp. 179-215

Author(s):

Hadj Ahmed Bouarara

Keyword(s):

Electronic Communication ◽

Distance Measure ◽

Machine Learning Algorithms ◽

Spam Detection ◽

Social Bees ◽

Representation Technique ◽

Sensitive Parameters ◽

Measure Entropy ◽

Email Spam ◽

F Measure

The internet era promotes electronic commerce and facilitates access to many services. In today's digital society, the explosion in communication has revolutionized the field of electronic communication. Unfortunately, this technology has become incontestably the original source of malicious activities, especially the plague called undesirables email (SPAM) that has grown tremendously in the last few years. This chapter unveils fresh bio-inspired techniques (artificial social cockroaches [ASC], artificial haemostasis system [AHS], and artificial heart lungs system [AHLS]) and their application for SPAM detection. For the experimentation, the authors used the benchmark SMS Spam corpus V.0.1 and the validation measures (recall, precision, f-measure, entropy, accuracy, and error). They optimize the sensitive parameters of each algorithm (text representation technique, distance measure, weightings, and threshold). The results are positive compared to the result of artificial social bees and machine-learning algorithms (decision tree C4.5 and K-means).

Download Full-text

Analisa Dan Implementasi Security Mail Server

JURNAL FASILKOM ◽

10.37859/jf.v10i1.1906 ◽

2020 ◽

Vol 10 (1) ◽

pp. 25-32

Author(s):

Harun Mukhtar ◽

Daniel Adi Putra Sitorus ◽

Yulia Fatma

Keyword(s):

Security Design ◽

Final Project ◽

Security Issues ◽

Mail Server ◽

System Administrator ◽

Server Application ◽

E Mail ◽

Email Spam

Mail server is one of the most widely used server functions in the company. This discusses e-mail itself which can reduce mailing costs, is more efficient than manual communication and can be used as attachments that are useful as a supplement and additional documents related to the contents of e-mail. Zimbra is a mail server application that provides complete features and also makes it easy to install mail server management, also mail server security issues are a factor that must be considered by the system administrator. The security design for e-mail servers addresses the importance of being able to prevent spam e-mail attacks that can fill e-mail servers and make mail server performance faster. Because a good mail server security can optimize the performance of the mail server itself. In this final project, the work and implementation of the zimbra mail server security will be carried out specifically for handling email spam. The zimbra email server will analyze its security against spam email attacks, so that it can function as an email server on the company.

Download Full-text

Supervised Machine Learning Classifier for Email Spam Filtering

Innovations in Computer Science and Engineering - Lecture Notes in Networks and Systems ◽

10.1007/978-981-13-7082-3_41 ◽

2019 ◽

pp. 357-363 ◽

Cited By ~ 1

Author(s):

Deepika Mallampati ◽

K. Chandra Shekar ◽

K. Ravikanth

Keyword(s):

Machine Learning ◽

Supervised Machine Learning ◽

Spam Filtering ◽

Learning Classifier ◽

Email Spam

Download Full-text

Optimization of K Value in KNN Algorithm for Spam and Ham Email Classification

A Comprehensive Review on Email Spam Classification using Machine Learning Algorithms

Comparative Analysis of Detection of Email Spam With the Aid of Machine Learning Approaches

Enhancement of email spam detection using improved deep learning algorithms for cyber security

Spam classification: a comparative analysis of different boosted decision tree approaches

Header Based Email Spam Detection Framework Using Support Vector Machine (SVM) Technique

Email Spam Detection Using Combination of Particle Swarm Optimization and Artificial Neural Network and Support Vector Machine

Enhanced Bio-Inspired Algorithms for Detecting and Filtering Spam

Analisa Dan Implementasi Security Mail Server

Supervised Machine Learning Classifier for Email Spam Filtering

Export Citation Format