Deep Learning for Image Spam Detection

Email has sustained to be an essential part of our lives and as a means for better communication on the internet. The challenge pertains to the spam emails residing a large amount of space and bandwidth. The defect of state-of-the-art spam filtering methods like misclassification of genuine emails as spam (false positives) is the rising challenge to the internet world. Depending on the classification techniques, literature provides various algorithms for the classification of email spam. This paper tactics to develop a novel spam detection model for improved cybersecurity. The proposed model involves several phases like dataset acquisition, feature extraction, optimal feature selection, and detection. Initially, the benchmark dataset of email is collected that involves both text and image datasets. Next, the feature extraction is performed using two sets of features like text features and visual features. In the text features, Term Frequency-Inverse Document Frequency (TF-IDF) is extracted. For the visual features, color correlogram and Gray-Level Co-occurrence Matrix (GLCM) are determined. Since the length of the extracted feature vector seems to the long, the optimal feature selection process is done. The optimal feature selection is performed by a new meta-heuristic algorithm called Fitness Oriented Levy Improvement-based Dragonfly Algorithm (FLI-DA). Once the optimal features are selected, the detection is performed by the hybrid learning technique that is composed of two deep learning approaches named Recurrent Neural Network (RNN) and Convolutional Neural Network (CNN). For improving the performance of existing deep learning approaches, the number of hidden neurons of RNN and CNN is optimized by the same FLI-DA. Finally, the optimized hybrid learning technique having CNN and RNN classifies the data into spam and ham. The experimental outcomes show the ability of the proposed method to perform the spam email classification based on improved deep learning.

Download Full-text

Distributed classification for image spam detection

Multimedia Tools and Applications ◽

10.1007/s11042-017-4944-y ◽

2017 ◽

Vol 77 (11) ◽

pp. 13249-13278 ◽

Cited By ~ 3

Author(s):

Amiza Amir ◽

Bala Srinivasan ◽

Asad I. Khan

Keyword(s):

Spam Detection ◽

Image Spam ◽

Distributed Classification

Download Full-text

Enhancing Multimodal Clustering Framework with Deep Learning to Reveal Image Spam Authorship

10.1109/iri51335.2021.00032 ◽

2021 ◽

Author(s):

Wei-Bang Chen ◽

Yongjin Lu ◽

Zanyah Ailsworth ◽

Xiaoliang Wang ◽

Chengcui Zhang

Keyword(s):

Deep Learning ◽

Image Spam

Download Full-text

PRIVACY PRESERVING SMS SPAM DETECTION WITH DEEP LEARNING MODELS IN DISTRIBUTED ENVIRONMENT

KỶ YẾU HỘI NGHỊ KHOA HỌC CÔNG NGHỆ QUỐC GIA LẦN THỨ XIII NGHIÊN CỨU CƠ BẢN VÀ ỨNG DỤNG CÔNG NGHỆ THÔNG TIN - Proceedings of the 13th National Conference on Fundamental & Applied Information Technology Research ◽

10.15625/vap.2020.00207 ◽

2020 ◽

Author(s):

Tran Anh Tu ◽

Luong The Dung ◽

Huynh Van Nam ◽

Dang Viet Hung

Keyword(s):

Deep Learning ◽

Privacy Preserving ◽

Spam Detection ◽

Learning Models ◽

Distributed Environment

Download Full-text

Image Spam Detection : A Review

Proceedings of the International Conference on Advances in Computer Science and Electronics Engineering ◽

10.3850/978-981-07-1403-1_624 ◽

2012 ◽

Author(s):

M. Kamble ◽

Chhaya Dule

Keyword(s):

Spam Detection ◽

Image Spam

Download Full-text

Image Spam

Advanced Image-Based Spam Detection and Filtering Techniques - Advances in Information Security, Privacy, and Ethics ◽

10.4018/978-1-68318-013-5.ch003 ◽

2017 ◽

pp. 58-89

Keyword(s):

Feature Extraction ◽

Feature Vector ◽

Image Features ◽

Spam Detection ◽

Detection Algorithms ◽

Color Features ◽

Spam Filters ◽

Image Spam ◽

High Level

Spam features represent the unique and special characteristics associated with spam, which are further used to differentiate them from other genuine messages. Each message m is processed by a feature extraction module to represent m in terms of n dimensional feature vector x = (x1, x2, …, xn) containing n features. This feature vector consists of many such features extracted from spam. In case of text based spam filters, a feature can be a word and a feature vector may be composed of various words extracted from spam. Each spam is associated with one feature vector. Based on the characteristics discussed in previous chapter, we will try to extract different features capturing those unique characteristics from image spam, in order to build the robust spam detection algorithms further. These features are broadly classified into high level metadata features, low level image features like color features, grayscale features, texture related features and embedded text related features.

Download Full-text

Image Spam

Advanced Image-Based Spam Detection and Filtering Techniques - Advances in Information Security, Privacy, and Ethics ◽

10.4018/978-1-68318-013-5.ch001 ◽

2017 ◽

pp. 1-27

Keyword(s):

Digital Equipment Corporation ◽

The Internet ◽

Spam Detection ◽

Marketing Manager ◽

Detection Techniques ◽

History Of ◽

Image Spam ◽

Two Sides ◽

E Mail ◽

Digital Equipment

In order to understand the never-ending fights between developers of anti-spam detection techniques and the spammers; it is important to have an insight of the history of spam mails. On May 3, 1978, Gary Thuerk, a marketing manager at Digital Equipment Corporation sent his first mass email to more than 400 customers over the Arpanet in order to promote and sell Digital's new T-Series of VAX systems (Streitfeld, 2003). In this regard, he said, “It's too much work to send everyone an e-mail. So we'll send one e-mail to everyone”. He said with pride, “I was the pioneer. I saw a new way of doing things.” As every coin has two sides, any technology too can be utilized for good and bad intention. At that time, Gary Thuerk would have never dreamt of this method of sending mails to emerge as an area of research in future. Gary Thuerk ended up getting crowned as the father of spam mails instead of the father of e-marketing. In the present scenario, the internet receives 2.5 billion pieces of spam a day by spiritual followers of Thuerk.

Download Full-text

Deep Learning for Image Spam Detection

DeepCapture: Image Spam Detection Using Deep Learning and Data Augmentation

PROTECTOR: An optimized deep learning-based framework for image spam detection and prevention

Analysis of Optimized Machine Learning and Deep Learning Techniques for Spam Detection

Enhancement of email spam detection using improved deep learning algorithms for cyber security

Distributed classification for image spam detection

Enhancing Multimodal Clustering Framework with Deep Learning to Reveal Image Spam Authorship

PRIVACY PRESERVING SMS SPAM DETECTION WITH DEEP LEARNING MODELS IN DISTRIBUTED ENVIRONMENT

Image Spam Detection : A Review

Image Spam

Image Spam

Export Citation Format