Improving Spam Email Filtering Systems Using Data Mining Techniques

Implementing Computational Intelligence Techniques for Security Systems Design - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-7998-2418-3.ch003 ◽

2020 ◽

pp. 43-72

Author(s):

Wasan Shaker Awad ◽

Wafa M. Rafiq

Keyword(s):

Machine Learning ◽

Data Mining ◽

Genetic Algorithm ◽

Low Cost ◽

High Accuracy ◽

False Positives ◽

Spam Filtering ◽

Spam Filter ◽

Using Data ◽

Email Spam

Email is the most popular choice of communication due to its low-cost and easy accessibility, which makes email spam a major issue. Emails can be incorrectly marked by a spam filter and legitimate emails can get lost in the spam folder or the spam emails can deluge the users' inboxes. Therefore, various methods based on statistics and machine learning have been developed to classify emails accurately. In this chapter, the existing spam filtering methods were studied comprehensively, and a spam email classifier based on the genetic algorithm was proposed. The proposed algorithm was successful in achieving high accuracy by reducing the rate of false positives, but at the same time, it also maintained an acceptable rate of false negatives. The proposed algorithm was tested on 2000 emails from the two popular spam datasets, Enron and LingSpam, and the accuracy was found to be nearly 90%. The results showed that the genetic algorithm is an effective method for spam classification and with further enhancements that will provide a more robust spam filter.

Download Full-text

Unsupervised Approach for Email Spam Filtering using Data Mining

EAI Endorsed Transactions on Energy Web ◽

10.4108/eai.9-3-2021.168962 ◽

2018 ◽

pp. 168962

Author(s):

Mehdi Manaa ◽

Ahmed Obaid ◽

Mohammed Dosh

Keyword(s):

Data Mining ◽

Spam Filtering ◽

Unsupervised Approach ◽

Using Data ◽

Email Spam

Download Full-text

Instant medical care and drug suggestion service using data mining and machine learning based intelligent self-diagnosis medical system

International Journal of Advanced Life Sciences ◽

10.26627/ijals/2017/10.03.0022 ◽

2017 ◽

Vol 10 (03) ◽

pp. 318-325

Author(s):

sudha M

Keyword(s):

Machine Learning ◽

Data Mining ◽

Medical Care ◽

Medical System ◽

Using Data

Download Full-text

Effective Prediction of Heart Disease Using Data Mining and Machine Learning: A Review

2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS) ◽

10.1109/icais50930.2021.9395963 ◽

2021 ◽

Author(s):

Simran Verma ◽

Abhishek Gupta

Keyword(s):

Machine Learning ◽

Data Mining ◽

Heart Disease ◽

Using Data

Download Full-text

A Machine Vision Approach for Bioreactor Foam Sensing

SLAS TECHNOLOGY Translating Life Sciences Innovation ◽

10.1177/24726303211008861 ◽

2021 ◽

pp. 247263032110088

Author(s):

Jonas Austerjost ◽

Robert Söldner ◽

Christoffer Edlund ◽

Johan Trygg ◽

David Pollard ◽

...

Keyword(s):

Machine Learning ◽

Machine Vision ◽

State Of The Art ◽

Low Cost ◽

High Accuracy ◽

Consumer Electronics ◽

Learning System ◽

Automotive Applications ◽

Fine Grained

Machine vision is a powerful technology that has become increasingly popular and accurate during the last decade due to rapid advances in the field of machine learning. The majority of machine vision applications are currently found in consumer electronics, automotive applications, and quality control, yet the potential for bioprocessing applications is tremendous. For instance, detecting and controlling foam emergence is important for all upstream bioprocesses, but the lack of robust foam sensing often leads to batch failures from foam-outs or overaddition of antifoam agents. Here, we report a new low-cost, flexible, and reliable foam sensor concept for bioreactor applications. The concept applies convolutional neural networks (CNNs), a state-of-the-art machine learning system for image processing. The implemented method shows high accuracy for both binary foam detection (foam/no foam) and fine-grained classification of foam levels.

Download Full-text

Using Data Mining Techniques and Genetic Algorithm

Proceedings of the International Conference on Learning and Optimization Algorithms: Theory and Applications - LOPAL '18 ◽

10.1145/3230905.3230915 ◽

2018 ◽

Author(s):

Lamia Berkani ◽

Yanis Chebahi ◽

Lilya Betit

Keyword(s):

Data Mining ◽

Genetic Algorithm ◽

Data Mining Techniques ◽

Using Data

Download Full-text

A Comparative Study to analyze crime threats using data mining and machine learning approach

10.1109/icscan53069.2021.9526489 ◽

2021 ◽

Author(s):

Puninder Kaur ◽

Geeta Rani ◽

Taruna Sharma ◽

Avinash Sharma

Keyword(s):

Machine Learning ◽

Data Mining ◽

Comparative Study ◽

Learning Approach ◽

Machine Learning Approach ◽

Using Data

Download Full-text

Towards Behaviour Recognition with Unlabelled Sensor Data

Human Behavior Recognition Technologies ◽

10.4018/978-1-4666-3682-8.ch005 ◽

2013 ◽

pp. 86-110

Author(s):

Sook-Ling Chua ◽

Stephen Marsland ◽

Hans W. Guesgen

Keyword(s):

Machine Learning ◽

Data Mining ◽

Inverse Problem ◽

Sensor Data ◽

Training Set ◽

Learning Methods ◽

Machine Learning Methods ◽

Using Data ◽

Symbolic Approach ◽

Behaviour Recognition

The problem of behaviour recognition based on data from sensors is essentially an inverse problem: given a set of sensor observations, identify the sequence of behaviours that gave rise to them. In a smart home, the behaviours are likely to be the standard human behaviours of living, and the observations will depend upon the sensors that the house is equipped with. There are two main approaches to identifying behaviours from the sensor stream. One is to use a symbolic approach, which explicitly models the recognition process. Another is to use a sub-symbolic approach to behaviour recognition, which is the focus in this chapter, using data mining and machine learning methods. While there have been many machine learning methods of identifying behaviours from the sensor stream, they have generally relied upon a labelled dataset, where a person has manually identified their behaviour at each time. This is particularly tedious to do, resulting in relatively small datasets, and is also prone to significant errors as people do not pinpoint the end of one behaviour and commencement of the next correctly. In this chapter, the authors consider methods to deal with unlabelled sensor data for behaviour recognition, and investigate their use. They then consider whether they are best used in isolation, or should be used as preprocessing to provide a training set for a supervised method.

Download Full-text

A Survey on Building Recommendation Systems Using Data Mining Techniques

10.4018/978-1-7998-8413-2.ch002 ◽

2022 ◽

pp. 24-56

Author(s):

Rajab Ssemwogerere ◽

Wamwoyo Faruk ◽

Nambobi Mutwalibi

Keyword(s):

Machine Learning ◽

Data Mining ◽

Recommender Systems ◽

Performance Measures ◽

Data Mining Technique ◽

Data Mining Techniques ◽

Learning Hypothesis ◽

Depth Study ◽

And Performance ◽

Using Data

Classification is a data mining technique or approach used to estimate the grouped membership of items on a basis of a common feature. This technique is virtuous for future planning and discovering new knowledge about a specific dataset. An in-depth study of previous pieces of literature implementing data mining techniques in the design of recommender systems was performed. This chapter provides a broad study of the way of designing recommender systems using various data mining classification techniques of machine learning and also exploiting their methodological decisions in four aspects, the recommendation approaches, data mining techniques, recommendation types, and performance measures. This study focused on some selected classification methods and can be so supportive for both the researchers and the students in the field of computer science and machine learning in strengthening their knowledge about the machine learning hypothesis and data mining.

Download Full-text