Using machine learning to deal with Phishing and Spam Detection

Author(s):  
Oumaima El Kouari ◽  
Hafssa Benaboud ◽  
Saiida Lazaar
Author(s):  
Rashida Ali ◽  
Ibrahim Rampurawala ◽  
Mayuri Wandhe ◽  
Ruchika Shrikhande ◽  
Arpita Bhatkar

Internet provides a medium to connect with individuals of similar or different interests creating a hub. Since a huge hub participates on these platforms, the user can receive a high volume of messages from different individuals creating a chaos and unwanted messages. These messages sometimes contain a true information and sometimes false, which leads to a state of confusion in the minds of the users and leads to first step towards spam messaging. Spam messages means an irrelevant and unsolicited message sent by a known/unknown user which may lead to a sense of insecurity among users. In this paper, the different machine learning algorithms were trained and tested with natural language processing (NLP) to classify whether the messages are spam or ham.


Author(s):  
Niddal Imam ◽  
Biju Issac ◽  
Seibu Mary Jacob

Twitter has changed the way people get information by allowing them to express their opinion and comments on the daily tweets. Unfortunately, due to the high popularity of Twitter, it has become very attractive to spammers. Unlike other types of spam, Twitter spam has become a serious issue in the last few years. The large number of users and the high amount of information being shared on Twitter play an important role in accelerating the spread of spam. In order to protect the users, Twitter and the research community have been developing different spam detection systems by applying different machine-learning techniques. However, a recent study showed that the current machine learning-based detection systems are not able to detect spam accurately because spam tweet characteristics vary over time. This issue is called “Twitter Spam Drift”. In this paper, a semi-supervised learning approach (SSLA) has been proposed to tackle this. The new approach uses the unlabeled data to learn the structure of the domain. Different experiments were performed on English and Arabic datasets to test and evaluate the proposed approach and the results show that the proposed SSLA can reduce the effect of Twitter spam drift and outperform the existing techniques.


2020 ◽  
Vol 8 (6) ◽  
pp. 5326-5329

The current use of social media has created incomparable amounts of social data, as it is a cheap and popular information sharing communication platform. Nowadays, a huge percentage of people depend on the accessible material on social networking in their choices (e.g. comments and suggestions about a subject or product). This feature on exchanging knowledge with a wide number of users has quickly prompted social spammers to exploit the network of confidence to distribute spam messages and support personal forums, advertising, phishing, scams and so on. Identifying these spammers and spam material is a hot subject of study, and while large amounts of experiments have recently been conducted to this end, so far the methodologies are only barely able to identify spam feedback, and none of them demonstrates the value of each derived function type. In this study, we have suggested a machine learning-based spam detection system that determines whether or not a specific message in the dataset is spam using a set of machine learning algorithms. Four main features have been used; including user-behavioral, user-linguistic, reviewbehavioral and review-linguistic, to improve the spam detection process and to gather reliable data


Spam has become one of the growing issues in social media websites. Some of the users in these websites creates spam news. Coming to twitter, Users inject tweets in trending topics and replies with promotional messages providing links. A large amount of spam has been noticied in twitter. It is necessary to identify these spams tweets in a twitter stream. Now a days ,a big part of people rely on content available in social media in their decisions, so detecting and deleting these spam details is very important. A basic framework is suggested to detect malicious account holders in twitter..At present to detect these spam users or accounts there are methods which are based on content based features, Graph based features. The system which is going to be created works on machine learning based algorithms. These algorithms help to give accurate results. In this system algorithm named Naïve Bayes classifier algorithm is going to be used. This algorithm is said to be combination of many other principles relyingupon “Bayes theorem” wherein the methods share a common mode of working.


Sign in / Sign up

Export Citation Format

Share Document