A Robust Approach for Effective Spam Detection Using Supervised Learning Techniques

Twitter has changed the way people get information by allowing them to express their opinion and comments on the daily tweets. Unfortunately, due to the high popularity of Twitter, it has become very attractive to spammers. Unlike other types of spam, Twitter spam has become a serious issue in the last few years. The large number of users and the high amount of information being shared on Twitter play an important role in accelerating the spread of spam. In order to protect the users, Twitter and the research community have been developing different spam detection systems by applying different machine-learning techniques. However, a recent study showed that the current machine learning-based detection systems are not able to detect spam accurately because spam tweet characteristics vary over time. This issue is called “Twitter Spam Drift”. In this paper, a semi-supervised learning approach (SSLA) has been proposed to tackle this. The new approach uses the unlabeled data to learn the structure of the domain. Different experiments were performed on English and Arabic datasets to test and evaluate the proposed approach and the results show that the proposed SSLA can reduce the effect of Twitter spam drift and outperform the existing techniques.

Download Full-text

Analysis of Optimized Machine Learning and Deep Learning Techniques for Spam Detection

2021 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS) ◽

10.1109/iemtronics52119.2021.9422508 ◽

2021 ◽

Author(s):

Fahima Hossain ◽

Mohammed Nasir Uddin ◽

Rajib Kumar Halder

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Spam Detection ◽

Learning Techniques

Download Full-text

Effects of Lockdown and Post Lockdown on Covid19 cases across India using Supervised Learning Techniques

2020 11th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON) ◽

10.1109/uemcon51285.2020.9298183 ◽

2020 ◽

Author(s):

Narayana Darapaneni ◽

Amol Kobal ◽

Rohit Chaoji ◽

Rajiv Tiwari ◽

Suman Saurav ◽

...

Keyword(s):

Supervised Learning ◽

Learning Techniques

Download Full-text

Near real-time twitter spam detection with machine learning techniques

International Journal of Computers and Applications ◽

10.1080/1206212x.2020.1751387 ◽

2020 ◽

pp. 1-11 ◽

Cited By ~ 1

Author(s):

Nan Sun ◽

Guanjun Lin ◽

Junyang Qiu ◽

Paul Rimba

Keyword(s):

Machine Learning ◽

Real Time ◽

Machine Learning Techniques ◽

Spam Detection ◽

Learning Techniques

Download Full-text

Fraud Detection in Online Transactions Using Supervised Learning Techniques

Towards Extensible and Adaptable Methods in Computing ◽

10.1007/978-981-13-2348-5_23 ◽

2018 ◽

pp. 309-321

Author(s):

Akshi Kumar ◽

Garima Gupta

Keyword(s):

Supervised Learning ◽

Fraud Detection ◽

Online Transactions ◽

Learning Techniques

Download Full-text

A Comparative Analysis of Supervised Learning Techniques for Pixel Classification in Remote Sensing Images

2018 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET) ◽

10.1109/wispnet.2018.8538518 ◽

2018 ◽

Cited By ~ 1

Author(s):

R. Sivagami ◽

R. Krishankumar ◽

K. S. Ravichandran

Keyword(s):

Remote Sensing ◽

Comparative Analysis ◽

Supervised Learning ◽

Remote Sensing Images ◽

Pixel Classification ◽

Learning Techniques

Download Full-text

Acquiring Sentiment from Twitter using Supervised Learning and Lexicon-based Techniques

Walailak Journal of Science and Technology (WJST) ◽

10.48048/wjst.2018.2731 ◽

2016 ◽

Vol 15 (1) ◽

pp. 63-80

Author(s):

Jitrlada ROJRATANAVIJIT ◽

Preecha VICHITTHAMAROS ◽

Sukanya PHONGSUPHAP

Keyword(s):

Supervised Learning ◽

Mixed Method ◽

Short Length ◽

Writing Systems ◽

Learning Techniques ◽

Average Accuracy ◽

The Difference ◽

Tweet Classification ◽

Source Of Information ◽

Processing Steps

The emergence of Twitter in Thailand has given millions of users a platform to express and share their opinions about products and services, among other subjects, and so Twitter is considered to be a rich source of information for companies to understand their customers by extracting and analyzing sentiment from Tweets. This offers companies a fast and effective way to monitor public opinions on their brands, products, services, etc. However, sentiment analysis performed on Thai Tweets has challenges brought about by language-related issues, such as the difference in writing systems between Thai and English, short-length messages, slang words, and word usage variation. This research paper focuses on Tweet classification and on solving data sparsity issues. We propose a mixed method of supervised learning techniques and lexicon-based techniques to filter Thai opinions and to then classify them into positive, negative, or neutral sentiments. The proposed method includes a number of pre-processing steps before the text is fed to the classifier. Experimental results showed that the proposed method overcame previous limitations from other studies and was very effective in most cases. The average accuracy was 84.80 %, with 82.42 % precision, 83.88 % recall, and 82.97 % F-measure.

Download Full-text

Ecological Interactions and the Netflix Problem

10.1101/089771 ◽

2016 ◽

Cited By ~ 1

Author(s):

Philippe Desjardins-Proulx ◽

Idaline Laigle ◽

Timothée Poisot ◽

Dominique Gravel

Keyword(s):

Machine Learning ◽

Supervised Learning ◽

Random Forests ◽

Species Interactions ◽

Similarity Measures ◽

Theoretical Models ◽

Machine Learning Techniques ◽

Nearest Neighbour ◽

Ecological Interactions ◽

Learning Techniques

0AbstractSpecies interactions are a key component of ecosystems but we generally have an incomplete picture of who-eats-who in a given community. Different techniques have been devised to predict species interactions using theoretical models or abundances. Here, we explore the K nearest neighbour approach, with a special emphasis on recommendation, along with other machine learning techniques. Recommenders are algorithms developed for companies like Netflix to predict if a customer would like a product given the preferences of similar customers. These machine learning techniques are well-suited to study binary ecological interactions since they focus on positive-only data. We also explore how the K nearest neighbour approach can be used with both positive and negative information, in which case the goal of the algorithm is to fill missing entries from a matrix (imputation). By removing a prey from a predator, we find that recommenders can guess the missing prey around 50% of the times on the first try, with up to 881 possibilities. Traits do not improve significantly the results for the K nearest neighbour, although a simple test with a supervised learning approach (random forests) show we can predict interactions with high accuracy using only three traits per species. This result shows that binary interactions can be predicted without regard to the ecological community given only three variables: body mass and two variables for the species’ phylogeny. These techniques are complementary, as recommenders can predict interactions in the absence of traits, using only information about other species’ interactions, while supervised learning algorithms such as random forests base their predictions on traits only but do not exploit other species’ interactions. Further work should focus on developing custom similarity measures specialized to ecology to improve the KNN algorithms and using richer data to capture indirect relationships between species.

Download Full-text

A Comparative Analysis of Machine Learning Techniques for Spam Detection

International Journal of Advanced Research in Science, Communication and Technology ◽

10.48175/ijarsct-1308 ◽

2021 ◽

pp. 657-661

Author(s):

Rashida Ali ◽

Ibrahim Rampurawala ◽

Mayuri Wandhe ◽

Ruchika Shrikhande ◽

Arpita Bhatkar

Keyword(s):

Machine Learning ◽

Natural Language Processing ◽

Comparative Analysis ◽

Natural Language ◽

Language Processing ◽

High Volume ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Spam Detection ◽

Learning Techniques

Internet provides a medium to connect with individuals of similar or different interests creating a hub. Since a huge hub participates on these platforms, the user can receive a high volume of messages from different individuals creating a chaos and unwanted messages. These messages sometimes contain a true information and sometimes false, which leads to a state of confusion in the minds of the users and leads to first step towards spam messaging. Spam messages means an irrelevant and unsolicited message sent by a known/unknown user which may lead to a sense of insecurity among users. In this paper, the different machine learning algorithms were trained and tested with natural language processing (NLP) to classify whether the messages are spam or ham.

Download Full-text

Performance of cache placement using supervised learning techniques in mobile edge networks

IET Networks ◽

10.1049/ntw2.12029 ◽

2021 ◽

Author(s):

Lubna Mohammed ◽

Alagan Anpalagan ◽

Ahmed S. Khwaja ◽

Muhammad Jaseemuddin

Keyword(s):

Supervised Learning ◽

Learning Techniques ◽

Cache Placement ◽

Edge Networks

Download Full-text

A Robust Approach for Effective Spam Detection Using Supervised Learning Techniques

A Semi-Supervised Learning Approach for Tackling Twitter Spam Drift

Analysis of Optimized Machine Learning and Deep Learning Techniques for Spam Detection

Effects of Lockdown and Post Lockdown on Covid19 cases across India using Supervised Learning Techniques

Near real-time twitter spam detection with machine learning techniques

Fraud Detection in Online Transactions Using Supervised Learning Techniques

A Comparative Analysis of Supervised Learning Techniques for Pixel Classification in Remote Sensing Images

Acquiring Sentiment from Twitter using Supervised Learning and Lexicon-based Techniques

Ecological Interactions and the Netflix Problem

A Comparative Analysis of Machine Learning Techniques for Spam Detection

Performance of cache placement using supervised learning techniques in mobile edge networks

Export Citation Format