Improvised Spam Detection in Twitter Data Using Lightweight Detectors and Classifiers

Receiving spam messages is one of the most serious issues in social media, especially in Twitter, which is a widely used platform to reflect the opinions and emotions of an individual publicly as well as focused to a specific group of members with similar thoughts or discussion topic. In such focused discussion groups, getting spam message through social media sites is the most annoying issue. In this paper, a system is developed to detect spam tweets by using four lightweight detectors, namely blacklist domain detector, near duplicate detector, reliable ham detector, and multiclass detector. The detected tweets are then classified using ensemble classifiers such as naïve Bayes, logistic regression, and random forest. Voting method is applied to decide the labels for the tweets obtained after classification process. The proposed system has achieved an accuracy of 79% to detect spam tweets with the help of naïve Bayes classifier method and the value seems to be optimizing further with the availability of more sample data.

Download Full-text

Efficient Jamming Identification in Wireless Communication: Using Small Sample Data Driven Naive Bayes Classifier

IEEE Wireless Communications Letters ◽

10.1109/lwc.2021.3064843 ◽

2021 ◽

pp. 1-1

Author(s):

Yuxin Shi ◽

Xinjin Lu ◽

Yingtao Niu ◽

Yusheng Li.

Keyword(s):

Wireless Communication ◽

Naive Bayes ◽

Naïve Bayes ◽

Small Sample ◽

Data Driven ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Sample Data

Download Full-text

The Sentiment Analysis Reviewing Indosat Services from Twitter Using the Naive Bayes Classifier

Journal of Applied Computer Science and Technology ◽

10.52158/jacost.v1i2.79 ◽

2020 ◽

Vol 1 (2) ◽

pp. 61-66

Author(s):

Febri Astiko ◽

Achmad Khodar

Keyword(s):

Machine Learning ◽

Social Media ◽

Sentiment Analysis ◽

Naive Bayes ◽

Learning Model ◽

Naïve Bayes ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Machine Learning Model ◽

Bayes Algorithm

This study aims to design a machine learning model of sentiment analysis on Indosat Ooredoo service reviews on social media twitter using the Naive Bayes algorithm as a classifier of positive and negative labels. This sentiment analysis uses machine learning to get patterns an model that can be used again to predict new data.

Download Full-text

Sentiment Analysis Of Online Lecture Opinions On Twitter Social Media Using Naive Bayes Classifier

10.1109/icomitee53461.2021.9650135 ◽

2021 ◽

Author(s):

Devi Ajeng Damaratih

Keyword(s):

Social Media ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Online Lecture

Download Full-text

Mining Social Media Data of Philippine Higher Education Institutions Using Naive Bayes Classifier Algorithm

SSRN Electronic Journal ◽

10.2139/ssrn.3379025 ◽

2019 ◽

Author(s):

Joey Aviles ◽

Rosanna Esquivel

Keyword(s):

Higher Education ◽

Social Media ◽

Naive Bayes ◽

Higher Education Institutions ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Social Media Data ◽

Media Data

Download Full-text

Implementation of Text Mining Model to Emotions Detection on Social Media Comments Using Particle Swarm Optimization and Naive Bayes Classifier

2019 7th International Conference on Cyber and IT Service Management (CITSM) ◽

10.1109/citsm47753.2019.8965382 ◽

2019 ◽

Author(s):

Erfian Junianto ◽

Rizal Rachman

Keyword(s):

Social Media ◽

Particle Swarm Optimization ◽

Text Mining ◽

Naive Bayes ◽

Particle Swarm ◽

Naïve Bayes ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Swarm Optimization ◽

Mining Model

Download Full-text

Hybrid approach: naive bayes and sentiment VADER for analyzing sentiment of mobile unboxing video comments

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v9i5.pp4452-4459 ◽

2019 ◽

Vol 9 (5) ◽

pp. 4452 ◽

Cited By ~ 1

Author(s):

Chaithra V. D

Keyword(s):

Social Media ◽

Mobile Phone ◽

Mobile Phones ◽

Naive Bayes ◽

Learning Algorithm ◽

Hybrid Approach ◽

Naïve Bayes ◽

Bayes Classifier ◽

Video Sharing ◽

Social Media Site

<p align="justify">Revolution in social media has attracted the users towards video sharing sites like YouTube. It is the most popular social media site where people view, share and interact by commenting on the videos. There are various types of videos that are shared by the users like songs, movie trailers, news, entertainment etc. Nowadays the most trending videos is the unboxing videos and in particular unboxing of mobile phones which gets more views, likes/dislikes and comments. Analyzing the comments of the mobile unboxing videos provides the opinion of the viewers towards the mobile phone. Studying the sentiment expressed in these comments show if the mobile phone is getting positive or negative feedback. A Hybrid approach combining the lexicon approach Sentiment VADER and machine learning algorithm Naive Bayes is applied on the comments to predict the sentiment. Sentiment VADER has a good impact on the Naive Bayes classifier in predicting the sentiment of the comment. The classifier achieves an accuracy of 79.78% and F1 score of 83.72%.</p>

Download Full-text

Job Seeker Profile Classification of Twitter Data Using the Naïve Bayes Classifier Algorithm Based on the DISC Method

2019 4th International Conference on Information Technology, Information Systems and Electrical Engineering (ICITISEE) ◽

10.1109/icitisee48480.2019.9003963 ◽

2019 ◽

Author(s):

Anggit Dwi Hartanto ◽

Ema Utami ◽

Sumarni Adi ◽

Harish Setyo Hudnanto

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Twitter Data

Download Full-text

Knowing Personality Traits on Facebook Status Using the Naïve Bayes Classifier

International Journal of Artificial Intelligence & Robotics (IJAIR) ◽

10.25139/ijair.v2i1.2636 ◽

2020 ◽

Vol 2 (1) ◽

pp. 22

Author(s):

Mohammad Zoqi Sarwani ◽

Muhammad Shubkhan Salafudin ◽

Dian Ahkam Sani

Keyword(s):

Social Media ◽

Big Five ◽

Naive Bayes ◽

Naïve Bayes ◽

Training Data ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Surrounding Environment ◽

Testing Data

With the development of social media trends among students by using Facebook social media, students can communicate and pour out everything that is felt in the form of status. Personality is the character or various characters of a person - therefore, how a person to adjust to the surrounding environment for the achievement of communication smoothly. In the personality category, many things classify a person's category in the psychologist theory. In this exercise, the Big Five, the psychologist theory, is described in five codes, namely Openness, Conscientiousness, Extraversion, Agreeables, Neuroticism. Naive Bayes Classifier is used to determine the highest probability value with the aim to determine the highest value. The data used are two namely training data and testing data obtained from the Facebook status of students. From the data obtained can be tested in the system that the accuracy value is 88%.

Download Full-text

IMPLEMENTASI LEXICON BASED DAN NAIVE BAYES PADA ANALISIS SENTIMEN PENGGUNA TWITTER TOPIK PEMILIHAN PRESIDEN 2019

Jurnal Ilmiah Informatika Komputer ◽

10.35760/ik.2019.v24i2.2369 ◽

2019 ◽

Vol 24 (2) ◽

pp. 140-153

Author(s):

Gusti Nur Aulia ◽

Eka Patriya

Keyword(s):

Naive Bayes ◽

Confusion Matrix ◽

Web Server ◽

Data Preprocessing ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Data Filtering ◽

Naïve Bayes Classifier ◽

Twitter Data

Pilpres saat ini cukup menyita perhatian, karena berbagai rumor yang beredar. Masyarakat juga menjadi sasaran elit politik, dimana suara mereka merupakan penentu keberlangsungan arah politik untuk lima tahun kedepan. Opini-opini positif, netral maupun negatif dapat menimbulkan ancaman munculnya berita bohong (hoax). Salah satu sarana yang digunakan masyarakat dalam mengekspresikan pilihan politiknya adalah melalui media sosial salah satunya twitter. Data seperti opini publik dapat diolah menjadi sebuah informasi yang bermanfaat, salah satunya melalui analisis sentimen. Pada penelitian ini, akan dilakukan analisis sentimen pada Twitter tentang pemilihan presiden 2019. Tahapan analisis sentimen pada penelitian ini terdiri dari akuisisi data, pre-processing, klasifikasi data, evaluasi data dan visualisasi data. Preprocessing dilakukan dengan case folding, normalisasi data, filtering, ubah kata baku, stopword dan stemming. Penelitian ini melakukan 2 metode yaitu dengan metode Lexicon Based dan Naïve Bayes Classifier. Hasil akhir dari analisis kemudian dihitung nilai akurasi menggunakan confusion matrix dan di visualisasikan menggunakan web server. Penentuan sentimen prediksi dilakukan menggunakan metode Lexicon Based dan Labelisasi dengan perhitungan secara manual. Data latih dan data uji akan digunakan dalam proses pelatihan dan pengujian menggunakan Naive Bayes Classiﬁer. Hasil klasiﬁkasi yang dilakukan oleh metode Naive Bayes Classiﬁer disebut sentimen aktual. Perhitungan tingkat keakurasian antara sentimen prediksi terhadap sentimen aktual menggunakan pengujian confusion matrix. Hasil yang didapatkan adalah tingkat akurasi antara sentimen prediksi dan sentimen aktual dengan Lexicon Based sebesar 64,49% pada data uji dan pada data latih sebanyak 94,2% serta dengan menggunakan Labelisasi dan Naive Bayes Classiﬁer sebesar 86,53% pada data uji dan data latih sebesar 94,08%. Hasil penelitian ini diharapkan dapat membantu melakukan riset atas opini masyarakat pada Twitter mengenai Pilpres 2019 yang mengandung sentimen positif, negatif atau netral.

Download Full-text