The Impact of Data Preprocessing on the Performance of a Naive Bayes Classifier

2021 ◽

Vol 12 (03) ◽

pp. 15-24

Author(s):

Swetha Sree Cheeti ◽

Yanyan Li ◽

Ahmad Hadaegh

Keyword(s):

Natural Language ◽

Sentiment Analysis ◽

Education System ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

The World ◽

The Impact

The education system has been gravely affected by the worldwide spread of Covid-19. In this paper we present a thorough sentiment analysis of education-related tweets on the Twitter platform and draw conclusions about the pandemic's impact on people's emotions as it advanced over the months. Over ninety thousand tweets about the changes to education systems around the world were gathered from Twitter. Sentiment analysis was performed on the gathered dataset using Natural Language Toolkit (NLTK) functionality and a Naive Bayes classifier. Based on the results of this analysis, we show the impact of Covid-19 on education and how people's sentiment changed in response to the changes in the education system. We thus aim to provide a better understanding of people's sentiment about education while coping with the pandemic in such unprecedented times.
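
The abstract does not describe the classifier's internals, so the following is only a minimal sketch of how a multinomial Naive Bayes sentiment classifier with add-one (Laplace) smoothing can work. It is written in plain Python rather than with NLTK, and the toy tweets, labels, and function names (`tokenize`, `train`, `classify`) are assumptions made for illustration, not the authors' data or code.

```python
import math
from collections import Counter, defaultdict

def tokenize(text):
    # Lowercase and split on whitespace; a real pipeline would do more cleaning.
    return text.lower().split()

def train(labeled_texts):
    """Return class document counts, per-class word counts, and the vocabulary."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in labeled_texts:
        class_counts[label] += 1
        for token in tokenize(text):
            word_counts[label][token] += 1
            vocab.add(token)
    return class_counts, word_counts, vocab

def classify(text, model):
    """Pick the label with the highest log posterior under the naive assumption."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_counts):
        # Log prior: fraction of training documents carrying this label.
        score = math.log(class_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for token in tokenize(text):
            if token not in vocab:
                continue  # ignore words never seen in training
            # Add-one smoothing keeps unseen (word, label) pairs from zeroing the product.
            score += math.log((word_counts[label][token] + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Hypothetical toy data, invented for this example.
TOY_TWEETS = [
    ("online classes are great and flexible", "positive"),
    ("i love learning from home", "positive"),
    ("remote school is stressful and lonely", "negative"),
    ("i hate losing my campus life", "negative"),
]

model = train(TOY_TWEETS)
print(classify("learning from home is great", model))  # positive
```

Working in log space avoids numeric underflow when many small word probabilities are multiplied, which matters once tweets and vocabularies grow beyond a toy example.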

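IMPLEMENTASI LEXICON BASED DAN NAIVE BAYES PADA ANALISIS SENTIMEN PENGGUNA TWITTER TOPIK PEMILIHAN PRESIDEN 2019 [Implementation of Lexicon-Based and Naive Bayes Methods in Sentiment Analysis of Twitter Users on the Topic of the 2019 Presidential Election]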

Jurnal Ilmiah Informatika Komputer ◽

10.35760/ik.2019.v24i2.2369 ◽

2019 ◽

Vol 24 (2) ◽

pp. 140-153

Author(s):

Gusti Nur Aulia ◽

Eka Patriya

Keyword(s):

Naive Bayes ◽

Confusion Matrix ◽

Web Server ◽

Data Preprocessing ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Data Filtering ◽

Naïve Bayes Classifier ◽

Twitter Data

The presidential election (Pilpres) is currently drawing considerable attention because of the various rumors in circulation. The public is also a target of the political elite, since its votes determine the country's political direction for the next five years. Positive, neutral, and negative opinions alike can give rise to the threat of fake news (hoaxes). Social media, Twitter among them, is one of the channels people use to express their political choices. Data such as public opinion can be processed into useful information, for example through sentiment analysis. This study performs sentiment analysis of Twitter content about the 2019 presidential election. The analysis consists of data acquisition, preprocessing, data classification, data evaluation, and data visualization. Preprocessing comprises case folding, data normalization, filtering, conversion to standard word forms, stopword removal, and stemming. The study applies two methods: a lexicon-based method and a Naïve Bayes classifier. The accuracy of the final results is then computed with a confusion matrix and visualized through a web server. The predicted sentiment is determined using the lexicon-based method and labeling with manual calculation. Training and test data are used to train and test the Naive Bayes classifier. The classification produced by the Naive Bayes classifier is called the actual sentiment. The accuracy of the predicted sentiment against the actual sentiment is computed with a confusion matrix. The resulting accuracy between predicted and actual sentiment is 64.49% on the test data and 94.2% on the training data with the lexicon-based method, and 86.53% on the test data and 94.08% on the training data with manual labeling and the Naive Bayes classifier. The results are expected to support research on public opinion on Twitter about the 2019 presidential election, whether positive, negative, or neutral in sentiment.
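
Some of the steps named in the abstract can be sketched as follows. This is a simplified illustration, not the authors' pipeline: it covers only case folding, filtering, and stopword removal (word normalization and stemming are omitted), and the stopword list, sample tweet, and label sequences are hypothetical.

```python
import re
from collections import Counter

# Hypothetical stopword list for illustration; the paper's list is not given.
STOPWORDS = {"yang", "dan", "di", "ke", "ini", "itu"}

def preprocess(tweet):
    text = tweet.lower()                        # case folding
    text = re.sub(r"https?://\S+", " ", text)   # filtering: drop URLs
    text = re.sub(r"[@#]\w+", " ", text)        # filtering: drop mentions and hashtags
    text = re.sub(r"[^a-z\s]", " ", text)       # filtering: drop digits and punctuation
    return [t for t in text.split() if t not in STOPWORDS]  # stopword removal

def confusion_matrix(actual, predicted):
    """Count how often each (actual, predicted) label pair occurs."""
    return Counter(zip(actual, predicted))

def accuracy(matrix):
    """Share of cases on the matrix diagonal, i.e. where both labels agree."""
    correct = sum(n for (a, p), n in matrix.items() if a == p)
    return correct / sum(matrix.values())

print(preprocess("Pilpres 2019 ini SERU! @akun_contoh https://example.com #pemilu"))
# ['pilpres', 'seru']

# Toy label sequences, invented for this example.
actual = ["positive", "negative", "neutral", "positive", "negative"]
predicted = ["positive", "negative", "positive", "positive", "neutral"]
matrix = confusion_matrix(actual, predicted)
print(matrix[("neutral", "positive")], accuracy(matrix))  # 1 0.6
```

Storing the matrix as pair counts keeps the sketch short for three classes; the off-diagonal entries show which sentiment classes are confused with each other, which a single accuracy figure hides.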

CNB-MRF: Adapting Correlative Naive Bayes Classifier and MapReduce Framework for Big Data Classification

International Review on Computers and Software (IRECOS) ◽

10.15866/irecos.v11i11.10116 ◽

2016 ◽

Vol 11 (11) ◽

pp. 1007 ◽

Cited By ~ 3

Author(s):

Chitrakant Banchhor ◽

N. Srinivasu

Keyword(s):

Big Data ◽

Naive Bayes ◽

Data Classification ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Mapreduce Framework ◽

Big Data Classification

An Approach for the Segmentation of Satellite Images Using Moving KFCM and Naive Bayes Classifier

i-manager’s Journal on Electronics Engineering ◽

10.26634/jele.3.2.2117 ◽

2013 ◽

Vol 3 (2) ◽

pp. 7-15 ◽

Cited By ~ 1

Author(s):

S. Praveena ◽

S.P. Singh ◽

I.V. Muralikrishna ◽

...

Keyword(s):

Satellite Images ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Behavior recognition in rehabilitation training based on modified naive Bayes classifier

Journal of Computer Applications ◽

10.3724/sp.j.1087.2013.03187 ◽

2013 ◽

Vol 33 (11) ◽

pp. 3187-3189

Author(s):

Yi ZHANG ◽

Cong HUANG ◽

Yuan LUO

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Behavior Recognition ◽

Rehabilitation Training

Retrieval Information Using Generalized Vector Space Models And Sentiment Analysis Using Naïve Bayes Classifier For Evaluation Of Lecturers By Students

2020 Fifth International Conference on Informatics and Computing (ICIC) ◽

10.1109/icic50835.2020.9288584 ◽

2020 ◽

Author(s):

Suprianto ◽

Muhammad Fadlan ◽

Muhammad ◽

Yusni Amaliah ◽

Mussallimah

Keyword(s):

Vector Space ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Vector Space Models

Design of agricultural ontology based on levy flight distributed optimization and Naïve Bayes classifier

Sadhana ◽

10.1007/s12046-021-01652-x ◽

2021 ◽

Vol 46 (3) ◽

Author(s):

Deepa Rajendran ◽

S Vigneshwari

Keyword(s):

Naive Bayes ◽

Distributed Optimization ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Lévy Flight ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Levy Flight

Determination of near-fault impulsive signals with multivariate naïve Bayes method

Natural Hazards ◽

10.1007/s11069-021-04755-0 ◽

2021 ◽

Author(s):

Deniz Ertuncay ◽

Giovanni Costa

Keyword(s):

Naive Bayes ◽

Strong Motion ◽

Naïve Bayes ◽

Strike Slip ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Earthquake Physics ◽

Near Fault ◽

A Site

Near-fault ground motions may contain impulsive behavior in velocity records. To calculate the probability of occurrence of impulsive signals, a large dataset is collected from various national data providers and strong-motion databases. The dataset has a large number of parameters that carry information on the earthquake physics, the ruptured faults, the ground motion parameters, and the distance between the station and several parts of the ruptured fault. The relation between the parameters and impulsive signals is calculated. It is found that fault type, moment magnitude, and the distance and azimuth between a site of interest and the surface projection of the ruptured fault are correlated with the impulsiveness of the signals. Separate models are created for strike-slip faults and non-strike-slip faults using the multivariate naïve Bayes classifier method. The naïve Bayes classifier yields the probability of observing impulsive signals. The models have comparable accuracy rates, and they are more consistent across different fault types than previous studies.
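
The probability mentioned above follows from the standard naïve Bayes factorization, sketched here for a two-class model. The symbols are illustrative notation, not taken from the paper: $I$ is the event that a record is impulsive, $\neg I$ that it is not, and $x_1, \dots, x_n$ are the predictor values (for example fault type, moment magnitude, distance, and azimuth).

```latex
P(I \mid x_1, \dots, x_n)
  = \frac{P(I) \prod_{i=1}^{n} P(x_i \mid I)}
         {\sum_{c \in \{I,\, \neg I\}} P(c) \prod_{i=1}^{n} P(x_i \mid c)}
```

The product over $i$ is the "naïve" conditional-independence assumption; the denominator normalizes the two class scores so that the result is a probability between 0 and 1.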

Performance of SMOTE in a random forest and naive Bayes classifier for imbalanced Hepatitis-B vaccination status

Journal of Physics Conference Series ◽

10.1088/1742-6596/1863/1/012073 ◽

2021 ◽

Vol 1863 (1) ◽

pp. 012073

Author(s):

V M Putri ◽

M Masjkur ◽

C Suhaeni

Keyword(s):

Random Forest ◽

Hepatitis B ◽

Naive Bayes ◽

Naïve Bayes ◽

Vaccination Status ◽

Hepatitis B Vaccination ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Efficient Jamming Identification in Wireless Communication: Using Small Sample Data Driven Naive Bayes Classifier

IEEE Wireless Communications Letters ◽

10.1109/lwc.2021.3064843 ◽

2021 ◽

pp. 1-1

Author(s):

Yuxin Shi ◽

Xinjin Lu ◽

Yingtao Niu ◽

Yusheng Li

Keyword(s):

Wireless Communication ◽

Naive Bayes ◽

Naïve Bayes ◽

Small Sample ◽

Data Driven ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Sample Data

Twitter based Sentiment Analysis of Impact of Covid-19 on Education Globaly
