Sentimen Analisis Publik Terhadap Joko Widodo terhadap wabah Covid-19 menggunakan Metode Machine Learning

Abstract Analyzing public sentiment towards a government policy is no longer impossible, the process of analyzing with data mining is a method that is often used. The Data Mining method is always related to the dataset, with the keywords "Jokowi" and "Covid" twitter allowing us to make tweets in it to be used as a dataset. In data mining for sentiment analysis, techniques such as transform, tokenize, stemming, classification, etc. are very influential on its accuracy. Gata Framework is used for preprocessing, and Rapidminer is also used to analyze and compare three classification methods namely Naive Bayes, Support Vector Machine, and k-NN. And the best value is obtained, the Support Vector Machine with an accuracy of 84.58%, precision 82.14% and recall 85.82%. Keywords: Covid, Jokowi, SVM, K-NN, Naive Bayes Abstrak Menganalisa sentimen publik terhadap suatu kebijakan pemerintah merupakan cara yang tidak lagi mustahil, proses analisa dengan data mining merupakan metode yang sering digunakan. Metode Data Mining selalu berkaitan dengan dataset, dengan kata kunci “Jokowi” dan “Covid” twitter memungkinkan kita menjadikan tweet didalamnya untuk dijadikan dataset. Dalam data mining untuk sentimen analisis, dilakukan teknik seperti transform, tokenize, stemming, classification, dan lain-lain sangat berpengaruh pada akurasinya. Gata Framework digunakan untuk preprocessing, dan Rapidminer juga digunakan untuk menganalisa dan membandingkan tiga metode klasifikasi yaitu Naive Bayes, Support Vector Machine, dan k-NN. Dan dihasilkan nilai terbaik yaitu Support Vector Machine dengan accuracy 84.58%, precision 82.14% dan recall 85.82%. Kata kunci: Covid, Jokowi, SVM, K-NN, Naive Bayes

Download Full-text

KLASIFIKASI SMS SPAM MENGGUNAKAN SUPPORT VECTOR MACHINE

Jurnal Pilar Nusa Mandiri ◽

10.33480/pilar.v15i2.693 ◽

2019 ◽

Vol 15 (2) ◽

pp. 275-280

Author(s):

Agus Setiyono ◽

Hilman F Pardede

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Spam Detection ◽

Support Vector Machine Algorithm ◽

Data Mining Techniques ◽

To Receive

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam. One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.

Download Full-text

Perbandingan Teknik Klasifikasi Neural Network, Support Vector Machine, dan Naive Bayes dalam Mendeteksi Kanker Payudara

BINA INSANI ICT JOURNAL ◽

10.51211/biict.v7i1.1343 ◽

2020 ◽

Vol 7 (1) ◽

pp. 53

Author(s):

Derisma Derisma ◽

Fajri Febrian

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Data Mining ◽

Support Vector Machine ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Accuracy Rate ◽

Cancer Disease ◽

Network Support

Abstrak: Kanker payudara merupakan jenis kanker yang sering ditemukan oleh kebanyakan wanita. Di Indonesia Kanker payudara menempati urutan pertama pada pasien rawat inap di seluruh rumah sakit. Tujuan dari penelitian ini adalah melakukan diagnosis penyakit kanker payudara berbasis komputasi yang dapat menghasilkan bagaimana kondisi kanker seseorang berdasarkan akurasi algoritma. Penelitian ini menggunakan pemrograman orange python dan dataset Wisconsin Breast Cancer untuk pemodelan klasifikasi kanker payudara. Metode data mining yang diterapkan yaitu Neural Network, Support Vector Machine, dan Naive Bayes. Dalam penelitian ini didapat algoritma klasifikasi terbaik yaitu algoritma Kernel SVM dengan tingkat akurasi sebesar 98.9 % dan algoritma terendah yaitu Naive Bayes senilai 96.1 %. Kata kunci: kanker payudara, neural network, support vector machine, naive bayes Abstract: Breast cancer is a type of cancer that mostly found in many women. In Indonesia, breast cancer ranks first in hospitalized patients at every hospital. This study aimed to conduct a computation-based diagnose of breast cancer disease that could produce the state of cancer of an individual based on the accuracy of algorithm. This study used python orange programming and Wisconsin Breast Cancer dataset for a modeling and application of breast cancer classification. The data mining methods that were applied in this study were Neural Network, Support Vector Machine, dan Naive Bayes. In this study, Kernel SVM’s algorithm was the best classification algorithm of breast cancer disease with 98.9 % accuracy rate and Naïve Beyes was the lowest with 96.1 % of accuracy rate. Keywords: breast cancer, neural network, support vector machine, naive bayes

Download Full-text

Analisis Sentimen Twitter terhadap Tokoh Publik dengan Algoritma Naive Bayes dan Support Vector Machine

Simetris Jurnal Teknik Mesin Elektro dan Ilmu Komputer ◽

10.24176/simet.v11i2.4568 ◽

2021 ◽

Vol 11 (2) ◽

pp. 626-636

Author(s):

Tanthy Tawaqalia Widowati ◽

Mujiono Sadikin

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Cross Validation ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Fold Cross Validation

Salah satu media sosial yang berkembang adalah Twitter. Media sosial Twitter mempermudah masyarakat untuk bebas berpendapat melalui cuitan atau biasa disebut dengan tweets. Netizen dengan bebas menyampaikan opini pribadinya untuk topik apapun, termasuk persepsi terhadap tokoh publik. Artikel ini menyajikan hasil penelitian dan analisis sentimen masyarakat (netizen) terhadap tokoh publik, Nadiem Makariem sebagai Menteri Kementerian Pendidikan dan Kebudayaan baru. Penelitian ini menggunakan teknik data mining yang bertujuan untuk membandingkan hasil klasifikasi dari opini masyarakat yang dituliskan di Twitter. Dataset yang digunakan berasal dari tweets dengan kata kunci ”nadiem makariem”, ”kemendikbud” dan ”pak nadiem”. Tools RapidMiner digunakan untuk membantu tahap pre-processing dan klasifikasi menggunakan dua metode yaitu, Naive Bayes dan Support Vector Machine dengan evaluasi k-fold cross-validation. Dari hasil ujicoba diketahui bahwa untuk kasus yang diteliti, metode Naive Bayes menghasilkan kinerja yang lebih baik dengan accuracy 91.48%, precision 89.28% dan recall 91.58%.

Download Full-text

SENTIMENT ANALYSIS OF THE COMMUNITY IN THE TWITTER TO THE 2020 ELECTION IN PANDEMIC COVID-19 BY METHOD NAIVE BAYES CLASSIFIER

Jurnal Teknik Informatika (Jutif) ◽

10.20884/1.jutif.2021.2.2.51 ◽

2021 ◽

Vol 2 (2) ◽

pp. 101-107

Author(s):

Akhmad Muzaki ◽

Arita Witanti

Keyword(s):

Support Vector Machine ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Local Elections ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Public Sentiment

The 2020 regional elections in the midst of the COVID-19 pandemic are starting to get crowded starting from the real world and in cyberspace, especially on Twitter social media. Twitter's existence has been widely used by various communities in recent years. Twitter is one of the media that represents the public response regarding public issu. Ahead of the general election (PEMILU), there are usually some parties who want to know the results of public sentiment or response to the issue, namely academics, intellectuals or even political opponents. Nevertheless, the implementation of local elections is very polemic in the community, therefore this study tries to analyze tweets that talk about issue public, namely the 2020 elections in the wake of the COVID-19 Pandemic. The analysis usually uses the classification of tweets containing public sentiment about the issue. The classification method used in this research is Naive Bayes Classifier (NBC) And Support Vector Machine (SVM). Naive Bayes Classifier is combined with features that can detect weighting using probability. The classification of tweets in this study was obtained based on a combination of two classes namely sentiment class and category class. The classification of sentiment consists of positive and negative. Test results on built-in applications show that accuracy with Naive Bayes delivers better results than Support Vector Machine. However, overall the use of the Naive Bayes method has a good performance to classify tweets with an accuracy rate of 92.2%

Download Full-text

Pemodelan Prediksi Status Keberlanjutan Polis Asuransi Kendaraan dengan Teknik Pemilihan Mayoritas Menggunakan Algoritma-Algoritma Klasifikasi Data Mining

Prosiding Seminar Nasional Teknoka ◽

10.22236/teknoka.v5i.391 ◽

2020 ◽

Vol 5 ◽

pp. 19-24

Author(s):

Dyah Retno Utari ◽

Arief Wibowo

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Decision Tree ◽

Naive Bayes ◽

Confusion Matrix ◽

Naïve Bayes ◽

Majority Voting ◽

Support Vector ◽

F Measure

Asuransi kendaraan bermotor merupakan jenis usaha pertanggungan terhadap kerugian atau risiko kerusakan yang dapat timbul dari berbagai macam potensi kejadian yang menimpa kendaraan. Persaingan dalam bisnis asuransi khususnya untuk kendaraan bermotor menuntut inovasi dan strategi agar keberlangsungan bisnis tetap terjamin. Salah satu upaya yang dapat dilakukan perusahaan adalah memprediksi status keberlanjutan polis asuransi kendaraan dengan menganalisis data-data profil dan transaksi nasabah. Prediksi terhadap keputusan pemegang polis menjadi sangat penting bagi perusahaan, karena dapat menentukan strategi pemasaran yang mempengaruhi keputusan pelanggan untuk pembaharuan polis asuransi. Penelitian ini telah mengusulkan suatu model prediksi status keberlanjutan polis asuransi kendaraan dengan teknik pemilihan mayoritas dari hasil klasifikasi menggunakan algoritma- algoritma data mining seperti Naive Bayes, Support Vector Machine dan Decision Tree. Hasil pengujian menggunakan confusion matrix menunjukkan nilai akurasi terbaik diperoleh sebesar 93,57%, apapun untuk nilai precision mencapai 97,20%, dan nilai recall sebesar 95,20% serta nilai F-Measure sebesar 95,30%. Nilai evaluasi model terbaik dihasilkan menggunakan pendekatan pemilihan mayoritas (majority voting), mengungguli kinerja model prediksi berbasis pengklasifikasi tunggal.

Download Full-text

Perbandingan Metode Naïve Bayes Dan Support Vector Machine Dalam Klasifikasi Penyakit Diabetes Melitus

Journal of Information Technology Ampera ◽

10.51519/journalita.volume1.isssue3.year2020.page133-143 ◽

2020 ◽

Vol 1 (3) ◽

pp. 133-143

Author(s):

Hilda Apriyani ◽

Kurniati Kurniati

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Cross Validation ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Diabetes Melitus

Diabetes melitus merupakan penyakit kronis yang terjadi akibat kadar glukosa didalam darah yang terlalu tinggi sehingga tidak adanya insulin. Dalam kurun waktu data di Rumah Sakit Islam Siti Khadijah Palembang yang dipengaruhi oleh jumlah dari pasien yang melakukan pemeriksaan kesehatan seperti penyakit diabetes melitus sehingga berpengaruh dalam hal klasifikasi data yang akan menyulitkan pihak rumah sakit. Maka dengan memanfaatkan data mining, pengklasifikasian untuk menentukan pasien yang telah melakukan pemeriksaan termasuk penderita penyakit diabetes atau tidak. Dengan adanya permasalahan tersebut maka penulis melakukan analisis perbandingan dari dua algoritma yaitu algoritma naïve bayes dan algoritma support vector machine untuk klasifikasi penyakit diabetes dengan menggunakan alat bantu WEKA dengan tools options Cross Validation dan Confussion Matrix dengan hasil akurasi tertinggi yaitu algoritma support vector machine dengan kernel polynomial yang hasilnya 96.2704% dan tingkat error sebanyak 3.7296% dapat disimpulkan algoritma yang akurat dalam klasifikasi penyakit diabetes yaitu algoritma support vector machine dengan kernel polynomial.

Download Full-text

Perbandingan Teknik Klasifikasi Dalam Data Mining Untuk Bank Direct Marketing

Jurnal Teknologi Informasi dan Ilmu Komputer ◽

10.25126/jtiik.201855958 ◽

2018 ◽

Vol 5 (5) ◽

pp. 567 ◽

Cited By ~ 2

Author(s):

Irvi Oktanisa ◽

Ahmad Afif Supianto

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Random Forest ◽

Gradient Descent ◽

Naive Bayes ◽

Direct Marketing ◽

Naïve Bayes ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Support Vector

Klasifikasi merupakan teknik dalam data mining untuk mengelompokkan data berdasarkan keterikatan data terhadap data sampel. Pada penelitian ini, kami melakukan perbandingan 9 teknik klasifikasi untuk mengklasifikasi respon pelanggan pada dataset Bank Direct Marketing. Perbandingan teknik klasifikasi ini dilakukan untuk mengetahui model dalam teknik klasfikasi yang paling efektif untuk mengklasifikasi target pada dataset Bank Direct Marketing. Teknik klasifikasi yang digunakan yaitu Support Vector Machine, AdaBoost, Naïve Bayes, Constant, KNN, Tree, Random Forest, Stochastic Gradient Descent, dan CN2 Rule. Proses klasifikasi diawali dengan preprocessing data untuk melakukan penghilangan missing value dan pemilihan fitur pada dataset. Pada tahap evaluasi digunakan teknik 10 fold cross validation. Setelah dilakukan pengujian, didapatkan bahwa hasil klasifikasi menunjukkan akurasi terbaik diperoleh oleh model Tree, Constant, Naive Bayes, dan Stochastic Gardient Descent. Kemudian diikuti oleh model Random Forest, K-Nearest Neighbor, CN-2 Rule, AdaBoost dan Support Vector Machine. Dari keempat model yang menunjukkan hasil akurasi terbaik, untuk kasus ini Stochastic Gradient Descent terpilih sebagai model yang memiliki akurasi terbaik dengan nilai akurasi sebesar 0,972 dan hasil visualisasi yang dihasilkan lebih jelas untuk mengklasifikasi target pada dataset Bank Direct Marketing. AbstractClassification is a technique in data mining to classify data based on the attachment of data to the sample data.. In this paper, we present the comparison of 9 classification techniques performed to classify customer response on the dataset of Bank Direct Marketing. The techniques performed to find out the effectiveness model in the classification technique used to classify targets on the dataset of Bank Direct Marketing. The techniques used are Support Vector Machine, AdaBoost, Naïve Bayes, Constant, KNN, Tree, Random Forest, Stochastic Gradient Descent, and CN2 Rule. The classification process begins with preprocessing data to perform missing value omissions and feature selection on the dataset. Cross validation technique, with k value is 10, used in the evaluation stage. After testing, it was found that the classification results showed the best accuracy obtained when using the Tree model, Constant, Naive Bayes and Stochastic Gradient Descent. Afterwards the Random Forest model, K-Nearest Neighbor, CN-2 Rule, AdaBoost, and Support Vector Machine are followed. Of the four models with the high accuracy results, in this case Stochastic Gradient Descent was selected as the best accuracy model with an accuracy value of 0.972 and resulting visualization more clearly to classify targets on the dataset of Bank Direct Marketing.

Download Full-text

Komparasi Algoritma Naïve Bayes dan Support Vectors Machine pada Analisis Sentimen SMS HAM dan SPAM

Infotek : Jurnal Informatika dan Teknologi ◽

10.29408/jit.v4i2.3665 ◽

2021 ◽

Vol 4 (2) ◽

pp. 249-258

Author(s):

Lila Dini Utami ◽

◽

Lestari Yusuf ◽

Dini Nurlaela ◽

◽

...

Keyword(s):

Support Vector Machine ◽

Mobile Phones ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Classification Methods ◽

Phone Number ◽

Support Vectors ◽

Official Information

SMS is a form of communication in the form of messages sent using mobile phones between the designated numbers. SMS is now rarely used because many of the features that have changed are used by chat applications. However, the SMS feature was not removed for one thing, official messages from various applications for leveraging or other official information still use SMS as a sign that the phone number used is there. However, since 2011 there have been so many misuses of this function, so it is suspected that many frauds use SMS as a tool to influence victims. This sms category goes to SMS spam. Therefore, SMS needs to be classified so that users can find out that the SMS is included in the category of Spam or ham (the opposite of spam). Using 400 datasets taken from the UCI repository which is divided into two classes, namely spam and ham, we compare two classification methods, namely Naive Bayes and Support vector Machine in order to get SMS filtering correctly. And after the calculations are done, the accuracy is obtained in Naive Bayes, which is 90.00% Support Vector Machine 81.00%.

Download Full-text

A Comparison of Text Classification Methods k-NN, Naïve Bayes, and Support Vector Machine for News Classification

Jurnal Informatika Jurnal Pengembangan IT ◽

10.30591/jpit.v3i2.828 ◽

2018 ◽

Vol 3 (2) ◽

pp. 157-160 ◽

Cited By ~ 2

Author(s):

Yohan Muliono ◽

Fidelson Tanzil

Keyword(s):

Support Vector Machine ◽

Text Classification ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Classification Methods

Download Full-text

Pemanfaatan Teknik Data Mining Dalam Menentukan Standar Mutu Jagung

CogITo Smart Journal ◽

10.31154/cogito.v5i2.172.171-180 ◽

2019 ◽

Vol 5 (2) ◽

pp. 171

Author(s):

Alfa Saleh ◽

Meilinda Maryam

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector

Sebagai salah satu perusahaan pengolahan pakan teknak berbahan dasar jagung, PT. Indojaya Agrinusa berusaha untuk menjaga kualitas produk-produk terbaik yang akan dipasarkan. tentu hal tersebut tidak terlepas dari usaha untuk meningkatkan mutu ataupun kualitas yang menjadi tolak ukur pelanggan dalam memilih barang atau produk mana yang akan digunakan untuk keperluan masing-masing. Belum adanya penentuan standar mutu jagung yang tersistem sehingga perusahaan lambat dalam menentukan standar mutu jagung serta cukup menghambat proses produksi pakan ternak pada perusahaan tersebut. Pada Penelitian terdahulu, telah dilakukan pengujian menggunakan teknik data mining khususnya metode Naive Bayes dalam menentukan standar mutu jagung, di mana penerapan metode tersebut menghasilkan tingkat akurasi rata-rata sebesar 83.33 %. sementara pada penelitian ini, pengujian dilakukan dengan menggunakan metode Decision Tree (C 4.5) dan Support Vector Machine (SVM) untuk menemukan metode yang lebih efektif dan akurat dalam menentukan mutu jagung sehingga layak untuk diproduksi. Hasil akurasi yang didapatkan oleh metode Decision Tree (C 4.5) meningkat dari hasil pengujian sebelumnya menjadi 86.17%, sementara hasil akurasi yang didapatkan dari penerapan metode Support Vector Machine (SVM) juga meningkat dari hasi pengujian sebelumnya yang menggunakan metode Naïve Bayes menjadi 94.03%. Sehingga dapat disimpulkan bahwa penggunaan metode Support Vector Machine (SVM) lebih baik dibandingkan metode Naïve Bayes dan Decision Tree (C 4.5). Kata kunci—Mutu Jagung, Data Mining, Naïve Bayes, Decision Tree, Support Vector Machine

Download Full-text