scholarly journals Komparasi Algoritma Naïve Bayes dan Support Vectors Machine pada Analisis Sentimen SMS HAM dan SPAM

2021 ◽  
Vol 4 (2) ◽  
pp. 249-258
Author(s):  
Lila Dini Utami ◽  
◽  
Lestari Yusuf ◽  
Dini Nurlaela ◽  
◽  
...  

SMS is a form of communication in the form of messages sent using mobile phones between the designated numbers. SMS is now rarely used because many of the features that have changed are used by chat applications. However, the SMS feature was not removed for one thing, official messages from various applications for leveraging or other official information still use SMS as a sign that the phone number used is there. However, since 2011 there have been so many misuses of this function, so it is suspected that many frauds use SMS as a tool to influence victims. This sms category goes to SMS spam. Therefore, SMS needs to be classified so that users can find out that the SMS is included in the category of Spam or ham (the opposite of spam). Using 400 datasets taken from the UCI repository which is divided into two classes, namely spam and ham, we compare two classification methods, namely Naive Bayes and Support vector Machine in order to get SMS filtering correctly. And after the calculations are done, the accuracy is obtained in Naive Bayes, which is 90.00% Support Vector Machine 81.00%.

2020 ◽  
Vol 20 (2) ◽  
pp. 167-176
Author(s):  
Sisferi Hikmawan ◽  
Amsal Pardamean ◽  
Siti Nur Khasanah

Abstract   Analyzing public sentiment towards a government policy is no longer impossible, the process of analyzing with data mining is a method that is often used. The Data Mining method is always related to the dataset, with the keywords "Jokowi" and "Covid" twitter allowing us to make tweets in it to be used as a dataset. In data mining for sentiment analysis, techniques such as transform, tokenize, stemming, classification, etc. are very influential on its accuracy. Gata Framework is used for preprocessing, and Rapidminer is also used to analyze and compare three classification methods namely Naive Bayes, Support Vector Machine, and k-NN. And the best value is obtained, the Support Vector Machine with an accuracy of 84.58%, precision 82.14% and recall 85.82%.   Keywords: Covid, Jokowi, SVM, K-NN, Naive Bayes   Abstrak   Menganalisa sentimen publik terhadap suatu kebijakan pemerintah merupakan cara yang tidak lagi mustahil, proses analisa dengan data mining merupakan metode yang sering digunakan. Metode Data Mining selalu berkaitan dengan dataset, dengan kata kunci “Jokowi” dan “Covid” twitter memungkinkan kita menjadikan tweet didalamnya untuk dijadikan dataset. Dalam data mining untuk sentimen analisis, dilakukan teknik seperti transform, tokenize, stemming, classification, dan lain-lain sangat berpengaruh pada akurasinya. Gata Framework digunakan untuk preprocessing, dan Rapidminer juga digunakan untuk menganalisa dan membandingkan tiga metode klasifikasi yaitu Naive Bayes, Support Vector Machine, dan k-NN. Dan dihasilkan nilai terbaik yaitu Support Vector Machine dengan accuracy 84.58%, precision 82.14% dan recall 85.82%.      Kata kunci: Covid, Jokowi, SVM, K-NN, Naive Bayes  


2019 ◽  
Vol 15 (2) ◽  
pp. 275-280
Author(s):  
Agus Setiyono ◽  
Hilman F Pardede

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam.  One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.


2018 ◽  
Vol 4 (10) ◽  
pp. 6
Author(s):  
Shivangi Bhargava ◽  
Dr. Shivnath Ghosh

News popularity is the maximum growth of attention given for particular news article. The popularity of online news depends on various factors such as the number of social media, the number of visitor comments, the number of Likes, etc. It is therefore necessary to build an automatic decision support system to predict the popularity of the news as it will help in business intelligence too. The work presented in this study aims to find the best model to predict the popularity of online news using machine learning methods. In this work, the result analysis is performed by applying Co-relation algorithm, particle swarm optimization and principal component analysis. For performance evaluation support vector machine, naïve bayes, k-nearest neighbor and neural network classifiers are used to classify the popular and unpopular data. From the experimental results, it is observed that support vector machine and naïve bayes outperforms better with co-relation algorithm as well as k-NN and neural network outperforms better with particle swarm optimization.


2020 ◽  
Vol 4 (2) ◽  
pp. 362-369
Author(s):  
Sharazita Dyah Anggita ◽  
Ikmah

The needs of the community for freight forwarding are now starting to increase with the marketplace. User opinion about freight forwarding services is currently carried out by the public through many things one of them is social media Twitter. By sentiment analysis, the tendency of an opinion will be able to be seen whether it has a positive or negative tendency. The methods that can be applied to sentiment analysis are the Naive Bayes Algorithm and Support Vector Machine (SVM). This research will implement the two algorithms that are optimized using the PSO algorithms in sentiment analysis. Testing will be done by setting parameters on the PSO in each classifier algorithm. The results of the research that have been done can produce an increase in the accreditation of 15.11% on the optimization of the PSO-based Naive Bayes algorithm. Improved accuracy on the PSO-based SVM algorithm worth 1.74% in the sigmoid kernel.


2020 ◽  
Vol 4 (3) ◽  
pp. 504-512
Author(s):  
Faried Zamachsari ◽  
Gabriel Vangeran Saragih ◽  
Susafa'ati ◽  
Windu Gata

The decision to move Indonesia's capital city to East Kalimantan received mixed responses on social media. When the poverty rate is still high and the country's finances are difficult to be a factor in disapproval of the relocation of the national capital. Twitter as one of the popular social media, is used by the public to express these opinions. How is the tendency of community responses related to the move of the National Capital and how to do public opinion sentiment analysis related to the move of the National Capital with Feature Selection Naive Bayes Algorithm and Support Vector Machine to get the highest accuracy value is the goal in this study. Sentiment analysis data will take from public opinion using Indonesian from Twitter social media tweets in a crawling manner. Search words used are #IbuKotaBaru and #PindahIbuKota. The stages of the research consisted of collecting data through social media Twitter, polarity, preprocessing consisting of the process of transform case, cleansing, tokenizing, filtering and stemming. The use of feature selection to increase the accuracy value will then enter the ratio that has been determined to be used by data testing and training. The next step is the comparison between the Support Vector Machine and Naive Bayes methods to determine which method is more accurate. In the data period above it was found 24.26% positive sentiment 75.74% negative sentiment related to the move of a new capital city. Accuracy results using Rapid Miner software, the best accuracy value of Naive Bayes with Feature Selection is at a ratio of 9:1 with an accuracy of 88.24% while the best accuracy results Support Vector Machine with Feature Selection is at a ratio of 5:5 with an accuracy of 78.77%.


This research work is based on the diabetes prediction analysis. The prediction analysis technique has the three steps which are dataset input, feature extraction and classification. In this previous system, the Support Vector Machine and naïve bayes are applied for the diabetes prediction. In this research work, voting based method is applied for the diabetes prediction. The voting based method is the ensemble based which is applied for the diabetes prediction method. In the voting method, three classifiers are applied which are Support Vector Machine, naïve bayes and decision tree classifier. The existing and proposed methods are implemented in python and results in terms of accuracy, precision-recall and execution time. It is analyzed that voting based method give high performance as compared to other classifiers.


Sign in / Sign up

Export Citation Format

Share Document