Peringkasan dan Support Vector Machine pada Klasifikasi Dokumen (Summarization and Support Vector Machine in Document Classification)

2017 ◽  
Vol 9 (4) ◽  
pp. 416 ◽  
Author(s):  
Nelly Indriani Widiastuti ◽  
Ednawati Rainarli ◽  
Kania Evita Dewi

Classification is the process of grouping objects that share the same features or characteristics into several classes. Automatic document classification uses the frequency of words that appear in the training data as features. As the number of documents grows, the number of words that appear as features also increases. Summaries are therefore used to reduce the number of words used in classification. The classification uses the multiclass Support Vector Machine (SVM) method, which is considered to have a good reputation in classification. This research tests the effect of using summaries as feature selection in document classification. The summaries reduce the text to 50%. The results show that the summaries did not affect the accuracy of document classification with SVM, but they did improve the accuracy of the Simple Logistic classifier. The classification tests also show that the accuracy of Naïve Bayes Multinomial (NBM) is better than that of SVM.
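The idea of shrinking the feature space through summaries can be sketched in a few lines. The abstract does not say which summarization method was used, so the sentence-scoring rule below (average word frequency) is an assumption chosen only for illustration, as are the function names and the naive sentence splitter.

```python
import re
from collections import Counter


def split_sentences(text):
    """Split on sentence-ending punctuation; a deliberately naive splitter."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tokenize(text):
    """Lowercase the text and return its alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def term_frequencies(text):
    """Word-frequency features: a mapping from each word to its count."""
    return Counter(tokenize(text))


def summarize(text, ratio=0.5):
    """Keep roughly `ratio` of the sentences, in their original order.

    Sentences are ranked by the average document-wide frequency of their
    words. This ranking rule is an assumption, not the paper's method.
    """
    sentences = split_sentences(text)
    if not sentences:
        return ""
    freq = term_frequencies(text)

    def score(sentence):
        tokens = tokenize(sentence)
        return sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0

    keep = max(1, round(len(sentences) * ratio))
    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:keep])  # restore document order
    return " ".join(sentences[i] for i in chosen)
```

Calling `term_frequencies(summarize(document))` instead of `term_frequencies(document)` yields a feature vector drawn from a smaller vocabulary, which is the effect the study evaluates.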

2021 ◽  
Vol 2 (2) ◽  
pp. 101-107
Author(s):  
Akhmad Muzaki ◽  
Arita Witanti

The 2020 regional elections, held in the midst of the COVID-19 pandemic, drew growing attention both in the real world and in cyberspace, especially on the social media platform Twitter. Twitter has been widely used by various communities in recent years and is one of the media that represents the public response to public issues. Ahead of a general election (PEMILU), some parties usually want to know the public sentiment or response to the issue, namely academics, intellectuals, or even political opponents. The implementation of the local elections was highly polemical in the community; this study therefore analyzes tweets that discuss a public issue, namely the 2020 elections during the COVID-19 pandemic. Such analysis usually classifies tweets that contain public sentiment about the issue. The classification methods used in this research are the Naive Bayes Classifier (NBC) and the Support Vector Machine (SVM). The Naive Bayes Classifier is combined with features that can detect weighting using probability. The classification of tweets in this study is based on a combination of two classes, namely a sentiment class and a category class. The sentiment classification consists of positive and negative. Test results on the application that was built show that Naive Bayes delivers better accuracy than the Support Vector Machine. Overall, the Naive Bayes method performs well in classifying tweets, with an accuracy rate of 92.2%.
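As a rough illustration of how a Naive Bayes classifier assigns a sentiment label from word probabilities, here is a minimal multinomial variant with Laplace smoothing. The abstract does not describe the exact variant or weighting scheme used, so the class name `NaiveBayes`, the smoothing parameter `alpha`, and the whitespace tokenization are all assumptions made for this sketch.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Multinomial Naive Bayes over whitespace-separated, lowercased words."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # Laplace smoothing constant (assumed value)

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for doc, label in zip(docs, labels):
            self.word_counts[label].update(doc.lower().split())
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def log_scores(self, doc):
        """Return log P(class) + sum of log P(word | class) for each class."""
        total_docs = sum(self.class_counts.values())
        scores = {}
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            score = math.log(n_docs / total_docs)
            for word in doc.lower().split():
                if word in self.vocab:  # words unseen in training are skipped
                    score += math.log((counts[word] + self.alpha) / denom)
            scores[label] = score
        return scores

    def predict(self, doc):
        scores = self.log_scores(doc)
        return max(scores, key=scores.get)
```

A model would be trained with `NaiveBayes().fit(tweets, sentiments)` and queried with `predict(tweet)`. The same class could be fitted a second time on category labels to cover the second of the two class types the study combines.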


2020 ◽  
Vol 5 (2) ◽  
pp. 211-220 ◽  
Author(s):  
Hermanto Hermanto ◽  
Ali Mustopa ◽  
Antonius Yadi Kuntoro

Service in the world of education is an important element in creating an academic atmosphere conducive to a successful teaching and learning process. Services to students tend to be delivered without following the minimum service standards that must be provided, so students tend to complain about them. The submission of criticism, complaints, input, or suggestions regarding dissatisfaction and problems in the university environment is still very limited. Complaints can be constructive if submitted to the right place and party. This research processes data from student email complaints submitted to the academic student body (students.bsi.ac.id). The student complaint data to be processed is in the form of an *.xls complaint file. Before the text data is analyzed using text mining methods, text pre-processing is needed, including tokenizing, case folding, stopword removal, and stemming. After pre-processing, the classification method is applied to classify each complaint category, dividing the status into two classes, namely complaint and not complaint, so that the status becomes a normal condition in text mining research. The purpose of this study is to find the most accurate algorithm for classifying student complaints by comparing the classification results of the Naïve Bayes and Support Vector Machine algorithms. The performance of the two algorithms was measured using cross-validation, a confusion matrix, and ROC curves. The Support Vector Machine algorithm achieved the higher accuracy: on the student academic dataset (students.bsi.ac.id), it reached an accuracy of 84.45% and an AUC of 0.922, while the Naïve Bayes algorithm had an accuracy of about 69.75% and an AUC of 0.679.
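The four pre-processing steps named above can be sketched as a small pipeline. The stopword list and the suffix-stripping stemmer here are hypothetical stand-ins written in English for readability; the abstract does not specify the tools or the language resources the authors used, and a real system would need a stopword list and stemmer suited to the language of the complaints.

```python
import re

# Hypothetical stopword list and suffix rules, for illustration only.
STOPWORDS = {"the", "is", "a", "an", "to", "of", "and", "my", "in"}
SUFFIXES = ("ing", "ed", "s")


def tokenize(text):
    """Tokenizing: split raw text into alphanumeric tokens."""
    return re.findall(r"[A-Za-z0-9]+", text)


def case_fold(tokens):
    """Case folding: lowercase every token."""
    return [t.lower() for t in tokens]


def remove_stopwords(tokens):
    """Stopword removal: drop very common words that carry little meaning."""
    return [t for t in tokens if t not in STOPWORDS]


def stem(token):
    """Stemming: strip the first matching suffix, keeping at least 3 letters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Run the four steps in the order listed in the abstract."""
    tokens = remove_stopwords(case_fold(tokenize(text)))
    return [stem(t) for t in tokens]
```

The output of `preprocess` is a list of normalized tokens that could then be counted into features for a Naïve Bayes or SVM classifier.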

