IMPLEMENTASI ALGORITMA MULTICLASS SVM PADA OPINI PUBLIK BERBAHASA INDONESIA DI TWITTER

Classification in text mining can be carried out with various methods, one of which is the Support Vector Machine (SVM). SVM separates two classes of data with a linear function in a high-dimensional feature space by finding the best separating hyperplane, that is, the one with the maximum margin, using a kernel to map the input space to the feature space. SVM has since been extended by combining data from several classes into a single optimization formulation. This addresses the problem in this study, where the number of classes exceeds two, and two multiclass approaches are tested: SVM One Against One and One Against Rest. The data are Indonesian-language public opinions collected from Twitter, 2000 records in total, concerning cellular telecommunication networks and BPJS services. The results show that, of the multiclass SVM methods assessed, SVM One Against Rest achieved the higher accuracy, by a difference of 0.06, for three-class classification (positive, negative, and neutral). It can be concluded that classification with more than two classes can be carried out with SVM through the One Against One and One Against Rest approaches, with better accuracy values.
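The two multiclass strategies named in this abstract differ in how they break a three-class problem into binary SVM problems. The sketch below only illustrates that decomposition and a simple majority vote; the function names and the English class labels are assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations

# Illustrative labels for the three sentiment classes (assumed names).
CLASSES = ["positive", "negative", "neutral"]


def one_against_one_tasks(classes):
    """One binary problem per unordered pair of classes: k*(k-1)/2 in total."""
    return list(combinations(classes, 2))


def one_against_rest_tasks(classes):
    """One binary problem per class (that class vs. all others): k in total."""
    return [(c, [o for o in classes if o != c]) for c in classes]


def one_against_one_vote(pairwise_winners):
    """Combine pairwise decisions by majority vote."""
    return Counter(pairwise_winners).most_common(1)[0][0]


if __name__ == "__main__":
    print(one_against_one_tasks(CLASSES))
    print(one_against_rest_tasks(CLASSES))
    # Hypothetical winners of the three pairwise classifiers for one tweet.
    print(one_against_one_vote(["positive", "neutral", "neutral"]))
```

With three classes both strategies happen to need three binary classifiers; the counts diverge as the number of classes grows.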

Multiclass SVM Algorithm for Sarcasm Text in Twitter

JATISI (Jurnal Teknik Informatika dan Sistem Informasi) ◽

10.35957/jatisi.v8i1.646 ◽

2021 ◽

Vol 8 (1) ◽

pp. 118-128

Author(s):

Debby Alita

Keyword(s):

Support Vector Machine ◽

Text Mining ◽

Support Vector ◽

Svm Algorithm ◽

Multiclass Svm

Research in text mining is increasingly common because industries and public figures want information from social media about public opinion on products or individuals, whether it is expressed as ordinary opinion or as sarcasm. Many classification methods can be used in text mining. One of them is the Support Vector Machine, which can be optimized to classify data into three classes using SVM One Against One and One Against Rest. The study uses 2072 records taken from Twitter. The accuracy obtained was nearly the same whether or not the data were randomized: 60.82% with randomization and 60.93% without. On the other measures (precision, recall, and F1 score), SVM One Against Rest outperformed SVM One Against One.
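The measures reported in this abstract (accuracy, precision, recall, and F1 score) can be computed per class from true and predicted labels. The following is a minimal sketch with made-up labels; the function names and the toy data are assumptions for illustration and do not reproduce the paper's results.

```python
def per_class_scores(y_true, y_pred, label):
    """Return (precision, recall, f1) for one class, treating it as 'positive'."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == label and p == label for t, p in pairs)
    fp = sum(t != label and p == label for t, p in pairs)
    fn = sum(t == label and p != label for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


if __name__ == "__main__":
    # Toy labels, invented for this example only.
    y_true = ["pos", "pos", "neg", "neg", "neu", "neu"]
    y_pred = ["pos", "neg", "neg", "neg", "neu", "pos"]
    print("accuracy:", accuracy(y_true, y_pred))
    for label in ("pos", "neg", "neu"):
        print(label, per_class_scores(y_true, y_pred, label))
```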

Support Vector Machine VS Information Gain: Analisis Sentimen Cyberbullying di Twitter Indonesia

Jurnal ULTIMA InfoSys ◽

10.31937/si.v11i2.1740 ◽

2020 ◽

Vol 11 (2) ◽

pp. 107-111

Author(s):

Christevan Destitus ◽

Wella Wella ◽

Suryasari Suryasari

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

Text Mining ◽

Information Gain ◽

Text Processing ◽

Support Vector ◽

Term Weighting ◽

System Process ◽

Research Stage

This study aims to classify tweets on Twitter using the Support Vector Machine and Information Gain methods. The classification itself aims to find a hyperplane that separates the negative and positive classes. The research stage includes a system process consisting of text mining and text processing, with the stages of tokenizing, filtering, stemming, and term weighting. Feature selection is then performed with information gain, which calculates the entropy value of each word. Finally, tweets are classified on the selected features, and the output identifies whether a tweet is bullying or not. The study found that the Support Vector Machine and Information Gain methods give sufficiently good results.
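Information gain scores a word by how much knowing whether it is present reduces the entropy of the class labels. The sketch below uses the standard entropy-based definition on a tiny invented corpus; the function names, tokens, and labels are assumptions for illustration, and the paper's exact weighting may differ.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (base 2) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(docs, labels, term):
    """Entropy of the labels minus the entropy after splitting on term presence."""
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without_term = [y for d, y in zip(docs, labels) if term not in d]
    n = len(labels)
    conditional = (len(with_term) / n) * entropy(with_term) \
        + (len(without_term) / n) * entropy(without_term)
    return entropy(labels) - conditional


if __name__ == "__main__":
    # Each document is a set of tokens; the data are invented.
    docs = [{"you", "idiot"}, {"nice", "day"}, {"idiot", "loser"}, {"good", "day"}]
    labels = ["bully", "not", "bully", "not"]
    for term in ("idiot", "you"):
        print(term, round(information_gain(docs, labels, term), 4))
```

Words with the highest scores would be kept as features before training the classifier.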

Sentiment Classification

Advances in Linguistics and Communication Studies - Modern Computational Models of Semantic Discovery in Natural Language ◽

10.4018/978-1-4666-8690-8.ch001 ◽

2015 ◽

pp. 1-26

Author(s):

Jalel Akaichi

Keyword(s):

Support Vector Machine ◽

Text Mining ◽

Sentiment Analysis ◽

Training Model ◽

Sentiment Classification ◽

The Other ◽

Support Vector ◽

Analysis Techniques ◽

Sentiment Lexicon ◽

And Behavior

In this work, we focus on the application of text mining and sentiment analysis techniques to analyze Tunisian users' status updates on Facebook. We aim to extract useful information about their sentiment and behavior, especially during the “Arab Spring” era. To achieve this, we describe a method for sentiment analysis that uses the Support Vector Machine and Naïve Bayes algorithms and applies a combination of more than two features. The output of this work consists, on the one hand, of a sentiment lexicon built from the emoticon and acronym lexicons that we developed from the extracted status updates and, on the other hand, of detailed comparative experiments between the two algorithms, carried out by creating a training model for sentiment classification.

Implementation of Integrated Bayes Formula and Support Vector Machine for Analysing Airline’s Passengers Review

E3S Web of Conferences ◽

10.1051/e3sconf/202020215004 ◽

2020 ◽

Vol 202 ◽

pp. 15004

Author(s):

Aditya Tegar Satria ◽

Mustafid ◽

Dinar Mutiara Kusumo Nugraheni

Keyword(s):

Support Vector Machine ◽

Information System ◽

Text Mining ◽

Input Data ◽

Tourism Industry ◽

Support Vector ◽

Accuracy Score ◽

Bayes Formula ◽

Performance Results ◽

Mining Algorithms

Nowadays, the Internet of Things (IoT) is commonly used in the tourism industry, including aviation, where passengers of flight services can rate their satisfaction with the products and services they use by writing text reviews on many popular websites. These passenger reviews are potential big data and can be analyzed to extract meaningful information. Some text mining algorithms are already in common use, including the Bayes formula and Support Vector Machine methods. This research proposes an implementation of the Bayes and SVM methods in which the algorithms operate independently yet are integrated with other modules, such as data input and text pre-processing, and present the output concisely in a single information system. The proposed system was given 1000 passenger review documents as input data. After pre-processing, the Bayes formula was used to classify the reviews into 5 categories: plane condition, flight comfort, staff service, food and entertainment, and price. Simultaneously, the positive and negative sentiment in each review was analyzed with the SVM method, which achieved an accuracy of 83.6% for a training-to-testing ratio of 50:50, 82.75% for 60:40, and 83.3% for 70:30. This research shows that two different text mining algorithms can be implemented simultaneously in an effective and efficient way while still providing accurate and satisfactory performance in one integrated information system.

Ooredoo Rayek

International Journal of Technology Diffusion ◽

10.4018/ijtd.2020040105 ◽

2020 ◽

Vol 11 (2) ◽

pp. 66-81

Author(s):

Badia Klouche ◽

Sidi Mohamed Benslimane ◽

Sakina Rim Bennabi

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Text Mining ◽

Sentiment Analysis ◽

Experimental Results ◽

Support Vector ◽

Textual Data ◽

New Strategy ◽

Set Up

Sentiment analysis is an emerging research area concerned with sentiment polarity classification and text mining, driven in particular by the considerable number of opinions available on social media. The Algerian telephone operator Ooredoo, like other operators, is deploying a new strategy to win new customers by exploiting their opinions through sentiment analysis. The purpose of this work is to set up a system called “Ooredoo Rayek”, whose objective is to collect, transliterate, translate, and classify the textual data expressed by Ooredoo's customers. This article develops a set of rules for transliteration from Algerian Arabizi to Algerian dialect. Furthermore, the authors use Naïve Bayes (NB) and Support Vector Machine (SVM) classifiers to assign polarity tags to Facebook comments from Ooredoo's official pages, written in a multilingual and multi-dialect context. Experimental results show that the system performs well, with 83% accuracy.

Building and analysis of protein-protein interactions related to diabetes mellitus using support vector machine, biomedical text mining and network analysis

Computational Biology and Chemistry ◽

10.1016/j.compbiolchem.2016.09.011 ◽

2016 ◽

Vol 65 ◽

pp. 37-44 ◽

Cited By ~ 11

Author(s):

Renu Vyas ◽

Sanket Bapat ◽

Esha Jain ◽

Muthukumarasamy Karthikeyan ◽

Sanjeev Tambe ◽

...

Keyword(s):

Diabetes Mellitus ◽

Support Vector Machine ◽

Network Analysis ◽

Text Mining ◽

Protein Interactions ◽

Support Vector ◽

Biomedical Text ◽

Biomedical Text Mining ◽

Protein Protein Interactions

Embedded Speech Recognition Based on Multiclass Support Vector Machine

Key Engineering Materials ◽

10.4028/www.scientific.net/kem.467-469.1905 ◽

2011 ◽

Vol 467-469 ◽

pp. 1905-1910

Author(s):

Jun Feng Zhao ◽

Ye Ping Zhu

Keyword(s):

Support Vector Machine ◽

Speech Recognition ◽

Recognition System ◽

Support Vector ◽

Decision Tree Classifier ◽

Advantages And Disadvantages ◽

Embedded Platform ◽

Tree Classifier ◽

Multiclass Support Vector Machine ◽

Multiclass Svm

This paper introduces the characteristics and requirements of speech recognition technology on embedded platforms. It also describes the basic theory and related properties of the Support Vector Machine. The advantages and disadvantages of multiclass SVM algorithms are analyzed, providing the algorithmic principles for training and recognition when SVM is applied in an embedded speech recognition system. Finally, we propose a design strategy based on a multiclass SVM decision tree classifier, combined with the features of embedded speech recognition.

NAIVE BAYES CLASSIFIER DAN SUPPORT VECTOR MACHINE SEBAGAI ALTERNATIF SOLUSI UNTUK TEXT MINING

Jurnal Teknologi Informasi dan Pendidikan ◽

10.24036/tip.v12i2.219 ◽

2019 ◽

Vol 12 (2) ◽

pp. 32-38

Author(s):

Iin Ernawati

Keyword(s):

Support Vector Machine ◽

Text Mining ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Classification Algorithms ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

The Relationship

This study concerns text-based data mining, often called text mining, using two commonly used classification methods: the Naïve Bayes classifier (NBC) and the support vector machine (SVM). The classification focuses on Indonesian-language documents, and the relationship between documents is measured by probability, which can be checked against other classification algorithms. This is evident from the conclusion that the Naïve Bayes classifier (NBC) probability result places the word “party” at least in the economic and political documents. The support vector machine (SVM) result shows that the words “price” and “kpk” are contained in both economic and political documents.

Analisis Sentimen Pengguna Twitter Terhadap Layanan Internet Provider Menggunakan Algoritma Support Vector Machine

Matrik Jurnal Manajemen Teknik Informatika dan Rekayasa Komputer ◽

10.30812/matrik.v20i2.1130 ◽

2021 ◽

Vol 20 (2) ◽

pp. 407-416

Author(s):

Fadhilah Dwi Ananda ◽

Yoga Pristyanto

Keyword(s):

Support Vector Machine ◽

Text Mining ◽

Support Vector

Social media is now a communication medium that Indonesians frequently use to express opinions. One frequently used platform is Twitter. Twitter provides a great deal of information through tweets, and the information written there contains data that can be processed. This study applies text mining with the Support Vector Machine algorithm to classify the sentiment of Twitter users toward the Biznet internet service. The kernels used are the linear kernel and the RBF kernel. Testing used three scenarios: scenario 1 with 800 records, scenario 2 with 900 records, and scenario 3 with 1000 records, each split into 90% training data and 10% testing data. Based on the tests with the linear and RBF kernels, the following conclusions can be drawn. SVM with either the linear or the RBF kernel gives relatively similar evaluation results in accuracy, precision, and recall. It can therefore be said that SVM with either kernel can be used well to determine the sentiment of Biznet internet users. In addition, across the three test scenarios with different amounts of data, SVM with either kernel performed consistently.
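The two kernels compared in this abstract can be written as plain similarity functions on feature vectors. This is a minimal sketch of the standard textbook definitions; the function names and the default `gamma` value are assumptions for illustration, since the abstract does not state the paper's kernel parameters.

```python
import math


def linear_kernel(x, y):
    """Linear kernel: the dot product of the two vectors."""
    return sum(a * b for a, b in zip(x, y))


def rbf_kernel(x, y, gamma=0.5):
    """RBF kernel: exp(-gamma * squared Euclidean distance). gamma is assumed."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


if __name__ == "__main__":
    # Two made-up term-weight vectors.
    u, v = [1.0, 2.0], [3.0, 4.0]
    print("linear:", linear_kernel(u, v))
    print("rbf:", rbf_kernel(u, v))
```

The RBF kernel always returns a value in (0, 1] and equals 1 only for identical vectors, whereas the linear kernel is unbounded and depends on vector length.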

Analisis Perbandingan Kernel Algoritma Support Vector Machine dalam Mengklasifikasikan Skripsi Teknik Informatika berdasarkan Abstrak

JOINS (Journal of Information System) ◽

10.33633/joins.v5i2.3715 ◽

2020 ◽

Vol 5 (2) ◽

pp. 240-249

Author(s):

Anggri Liani

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Text Mining ◽

Cross Validation ◽

Support Vector

Students must complete an undergraduate thesis (skripsi) to finish their S-1 (bachelor's) degree, yet choosing a thesis topic is the first difficulty they face, and it is one of the factors behind late graduation. Classifying theses by their abstracts can help students find references when choosing a topic. The methodology is a text mining process consisting of case folding, tokenizing, filtering, stemming, TF-IDF, data mining, and evaluation. The data are split into 80% training data and 20% test data. Classification uses the Support Vector Machine (SVM) algorithm. SVM is a classification algorithm with several kernels, namely the linear kernel and the 3 most commonly considered kernels. Data validation uses 10-fold cross validation. With the linear kernel, accuracy was 81%, precision 82%, and recall 81%.
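The 10-fold cross validation mentioned in this abstract partitions the data into ten folds and uses each fold once as the test set. The sketch below shows one simple way to produce such index splits without shuffling; the function name and the sample count are assumptions for illustration and are not taken from the paper.

```python
def k_fold_indices(n_samples, k=10):
    """Return k (train_indices, test_indices) pairs covering every sample once."""
    indices = list(range(n_samples))
    # Spread the remainder so fold sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in fold_sizes:
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        folds.append((train, test))
        start += size
    return folds


if __name__ == "__main__":
    # 25 samples is an arbitrary number chosen for the example.
    for train, test in k_fold_indices(25, k=10):
        print(len(train), test)
```

A model would be trained on each `train` split and scored on the matching `test` split, and the ten scores averaged.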
