Penerapan SVM dan Information Gain Pada Analisis Sentimen Pelaksanaan Pilkada Saat Pandemi

Sosial media pada masa kini banyak dimanfaatkan untuk berbagai aktifitas, salah satunya adalah untuk menumpahkan segala tanggapannya terhadap kejadian-kejadian yang tengah terjadi di masyarakat. Seperti banyaknya masyarakat yang memberikan tanggapan terhadap kebijakan pemerintah Indonesia mengenai perlaksanaan Pilkada 2020 yang tetap diselenggarakan meski di tengah pandemi Covid-19 di Twitter. Berbagai tanggapan masyarakat ada yang mendukung maupun tidak setuju dengan diadakannya pilkada 2020 karna dilaksanakan di masa pandemi. Untuk itu maka dilakukan penerapan data mining dengan algoritma Support Vector Machine dan seleksi fitur information gain untuk menganalisis berbagai tanggapan masyarakat mengenai pelaksanaan pilkada 2020. Data yang digunakan merupakan tweet dari aplikasi Twitter sebanyak 496 data. Sebelum tahap data mining, dilakukan pembagian data menjadi 80% data traning dan 20% data testing. Hasil klasifikasi data tweet dengan Support Vector Machine menggunakan kernel linear menghasilkan nilai akurasi yang besar yaitu 92%, precision 90%, dan recall 92%.

Download Full-text

KLASIFIKASI SMS SPAM MENGGUNAKAN SUPPORT VECTOR MACHINE

Jurnal Pilar Nusa Mandiri ◽

10.33480/pilar.v15i2.693 ◽

2019 ◽

Vol 15 (2) ◽

pp. 275-280

Author(s):

Agus Setiyono ◽

Hilman F Pardede

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Spam Detection ◽

Support Vector Machine Algorithm ◽

Data Mining Techniques ◽

To Receive

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam. One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.

Download Full-text

Combining support vector machine with radial basis function kernel and information gain for sentiment analysis of movie reviews

Journal of Physics Conference Series ◽

10.1088/1742-6596/1918/4/042157 ◽

2021 ◽

Vol 1918 (4) ◽

pp. 042157

Author(s):

Z Abidin ◽

W Destian ◽

R Umer

Keyword(s):

Support Vector Machine ◽

Radial Basis Function ◽

Sentiment Analysis ◽

Basis Function ◽

Information Gain ◽

Support Vector ◽

Radial Basis Function Kernel ◽

Radial Basis

Download Full-text

Data Mining Approach to Analyze COVID-19 Clinical Dataset

10.53350/pjmhs211561812 ◽

2021 ◽

Vol 15 (6) ◽

pp. 1812-1819

Author(s):

Azita Yazdani ◽

Ramin Ravangard ◽

Roxana Sharifian

Keyword(s):

Artificial Intelligence ◽

Data Mining ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Clinical Signs ◽

Study Data ◽

Mining Machine ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Approach

The new coronavirus has been spreading since the beginning of 2020 and many efforts have been made to develop vaccines to help patients recover. It is now clear that the world needs a rapid solution to curb the spread of COVID-19 worldwide with non-clinical approaches such as data mining, enhanced intelligence, and other artificial intelligence techniques. These approaches can be effective in reducing the burden on the health care system to provide the best possible way to diagnose and predict the COVID-19 epidemic. In this study, data mining models for early detection of Covid-19 in patients were developed using the epidemiological dataset of patients and individuals suspected of having Covid-19 in Iran. C4.5, support vector machine, Naive Bayes, logistic regression, Random Forest, and k-nearest neighbor algorithm were used directly on the dataset using Rapid miner to develop the models. By receiving clinical signs, this model diagnosis the risk of contracting the COVID-19 virus. Examination of the models in this study has shown that the support vector machine with 93.41% accuracy is more efficient in the diagnosis of patients with COVID-19 pandemic, which is the best model among other developed models. Keywords: COVID-19, Data mining, Machine Learning, Artificial Intelligence, Classification

Download Full-text

Support Vector Machine VS Information Gain: Analisis Sentimen Cyberbullying di Twitter Indonesia

Jurnal ULTIMA InfoSys ◽

10.31937/si.v11i2.1740 ◽

2020 ◽

Vol 11 (2) ◽

pp. 107-111

Author(s):

Christevan Destitus ◽

Wella Wella ◽

Suryasari Suryasari

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

Text Mining ◽

Information Gain ◽

Text Processing ◽

Support Vector ◽

Term Weighting ◽

System Process ◽

Research Stage

This study aims to clarify tweets on twitter using the Support Vector Machine and Information Gain methods. The clarification itself aims to find a hyperplane that separates the negative and positive classes. In the research stage, there is a system process, namely text mining, text processing which has stages of tokenizing, filtering, stemming, and term weighting. After that, a feature selection is made by information gain which calculates the entropy value of each word. After that, clarify based on the features that have been selected and the output is in the form of identifying whether the tweet is bully or not. The results of this study found that the Support Vector Machine and Information Gain methods have sufficiently maximum results.

Download Full-text

Using Stratified Sample and Grid Search to Improve Disease Prediction Accuracy of SVM

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.295-298.644 ◽

2013 ◽

Vol 295-298 ◽

pp. 644-647 ◽

Cited By ~ 1

Author(s):

Yu Kai Yao ◽

Hong Mei Cui ◽

Ming Wei Len ◽

Xiao Yun Chen

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Classification Accuracy ◽

Prediction Accuracy ◽

Support Vector ◽

Disease Prediction ◽

Data Mining Algorithm ◽

Grid Search ◽

Mining Algorithm ◽

Stratified Sample

SVM (Support Vector Machine) is a powerful data mining algorithm, and is mainly used to finish classification or regression tasks. In this literature, SVM is used to conduct disease prediction. We focus on integrating with stratified sample and grid search technology to improve the classification accuracy of SVM, thus, we propose an improved algorithm named SGSVM: Stratified sample and Grid search based SVM. To testify the performance of SGSVM, heart-disease data from UCI are used in our experiment, and the results show SGSVM has obvious improvement in classification accuracy, and this is very valuable especially in disease prediction.

Download Full-text

Analysis of Decision Tree and Smooth Support Vector Machine Methods on Data Mining

Journal of Physics Conference Series ◽

10.1088/1742-6596/1255/1/012067 ◽

2019 ◽

Vol 1255 ◽

pp. 012067

Author(s):

Natalina Br Sitepu ◽

Sawaluddin ◽

M Zarlis ◽

Syahril Efendi ◽

Hanna Willa Dhany

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Decision Tree ◽

Support Vector ◽

Smooth Support Vector Machine

Download Full-text

An Improved Hybrid Model for Early Prediction of Autism

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.e6812.018520 ◽

2020 ◽

Vol 8 (5) ◽

pp. 4358-4361

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Intellectual Disabilities ◽

Hybrid Model ◽

Support Vector ◽

Early Prediction ◽

Creative Mind ◽

Social Collaboration

Autism is described by extreme, unavoidable intellectual disabilities which are adverse on perspectives related with social collaboration, correspondence, creative mind and conduct. Treating Autism has secured an exceptional spot, as a few heuristic and measurable models are proposed by scientists working around there. Henceforth kids influenced with such issue should be upheld with recognition of an early, well-planned and singular scholarly endeavours created in adjusted settings bringing about early location and accurately diagnose the issues of Autism. Requirements of Data mining and soft computational methodologies are thought as a characteristic qualified for finding confounded examples. The paper defines a definite investigation and proposes the hybrid improved methodologies of Bee Hive Optimization with Support Vector Machine for the requirement of versatile and early prediction of Autism among developing youngsters with more Accuracy and with the less error and time.

Download Full-text

Integration of synthetic minority oversampling technique for imbalanced class

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v13.i1.pp102-108 ◽

2019 ◽

Vol 13 (1) ◽

pp. 102

Author(s):

Noviyanti Santoso ◽

Wahyu Wibowo ◽

Hilda Hikmawati

Keyword(s):

Machine Learning ◽

Data Mining ◽

Support Vector Machine ◽

Class Imbalance ◽

Original Data ◽

Support Vector ◽

Classification Methods ◽

Problematic Issue ◽

Imbalanced Class ◽

F Measure

In the data mining, a class imbalance is a problematic issue to look for the solutions. It probably because machine learning is constructed by using algorithms with assuming the number of instances in each balanced class, so when using a class imbalance, it is possible that the prediction results are not appropriate. They are solutions offered to solve class imbalance issues, including oversampling, undersampling, and synthetic minority oversampling technique (SMOTE). Both oversampling and undersampling have its disadvantages, so SMOTE is an alternative to overcome it. By integrating SMOTE in the data mining classification method such as Naive Bayes, Support Vector Machine (SVM), and Random Forest (RF) is expected to improve the performance of accuracy. In this research, it was found that the data of SMOTE gave better accuracy than the original data. In addition to the three classification methods used, RF gives the highest average AUC, F-measure, and G-means score.

Download Full-text