scholarly journals Penerapan SVM dan Information Gain Pada Analisis Sentimen Pelaksanaan Pilkada Saat Pandemi

2021 ◽  
Vol 7 (2) ◽  
pp. 101-109
Author(s):  
Aliffia Kulsumarwati ◽  
Intan Purnamasari ◽  
Budi Arif Darmawan

Sosial media pada masa kini banyak dimanfaatkan untuk berbagai aktifitas, salah satunya adalah untuk menumpahkan segala tanggapannya terhadap kejadian-kejadian yang tengah terjadi di masyarakat. Seperti banyaknya masyarakat yang memberikan tanggapan terhadap kebijakan pemerintah Indonesia mengenai perlaksanaan Pilkada 2020 yang tetap diselenggarakan meski di tengah pandemi Covid-19 di Twitter. Berbagai tanggapan masyarakat ada yang mendukung maupun tidak setuju dengan diadakannya pilkada 2020 karna dilaksanakan di masa pandemi. Untuk itu maka dilakukan penerapan data mining dengan algoritma Support Vector Machine dan seleksi fitur information gain untuk menganalisis berbagai tanggapan masyarakat mengenai pelaksanaan pilkada 2020. Data yang digunakan merupakan tweet dari aplikasi Twitter sebanyak 496 data. Sebelum tahap data mining, dilakukan pembagian data menjadi 80% data traning dan 20% data testing. Hasil klasifikasi  data tweet dengan Support Vector Machine menggunakan kernel linear menghasilkan nilai akurasi yang besar yaitu 92%, precision 90%, dan recall 92%.

2019 ◽  
Vol 15 (2) ◽  
pp. 275-280
Author(s):  
Agus Setiyono ◽  
Hilman F Pardede

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam.  One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.


2021 ◽  
Vol 15 (6) ◽  
pp. 1812-1819
Author(s):  
Azita Yazdani ◽  
Ramin Ravangard ◽  
Roxana Sharifian

The new coronavirus has been spreading since the beginning of 2020 and many efforts have been made to develop vaccines to help patients recover. It is now clear that the world needs a rapid solution to curb the spread of COVID-19 worldwide with non-clinical approaches such as data mining, enhanced intelligence, and other artificial intelligence techniques. These approaches can be effective in reducing the burden on the health care system to provide the best possible way to diagnose and predict the COVID-19 epidemic. In this study, data mining models for early detection of Covid-19 in patients were developed using the epidemiological dataset of patients and individuals suspected of having Covid-19 in Iran. C4.5, support vector machine, Naive Bayes, logistic regression, Random Forest, and k-nearest neighbor algorithm were used directly on the dataset using Rapid miner to develop the models. By receiving clinical signs, this model diagnosis the risk of contracting the COVID-19 virus. Examination of the models in this study has shown that the support vector machine with 93.41% accuracy is more efficient in the diagnosis of patients with COVID-19 pandemic, which is the best model among other developed models. Keywords: COVID-19, Data mining, Machine Learning, Artificial Intelligence, Classification


2020 ◽  
Vol 11 (2) ◽  
pp. 107-111
Author(s):  
Christevan Destitus ◽  
Wella Wella ◽  
Suryasari Suryasari

This study aims to clarify tweets on twitter using the Support Vector Machine and Information Gain methods. The clarification itself aims to find a hyperplane that separates the negative and positive classes. In the research stage, there is a system process, namely text mining, text processing which has stages of tokenizing, filtering, stemming, and term weighting. After that, a feature selection is made by information gain which calculates the entropy value of each word. After that, clarify based on the features that have been selected and the output is in the form of identifying whether the tweet is bully or not. The results of this study found that the Support Vector Machine and Information Gain methods have sufficiently maximum results.


2013 ◽  
Vol 295-298 ◽  
pp. 644-647 ◽  
Author(s):  
Yu Kai Yao ◽  
Hong Mei Cui ◽  
Ming Wei Len ◽  
Xiao Yun Chen

SVM (Support Vector Machine) is a powerful data mining algorithm, and is mainly used to finish classification or regression tasks. In this literature, SVM is used to conduct disease prediction. We focus on integrating with stratified sample and grid search technology to improve the classification accuracy of SVM, thus, we propose an improved algorithm named SGSVM: Stratified sample and Grid search based SVM. To testify the performance of SGSVM, heart-disease data from UCI are used in our experiment, and the results show SGSVM has obvious improvement in classification accuracy, and this is very valuable especially in disease prediction.


2019 ◽  
Vol 1255 ◽  
pp. 012067
Author(s):  
Natalina Br Sitepu ◽  
Sawaluddin ◽  
M Zarlis ◽  
Syahril Efendi ◽  
Hanna Willa Dhany

2020 ◽  
Vol 8 (5) ◽  
pp. 4358-4361

Autism is described by extreme, unavoidable intellectual disabilities which are adverse on perspectives related with social collaboration, correspondence, creative mind and conduct. Treating Autism has secured an exceptional spot, as a few heuristic and measurable models are proposed by scientists working around there. Henceforth kids influenced with such issue should be upheld with recognition of an early, well-planned and singular scholarly endeavours created in adjusted settings bringing about early location and accurately diagnose the issues of Autism. Requirements of Data mining and soft computational methodologies are thought as a characteristic qualified for finding confounded examples. The paper defines a definite investigation and proposes the hybrid improved methodologies of Bee Hive Optimization with Support Vector Machine for the requirement of versatile and early prediction of Autism among developing youngsters with more Accuracy and with the less error and time.


Author(s):  
Noviyanti Santoso ◽  
Wahyu Wibowo ◽  
Hilda Hikmawati

In the data mining, a class imbalance is a problematic issue to look for the solutions. It probably because machine learning is constructed by using algorithms with assuming the number of instances in each balanced class, so when using a class imbalance, it is possible that the prediction results are not appropriate. They are solutions offered to solve class imbalance issues, including oversampling, undersampling, and synthetic minority oversampling technique (SMOTE). Both oversampling and undersampling have its disadvantages, so SMOTE is an alternative to overcome it. By integrating SMOTE in the data mining classification method such as Naive Bayes, Support Vector Machine (SVM), and Random Forest (RF) is expected to improve the performance of accuracy. In this research, it was found that the data of SMOTE gave better accuracy than the original data. In addition to the three classification methods used, RF gives the highest average AUC, F-measure, and G-means score.


Sign in / Sign up

Export Citation Format

Share Document