ANALISIS SENTIMEN PENGGUNA GOPAY MENGGUNAKAN METODE LEXICON BASED DAN SUPPORT VECTOR MACHINE (Sentiment Analysis of GoPay Users Using the Lexicon-Based Method and Support Vector Machine)

KOMPUTEK ◽  
2019 ◽  
Vol 3 (2) ◽  
pp. 52
Author(s):  
Rachmad Mahendrajaya ◽  
Ghulam Asrofi Buntoro ◽  
Moh Bhanu Setyawan

Go-Pay is part of the Gojek application and one of the most popular fintech services in Indonesia. Despite its popularity, not all user comments are positive; some are negative. Users can now voice their opinions through various media, one of which is Twitter. Twitter offers a simple display, up-to-date topics, open access to tweets, and a quick way to express opinions. Sorting the variety of comments on Twitter into positive and negative classes requires a classification technique. This study preprocesses the opinions and labels them as positive or negative with the lexicon-based method, then classifies them with the Support Vector Machine (SVM) method. The data consist of 1210 opinions about Go-Pay collected from Twitter. Lexicon-based labeling yielded 923 positive and 287 negative opinions. SVM classification reached 89.17% with the linear kernel and 84.38% with the polynomial kernel.
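The lexicon-based labeling step described above can be sketched roughly as follows. This is a minimal illustration, not the study's implementation: the word lists `POSITIVE` and `NEGATIVE`, the function names, and the rule that a tied score counts as positive are all assumptions, since the abstract does not give the lexicon or the tie-breaking rule.

```python
import re

# Hypothetical toy lexicons, for illustration only; the study's actual
# lexicon is not given in the abstract.
POSITIVE = {"good", "fast", "easy", "love"}
NEGATIVE = {"bad", "slow", "failed", "hate"}


def lexicon_score(text: str) -> int:
    """Return (number of positive words) minus (number of negative words)."""
    tokens = re.findall(r"[a-z]+", text.lower())
    positives = sum(token in POSITIVE for token in tokens)
    negatives = sum(token in NEGATIVE for token in tokens)
    return positives - negatives


def lexicon_label(text: str) -> str:
    """Label by the sign of the score; a tie counts as positive (an assumption)."""
    return "positive" if lexicon_score(text) >= 0 else "negative"


tweets = ["Top up was fast and easy", "Payment failed again, so slow"]
labels = [lexicon_label(tweet) for tweet in tweets]
print(labels)
```

Labels produced this way could then serve as training targets for a classifier such as an SVM, which is the two-stage arrangement the abstract describes.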

2020 ◽  
Vol 11 (2) ◽  
pp. 66-81
Author(s):  
Badia Klouche ◽  
Sidi Mohamed Benslimane ◽  
Sakina Rim Bennabi

Sentiment analysis is an emerging research area concerned with classifying sentiment polarity in text, driven by the considerable number of opinions available on social media. The Algerian telephone operator Ooredoo, like other operators, aims in its new strategy to win new customers by exploiting their opinions through sentiment analysis. The purpose of this work is to set up a system called “Ooredoo Rayek”, whose objective is to collect, transliterate, translate and classify the textual data expressed by the Ooredoo operator's customers. This article develops a set of rules allowing transliteration from Algerian Arabizi to Algerian dialect. Furthermore, the authors use Naïve Bayes (NB) and Support Vector Machine (SVM) classifiers to assign polarity tags to Facebook comments from the official Ooredoo pages, written in a multilingual and multi-dialect context. Experimental results show that the system performs well, with 83% accuracy.


2020 ◽  
Vol 1 (1) ◽  
pp. 78-90
Author(s):  
Leonardo Leonardo ◽  
Yohannes Yohannes ◽  
Ery Hartati

Garbage is a problem that persistently arises in Indonesia and around the world. Waste production grows along with population and consumption. Preventive measures are therefore needed to reduce waste, one of which is recycling. This research classifies recyclable garbage into cardboard, glass, metal, paper and plastic using Local Binary Pattern (LBP) texture feature extraction and a Support Vector Machine (SVM) as the classifier. Testing and dataset distribution use K-Fold Cross Validation of the Leave One Out (LOO) type. The tests were run with 5 to 10 folds. The polynomial kernel achieved the highest accuracy for every fold setting, with a mean of 87.82%. Across the SVM tests with linear, polynomial and Gaussian kernels using 5 to 10 folds, the best accuracy was 96.01% for cardboard, 90.62% for glass, 89.72% for metal, 96.01% for paper and 87.64% for plastic.
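For readers unfamiliar with LBP, the basic 3x3 operator can be sketched as below. This is an illustrative sketch only: the neighbor ordering (clockwise from the top-left), the `>=` comparison, and the function names are assumptions, and the abstract does not say which LBP variant the authors used.

```python
def lbp_code(image, row, col):
    """Basic 8-neighbor LBP code of the pixel at (row, col).

    Each neighbor contributes one bit: 1 if it is >= the center value.
    The clockwise-from-top-left bit order is an assumed convention.
    """
    center = image[row][col]
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    code = 0
    for bit, (d_row, d_col) in enumerate(offsets):
        if image[row + d_row][col + d_col] >= center:
            code |= 1 << bit
    return code


def lbp_histogram(image):
    """256-bin histogram of LBP codes over interior pixels (borders skipped)."""
    histogram = [0] * 256
    for row in range(1, len(image) - 1):
        for col in range(1, len(image[0]) - 1):
            histogram[lbp_code(image, row, col)] += 1
    return histogram


# A single 3x3 grayscale patch with made-up intensities.
patch = [
    [6, 5, 2],
    [7, 6, 1],
    [9, 8, 7],
]
print(lbp_code(patch, 1, 1))
```

The histogram, rather than the raw codes, is what would typically be passed to a classifier as the texture feature vector.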


Author(s):  
Nor Ain Maisarah Samsudin et al.

This study presents a statistical investigation of the pattern of students’ academic performance before and after the move to online learning under the Movement Control Order (MCO) during the pandemic, and models students’ academic performance by classification with a Support Vector Machine (SVM). The data sample was taken from undergraduate students of the Faculty of Science and Mathematics, Universiti Pendidikan Sultan Idris (UPSI). Students’ Grade Point Averages (GPA) were used to develop a model of academic performance during the Covid-19 outbreak. The model was used to predict the academic performance of university students when classes were conducted online. The SVM algorithm has two important parameters, C (the misclassification tolerance parameter) and epsilon, which need to be identified before further analysis. The parameters were applied to four types of kernel: linear, radial basis function, polynomial and sigmoid. The best accuracy achieved by SVM was 73.68% with the linear kernel, and the worst was 67.99% with the sigmoid kernel, with the misclassification tolerance C set to 128 and epsilon to 0.6.
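The four kernel types named above are commonly defined as in the sketch below. The function names and the default values of `degree`, `gamma` and `coef0` are illustrative assumptions; the abstract does not report the kernel hyperparameters used in the study.

```python
import math


def dot(x, y):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(x, y))


def linear_kernel(x, y):
    return dot(x, y)


def polynomial_kernel(x, y, degree=3, gamma=1.0, coef0=1.0):
    return (gamma * dot(x, y) + coef0) ** degree


def rbf_kernel(x, y, gamma=1.0):
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


def sigmoid_kernel(x, y, gamma=1.0, coef0=0.0):
    return math.tanh(gamma * dot(x, y) + coef0)


u, v = [1.0, 2.0], [3.0, 4.0]
for kernel in (linear_kernel, polynomial_kernel, rbf_kernel, sigmoid_kernel):
    print(kernel.__name__, kernel(u, v))
```

Each function measures similarity between two feature vectors; swapping one for another changes the shape of the decision boundary the SVM can learn, which is why the same data can give different accuracies per kernel.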


Author(s):  
Suhas S ◽  
Dr. C. R. Venugopal

This paper develops and presents an enhanced system for classifying MR images using a combination of kernels with a support vector machine, along with the design and development of a content-based image retrieval (CBIR) system. Content-based image retrieval is the process of finding relevant images in a large image database using visual queries. Medical imaging has led to rapid growth in large image collections. An Oriented Rician Noise Reduction Anisotropic Diffusion filter is used for image denoising. A modified hybrid Otsu algorithm is used for image segmentation. Texture features are extracted using the GLCM method. A genetic algorithm with joint entropy is adopted for feature selection. Classification is done by a support vector machine with various kernels, and the performance is validated. A classification accuracy of 98.83% is obtained using SVM with the GRBF kernel. The extracted features are used to classify MR images into five categories. The performance of the MC-SVM classifier is compared across different kernel functions. From the analysis and performance measures such as classification accuracy, it is inferred that brain and spinal cord MRI classification is done better by MC-SVM with the Gaussian RBF kernel than with linear or polynomial kernel functions. The proposed system can provide the best classification performance, with high accuracy and a low error rate.
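As a rough illustration of GLCM texture extraction, the sketch below builds a normalized gray-level co-occurrence matrix for one pixel offset and derives the contrast feature from it. The offset (one pixel to the right), the number of gray levels, and the choice of contrast as the example feature are assumptions; the abstract does not list the offsets or features the authors used.

```python
def glcm(image, levels, d_row=0, d_col=1):
    """Normalized co-occurrence matrix for the offset (d_row, d_col).

    Entry [i][j] is the fraction of pixel pairs where a pixel of gray
    level i has a neighbor of gray level j at the given offset.
    Assumes the image contains at least one valid pair.
    """
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + d_row, c + d_col
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
    total = sum(sum(row) for row in counts)
    return [[value / total for value in row] for row in counts]


def contrast(matrix):
    """GLCM contrast: sum over i, j of p(i, j) * (i - j)^2."""
    size = len(matrix)
    return sum(matrix[i][j] * (i - j) ** 2
               for i in range(size) for j in range(size))


# Tiny made-up image quantized to three gray levels (0, 1, 2).
image = [
    [0, 0, 1],
    [1, 2, 2],
    [0, 1, 2],
]
matrix = glcm(image, levels=3)
print(contrast(matrix))
```

Other GLCM statistics (energy, homogeneity, correlation) are computed from the same matrix in a similar way.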


Author(s):  
Nadhia Azzahra ◽  
Danang Murdiansyah ◽  
Kemas Lhaksmana

The use of social media continues to increase over time, and its ease of access and familiarity make it easier for irresponsible users to do unethical things such as spreading hatred, defamation, radicalism, pornography, and so on. Although there are regulations that govern activity on social media, they are still not working effectively. In this study, we classify toxic comments containing unethical content using the SVM method with TF-IDF for feature extraction and Chi Square for feature selection. The best result in our experiments was obtained by the SVM model with a linear kernel, without Chi Square, and with stemming and stop word removal, reaching an F1-score of 76.57%.
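TF-IDF feature extraction can be sketched as follows. This uses the plain textbook weighting (term frequency times log of N over document frequency); libraries often apply smoothed variants, and the abstract does not say which one the authors used. The sample documents and the function name are made up for illustration.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: weight} dict per tokenized, non-empty document.

    weight = (count of term in doc / doc length) * log(N / df(term)),
    where df(term) is the number of documents containing the term.
    """
    n_docs = len(documents)
    doc_freq = Counter(term for doc in documents for term in set(doc))
    weights = []
    for doc in documents:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Made-up tokenized comments.
docs = [
    ["you", "are", "awful"],
    ["you", "are", "kind"],
    ["you", "awful", "spam"],
]
weights = tf_idf(docs)
print(weights[1])
```

A term that appears in every document, such as "you" here, receives weight zero, while rarer terms are weighted up; this is what makes the representation useful ahead of feature selection and classification.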


CCIT Journal ◽  
2017 ◽  
Vol 10 (2) ◽  
pp. 197-206
Author(s):  
Atika Rahmawati ◽  
Aris Marjuni ◽  
Junta Zeniarja

Pilkada Serentak (simultaneous regional elections) is a very important event for the future of regions and the country. Through this election people can cast their votes and elect their representatives. Public response can be expressed through Twitter. Sentiment analysis of Twitter can then gauge the public response to the holding of the simultaneous elections. Classification can be performed on the text tweeted by Twitter users. In this study, responses are classified from text because it is easy to obtain and process. This study classifies responses in Indonesian-language text and improves accuracy by using SVM. The tweets are categorized into two basic classes: positive and negative. The data comprise 3000 Indonesian tweets. Labeling is not done manually but with a clustering method that divides the 3000 items into two groups, Cluster 1 for positive tweets and Cluster 2 for negative tweets. Of these, 2700 are used as training data and 300 as test data. Pre-processing includes tokenization, case normalization, stop word detection, and stemming. Classification uses a Support Vector Machine (SVM). SVM achieved the highest accuracy, 91%, compared with 82% for k-means clustering.
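The pre-processing stages listed above might look roughly like the sketch below, which covers case normalization, tokenization and stop word removal. The stop word list is a small hypothetical sample, and stemming is deliberately left out because Indonesian stemming needs a dedicated stemmer that the abstract does not name.

```python
import re

# Hypothetical, very small Indonesian stop word sample for illustration.
STOP_WORDS = {"yang", "dan", "di", "ke", "dari", "ini", "itu"}


def preprocess(tweet: str) -> list:
    """Lower-case, tokenize on letters/digits, and drop stop words.

    Stemming is omitted here; a real pipeline would add it as a last step.
    """
    lowered = tweet.lower()                      # case normalization
    tokens = re.findall(r"[a-z0-9]+", lowered)   # tokenization
    return [token for token in tokens if token not in STOP_WORDS]


print(preprocess("Pilkada ini berjalan lancar dan aman!"))
```

The resulting token lists would then be turned into numeric features before training the classifier.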


2019 ◽  
Vol 2019 ◽  
pp. 1-11
Author(s):  
Davies Segera ◽  
Mwangi Mbuthia ◽  
Abraham Nyete

Determining an optimal decision model is an important but difficult combinatorial task in imbalanced microarray-based cancer classification. Though the multiclass support vector machine (MCSVM) has already made an important contribution in this field, its performance depends solely on three aspects: the penalty factor C, the type of kernel, and the kernel's parameters. To improve the performance of this classifier in microarray-based cancer analysis, this paper proposes the PSO-PCA-LGP-MCSVM model, which is based on particle swarm optimization (PSO), principal component analysis (PCA), and the multiclass support vector machine (MCSVM). The MCSVM is based on a hybrid kernel, linear-Gaussian-polynomial (LGP), that combines the advantages of three standard kernels (linear, Gaussian, and polynomial) in a novel manner: the linear kernel is linearly combined with the Gaussian kernel embedding the polynomial kernel. Further, this paper proves that the LGP kernel satisfies the properties of a valid kernel. To demonstrate the effectiveness of the model, several experiments were conducted and its results were compared with those of three single kernel-based models, namely PSO-PCA-L-MCSVM (utilizing a linear kernel), PSO-PCA-G-MCSVM (utilizing a Gaussian kernel), and PSO-PCA-P-MCSVM (utilizing a polynomial kernel). For the comparison, two dual and two multiclass imbalanced standard microarray datasets were used. Experimental results in terms of three extended assessment metrics (F-score, G-mean, and Accuracy) reveal the superior global feature extraction, prediction, and learning abilities of this model over the three single kernel-based models.


Author(s):  
Erwin B. Setiawan ◽  
Dwi H. Widyantoro ◽  
Kridanto Surendro

Information credibility in social media is becoming the most important part of information sharing in society. The literature shows that information credibility has not yet been labeled based on user competencies and their posted topics. This study improves the assessment of information credibility by adding 17 new features for Twitter and 49 new features for Facebook. In the first step, we perform a labeling process based on user competencies and their posted topics to classify users into two groups, credible and not credible, with regard to their posted topics. These approaches are evaluated over ten thousand samples of real-field data obtained from Twitter and Facebook using the Naive Bayes (NB), Support Vector Machine (SVM), Logistic Regression (Logit) and J48 classifiers. With the proposed new features, the assessment of information credibility in social media improves significantly, as indicated by better accuracy than the existing technique for all classifiers.


Breast cancer (BC) is the most commonly diagnosed invasive disorder and an important cause of death for women worldwide. In the Indian context, BC is the most widespread disease among females. The problem is all the more alarming for an economically developing country like India. The Government of India has made considerable efforts to raise awareness among the country's women, but despite the availability of diagnostic tools, predicting the disease in real situations remains a puzzle for researchers. Timely detection and categorization of BC using evolving techniques such as Machine Learning (ML) can play a significant role in BC identification and could serve as a preventive policy that effectively reduces the risk to BC patients. Four kernels are widely used in ML, but their performance varies with the kind of data available. In this study, we apply four kernels, the Linear Kernel (LK), Polynomial Kernel (PK), Sigmoid Kernel (SK) and Radial Basis Function Kernel (RBFK), to a BC dataset. We estimate the performance of the Support Vector Machine kernels (SVM-K) on the BC dataset. The basic idea is to check how well SVM-K classifies WBCD in terms of accuracy, runtime, specificity and precision. The results show that RBFK provides greater accuracy with minimal errors.
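Three of the evaluation measures named above (accuracy, specificity and precision) follow directly from the counts of a binary confusion matrix, as in the sketch below. The label encoding (1 for the positive class, 0 for the negative class), the function name and the sample labels are assumptions made for illustration; the numbers are not from the study.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, specificity and precision from true and predicted labels.

    Assumes both classes occur and at least one positive prediction is
    made, so no denominator is zero.
    """
    tp = fp = tn = fn = 0
    for truth, guess in zip(y_true, y_pred):
        if guess == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "specificity": tn / (tn + fp),   # true negative rate
        "precision": tp / (tp + fp),     # share of positive calls that are right
    }


# Made-up labels: 1 = malignant, 0 = benign (an assumed encoding).
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 1, 0, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

Reporting specificity alongside accuracy matters on medical data because a classifier can score high accuracy while still mislabeling many healthy cases.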


2021 ◽  
Vol 13 (2) ◽  
pp. 168-174
Author(s):  
Rifqatul Mukarramah ◽  
Dedy Atmajaya ◽  
Lutfi Budi Ilmawan

Sentiment analysis is a technique for extracting information about a person's perception, called sentiment, of an issue or event. This study employs sentiment analysis to classify society's response to the covid-19 virus posted on Twitter into 4 classes, namely happy, sad, angry, and scared. The classification technique is the support vector machine (SVM) method, comparing the classification performance of 2 kernel functions, linear and polynomial. 400 tweets were used, with 100 in each sentiment class. Under k-fold cross validation, the accuracy of the linear kernel function is 0.28 for the unigram feature and 0.36 for the trigram feature. These figures are lower than the accuracy of the polynomial kernel, 0.34 and 0.48 for the unigram and trigram features respectively. Testing with a confusion matrix, on the other hand, suggests that the highest performance is obtained with the polynomial kernel, with an accuracy of 0.51, precision of 0.43, recall of 0.45, and f-measure of 0.51.
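The unigram and trigram features mentioned above can be illustrated with a small n-gram extractor. The sketch assumes word-level n-grams over an already tokenized tweet; the abstract does not state whether the n-grams were built over words or characters, and the sample tokens are made up.

```python
def word_ngrams(tokens, n):
    """Return all contiguous n-token windows as tuples, in order."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


tokens = ["stay", "safe", "at", "home"]
unigrams = word_ngrams(tokens, 1)
trigrams = word_ngrams(tokens, 3)
print(unigrams)
print(trigrams)
```

Unigrams treat each word independently, while trigrams keep short word sequences together, so the two feature sets can lead to different classifier accuracies on the same tweets.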

