Sentiment Analysis of Movie Reviews Using Support Vector Machine Classifier with Linear Kernel Function

Support vector machine (SVM) is a known method for supervised learning in sentiment analysis and there are many studies about the use of SVM in classifying the sentiments in lecturer evaluation. SVM has various parameters that can be tuned and kernels that can be chosen to improve the classifier accuracy. However, not all options have been explored. Therefore, in this study we compared the four SVM kernels: radial, linear, polynomial, and sigmoid, to discover how each kernel influences the accuracy of the classifier. To make a proper assessment, we used our labeled dataset of students’ evaluations toward the lecturer. The dataset was split, one for training the classifier, and another one for testing the model. As an addition, we also used several different ratios of the training:testing dataset. The split ratios are 0.5 to 0.95, with the increment factor of 0.05. The dataset was split randomly, hence the splitting-training-testing processes were repeated 1,000 times for each kernel and splitting ratio. Therefore, at the end of the experiment, we got 40,000 accuracy data. Later, we applied statistical methods to see whether the differences are significant. Based on the statistical test, we found that in this particular case, the linear kernel significantly has higher accuracy compared to the other kernels. However, there is a tradeoff, where the results are getting more varied with a higher proportion of data used for training.

Download Full-text

A support vector machine classifier based on a new kernel function model for hyperspectral data

GIScience & Remote Sensing ◽

10.1080/15481603.2015.1114199 ◽

2015 ◽

Vol 53 (1) ◽

pp. 85-101 ◽

Cited By ~ 19

Author(s):

Zhilei Lin ◽

Luming Yan

Keyword(s):

Support Vector Machine ◽

Kernel Function ◽

Support Vector Machine Classifier ◽

Hyperspectral Data ◽

Support Vector ◽

Function Model

Download Full-text

Development of the cubic least squares mapping linear-kernel support vector machine classifier for improving the characterization of breast lesions on ultrasound

Computerized Medical Imaging and Graphics ◽

10.1016/j.compmedimag.2004.04.003 ◽

2004 ◽

Vol 28 (5) ◽

pp. 247-255 ◽

Cited By ~ 20

Author(s):

N. Piliouras ◽

I. Kalatzis ◽

N. Dimitropoulos ◽

D. Cavouras

Keyword(s):

Support Vector Machine ◽

Least Squares ◽

Support Vector Machine Classifier ◽

Support Vector ◽

Breast Lesions ◽

Linear Kernel ◽

Kernel Support Vector Machine

Download Full-text

Linguistic Rule Extraction from Support Vector Machine Classifiers

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch072 ◽

2008 ◽

pp. 1269-1279

Author(s):

Xiuju Fu ◽

Lipo Wang ◽

GihGuang Hung ◽

Liping Goh

Keyword(s):

Support Vector Machine ◽

Kernel Function ◽

Rule Extraction ◽

Support Vector ◽

Svm Classifier ◽

Data Sets ◽

Linear Kernel ◽

Linguistic Rule ◽

Linguistic Rules ◽

Rbf Kernel

Classification decisions from linguistic rules are more desirable compared to complex mathematical formulas from support vector machine (SVM) classifiers due to the explicit explanation capability of linguistic rules. Linguistic rule extraction has been attracting much attention in explaining knowledge hidden in data. In this chapter, we show that the decisions from an SVM classifier can be decoded into linguistic rules based on the information provided by support vectors and decision function. Given a support vector of a certain class, cross points between each line, which is extended from the support vector along each axis, and an SVM decision hyper-curve are searched first. A hyper-rectangular rule is derived from these cross points. The hyper-rectangle is tuned by a tuning phase in order to exclude those out-class data points. Finally, redundant rules are merged to produce a compact rule set. Simultaneously, important attributes could be highlighted in the extracted rules. Rule extraction results from our proposed method could follow SVM classifier decisions very well. We compare the rule extraction results from SVM with RBF kernel function and linear kernel function. Experiment results show that rules extracted from SVM with RBF nonlinear kernel function are with better accuracy than rules extracted from SVM with linear kernel function. Comparisons between our method and other rule extraction methods are also carried out on several benchmark data sets. Higher rule accuracy is obtained in our method with fewer number of premises in each rule.

Download Full-text

Linguistic Rule Extraction from Support Vector Machine Classifiers

Research and Trends in Data Mining Technologies and Applications ◽

10.4018/978-1-59904-271-8.ch010 ◽

2007 ◽

pp. 276-290

Author(s):

Xiuju Fu ◽

Lipo Wang ◽

GihGuang Hung ◽

Liping Goh

Keyword(s):

Support Vector Machine ◽

Kernel Function ◽

Rule Extraction ◽

Support Vector ◽

Svm Classifier ◽

Data Sets ◽

Linear Kernel ◽

Linguistic Rule ◽

Linguistic Rules ◽

Rbf Kernel

Classification decisions from linguistic rules are more desirable compared to complex mathematical formulas from support vector machine (SVM) classifiers due to the explicit explanation capability of linguistic rules. Linguistic rule extraction has been attracting much attention in explaining knowledge hidden in data. In this chapter, we show that the decisions from an SVM classifier can be decoded into linguistic rules based on the information provided by support vectors and decision function. Given a support vector of a certain class, cross points between each line, which is extended from the support vector along each axis, and an SVM decision hyper-curve are searched first. A hyper-rectangular rule is derived from these cross points. The hyper-rectangle is tuned by a tuning phase in order to exclude those out-class data points. Finally, redundant rules are merged to produce a compact rule set. Simultaneously, important attributes could be highlighted in the extracted rules. Rule extraction results from our proposed method could follow SVM classifier decisions very well. We compare the rule extraction results from SVM with RBF kernel function and linear kernel function. Experiment results show that rules extracted from SVM with RBF nonlinear kernel function are with better accuracy than rules extracted from SVM with linear kernel function. Comparisons between our method and other rule extraction methods are also carried out on several benchmark data sets. Higher rule accuracy is obtained in our method with fewer number of premises in each rule.

Download Full-text

Statistically–Induced Kernel Function for Support Vector Machine Classifier

Artificial Intelligence and Soft Computing - Lecture Notes in Computer Science ◽

10.1007/978-3-642-29347-4_43 ◽

2012 ◽

pp. 369-377

Author(s):

Cezary Dendek ◽

Jacek Mańdziuk

Keyword(s):

Support Vector Machine ◽

Kernel Function ◽

Support Vector Machine Classifier ◽

Support Vector

Download Full-text

Application of Support Vector Machine (SVM) in the Sentiment Analysis of Twitter DataSet

Applied Sciences ◽

10.3390/app10031125 ◽

2020 ◽

Vol 10 (3) ◽

pp. 1125 ◽

Cited By ~ 1

Author(s):

Kai-Xu Han ◽

Wei Chien ◽

Chien-Ching Chiu ◽

Yu-Ting Cheng

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Kernel Function ◽

Latent Semantic Analysis ◽

Semantic Information ◽

Semantic Analysis ◽

Probabilistic Latent Semantic Analysis ◽

Support Vector ◽

Analysis Model ◽

Fisher Kernel

At present, in the mainstream sentiment analysis methods represented by the Support Vector Machine, the vocabulary and the latent semantic information involved in the text are not well considered, and sentiment analysis of text is dependent overly on the statistics of sentiment words. Thus, a Fisher kernel function based on Probabilistic Latent Semantic Analysis is proposed in this paper for sentiment analysis by Support Vector Machine. The Fisher kernel function based on the model is derived from the Probabilistic Latent Semantic Analysis model. By means of this method, latent semantic information involving the probability characteristics can be used as the classification characteristics, along with the improvement of the effect of classification for support vector machine, and the problem of ignoring the latent semantic characteristics in text sentiment analysis can be addressed. The results show that the effect of the method proposed in this paper, compared with the comparison method, is obviously improved.

Download Full-text

Aspect based Sentiment Analysis using support vector machine classifier

2013 International Conference on Advances in Computing, Communications and Informatics (ICACCI) ◽

10.1109/icacci.2013.6637416 ◽

2013 ◽

Cited By ~ 15

Author(s):

Raisa Varghese ◽

M. Jayasree

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Support Vector Machine Classifier ◽

Support Vector

Download Full-text

Performance comparison of support vector machine (SVM) with linear kernel and polynomial kernel for multiclass sentiment analysis on twitter

ILKOM Jurnal Ilmiah ◽

10.33096/ilkom.v13i2.851.168-174 ◽

2021 ◽

Vol 13 (2) ◽

pp. 168-174

Author(s):

Rifqatul Mukarramah ◽

Dedy Atmajaya ◽

Lutfi Budi Ilmawan

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Confusion Matrix ◽

Classification Performance ◽

Performance Comparison ◽

Kernel Functions ◽

Polynomial Kernel ◽

Support Vector ◽

Linear Kernel ◽

Testing Method

Sentiment analysis is a technique to extract information of one’s perception, called sentiment, on an issue or event. This study employs sentiment analysis to classify society’s response on covid-19 virus posted at twitter into 4 polars, namely happy, sad, angry, and scared. Classification technique used is support vector machine (SVM) method which compares the classification performance figure of 2 linear kernel functions, linear and polynomial. There were 400 tweet data used where each sentiment class consists of 100 data. Using the testing method of k-fold cross validation, the result shows the accuracy value of linear kernel function is 0.28 for unigram feature and 0.36 for trigram feature. These figures are lower compared to accuracy value of kernel polynomial with 0.34 and 0.48 for unigram and trigram feature respectively. On the other hand, testing method of confusion matrix suggests the highest performance is obtained by using kernel polynomial with accuracy value of 0.51, precision of 0.43, recall of 0.45, and f-measure of 0.51.

Download Full-text

Comparison of Kernel Function on Support Vector Machine in Classification of Childbirth

Jurnal Matematika MANTIK ◽

10.15642/mantik.2019.5.2.90-99 ◽

2019 ◽

Vol 5 (2) ◽

pp. 90-99

Author(s):

Putroue Keumala Intan

Keyword(s):

Support Vector Machine ◽

Kernel Function ◽

Kernel Functions ◽

Support Vector ◽

Medical Team ◽

Classification Methods ◽

Higher Dimension ◽

Linear Kernel ◽

Good Decision

The maternal mortality rate during childbirth can be reduced through the efforts of the medical team in determining the childbirth process that must be undertaken immediately. Machine learning in terms of classifying childbirth can be a solution for the medical team in determining the childbirth process. One of the classification methods that can be used is the Support Vector Machine (SVM) method which is able to determine a hyperplane that will form a good decision boundary so that it is able to classify data appropriately. In SVM, there is a kernel function that is useful for solving non-linear classification cases by transforming data to a higher dimension. In this study, four kernel functions will be used; Linear, Radial Basis Function (RBF), Polynomial, and Sigmoid in the classification process of childbirth in order to determine the kernel function that is capable of producing the highest accuracy value. Based on research that has been done, it is obtained that the accuracy value generated by SVM with linear kernel functions is higher than the other kernel functions.

Download Full-text