Selection of optimal hyper-parameter values of support vector machine for sentiment analysis tasks using nature-inspired optimization methods

Sentiment analysis and classification task is used in recommender systems to analyze movie reviews, tweets, Facebook posts, online product reviews, blogs, discussion forums, and online comments in social networks. Usually, the classification is performed using supervised machine learning methods such as support vector machine (SVM) classifier, which have many distinct parameters. The selection of the values for these parameters can greatly influence the classification accuracy and can be addressed as an optimization problem. Here we analyze the use of three heuristics, nature-inspired optimization techniques, cuckoo search optimization (CSO), ant lion optimizer (ALO), and polar bear optimization (PBO), for parameter tuning of SVM models using various kernel functions. We validate our approach for the sentiment classification task of Twitter dataset. The results are compared using classification accuracy metric and the Nemenyi test.

Download Full-text

Feature Extraction and Classification of MRI Using Hybrid RBF Kernel and SVM

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit2176104 ◽

2021 ◽

pp. 418-426

Author(s):

Suhas S ◽

Dr. C. R. Venugopal

Keyword(s):

Support Vector Machine ◽

Image Retrieval ◽

Classification Accuracy ◽

Kernel Functions ◽

Polynomial Kernel ◽

Support Vector ◽

Svm Classifier ◽

Mr Images ◽

Rbf Kernel

An enhanced classification system for classification of MR images using association of kernels with support vector machine is developed and presented in this paper along with the design and development of content-based image retrieval (CBIR) system. Content of image retrieval is the process of finding relevant image from large collection of image database using visual queries. Medical images have led to growth in large image collection. Oriented Rician Noise Reduction Anisotropic Diffusion filter is used for image denoising. A modified hybrid Otsu algorithm termed is used for image segmentation. The texture features are extracted using GLCM method. Genetic algorithm with Joint entropy is adopted for feature selection. The classification is done by support vector machine along with various kernels and the performance is validated. A classification accuracy of 98.83% is obtained using SVM with GRBF kernel. Various features have been extracted and these features are used to classify MR images into five different categories. Performance of the MC-SVM classifier is compared with different kernel functions. From the analysis and performance measures like classification accuracy, it is inferred that the brain and spinal cord MRI classification is best done using MC- SVM with Gaussian RBF kernel function than linear and polynomial kernel functions. The proposed system can provide best classification performance with high accuracy and low error rate.

Download Full-text

The effect of gamma value on support vector machine performance with different kernels

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v10i5.pp5497-5506 ◽

2020 ◽

Vol 10 (5) ◽

pp. 5497

Author(s):

Intisar Shadeed Al-Mejibli ◽

Jwan K. Alwan ◽

Dhafar Hamed Abd

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Learning Algorithm ◽

Kernel Functions ◽

Supervised Machine Learning ◽

Support Vector ◽

Svm Classifier ◽

Machine Performance ◽

Rbf Kernel ◽

The Impact

Currently, the support vector machine (SVM) regarded as one of supervised machine learning algorithm that provides analysis of data for classification and regression. This technique is implemented in many fields such as bioinformatics, face recognition, text and hypertext categorization, generalized predictive control and many other different areas. The performance of SVM is affected by some parameters, which are used in the training phase, and the settings of parameters can have a profound impact on the resulting engine’s implementation. This paper investigated the SVM performance based on value of gamma parameter with used kernels. It studied the impact of gamma value on (SVM) efficiency classifier using different kernels on various datasets descriptions. SVM classifier has been implemented by using Python. The kernel functions that have been investigated are polynomials, radial based function (RBF) and sigmoid. UC irvine machine learning repository is the source of all the used datasets. Generally, the results show uneven effect on the classification accuracy of three kernels on used datasets. The changing of the gamma value taking on consideration the used dataset influences polynomial and sigmoid kernels. While the performance of RBF kernel function is more stable with different values of gamma as its accuracy is slightly changed.

Download Full-text

Feature Selection Method Based on Mutual Information and Support Vector Machine

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s021800142150021x ◽

2021 ◽

pp. 2150021

Author(s):

Gang Liu ◽

Chunlei Yang ◽

Sen Liu ◽

Chunbao Xiao ◽

Bin Song

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

Mutual Information ◽

Classification Accuracy ◽

Feature Selection Method ◽

Selection Method ◽

Support Vector ◽

Svm Classifier ◽

Standard Data ◽

Feature Dimension

A feature selection method based on mutual information and support vector machine (SVM) is proposed in order to eliminate redundant feature and improve classification accuracy. First, local correlation between features and overall correlation is calculated by mutual information. The correlation reflects the information inclusion relationship between features, so the features are evaluated and redundant features are eliminated with analyzing the correlation. Subsequently, the concept of mean impact value (MIV) is defined and the influence degree of input variables on output variables for SVM network based on MIV is calculated. The importance weights of the features described with MIV are sorted by descending order. Finally, the SVM classifier is used to implement feature selection according to the classification accuracy of feature combination which takes MIV order of feature as a reference. The simulation experiments are carried out with three standard data sets of UCI, and the results show that this method can not only effectively reduce the feature dimension and high classification accuracy, but also ensure good robustness.

Download Full-text

Evaluating Annotated Dataset of Customer Reviews for Aspect Based Sentiment Analysis

Journal of Web Engineering ◽

10.13052/jwe1540-9589.2122 ◽

2021 ◽

Author(s):

Dimple Chehal ◽

Parul Gupta ◽

Payal Gulati

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Supervised Machine Learning ◽

Support Vector ◽

Product Reviews ◽

K Nearest Neighbor ◽

Customer Reviews ◽

Percent Accuracy

Sentiment analysis of product reviews on e-commerce platforms aids in determining the preferences of customers. Aspect-based sentiment analysis (ABSA) assists in identifying the contributing aspects and their corresponding polarity, thereby allowing for a more detailed analysis of the customer’s inclination toward product aspects. This analysis helps in the transition from the traditional rating-based recommendation process to an improved aspect-based process. To automate ABSA, a labelled dataset is required to train a supervised machine learning model. As the availability of such dataset is limited due to the involvement of human efforts, an annotated dataset has been provided here for performing ABSA on customer reviews of mobile phones. The dataset comprising of product reviews of Apple-iPhone11 has been manually annotated with predefined aspect categories and aspect sentiments. The dataset’s accuracy has been validated using state-of-the-art machine learning techniques such as Naïve Bayes, Support Vector Machine, Logistic Regression, Random Forest, K-Nearest Neighbor and Multi Layer Perceptron, a sequential model built with Keras API. The MLP model built through Keras Sequential API for classifying review text into aspect categories produced the most accurate result with 67.45 percent accuracy. K- nearest neighbor performed the worst with only 49.92 percent accuracy. The Support Vector Machine had the highest accuracy for classifying review text into aspect sentiments with an accuracy of 79.46 percent. The model built with Keras API had the lowest 76.30 percent accuracy. The contribution is beneficial as a benchmark dataset for ABSA of mobile phone reviews.

Download Full-text

Implementation of n-gram Methodology for Rotten Tomatoes Review Dataset Sentiment Analysis

International Journal of Knowledge Discovery in Bioinformatics ◽

10.4018/ijkdb.2017010103 ◽

2017 ◽

Vol 7 (1) ◽

pp. 30-41 ◽

Cited By ~ 12

Author(s):

Prayag Tiwari ◽

Brojo Kishore Mishra ◽

Sachin Kumar ◽

Vivek Kumar

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Maximum Entropy ◽

Learning Strategies ◽

Supervised Machine Learning ◽

Support Vector ◽

N Gram ◽

F Measure ◽

Blog Posts

Sentiment Analysis intends to get the basic perspective of the content, which may be anything that holds a subjective supposition, for example, an online audit, Comments on Blog posts, film rating and so forth. These surveys and websites might be characterized into various extremity gatherings, for example, negative, positive, and unbiased keeping in mind the end goal to concentrate data from the info dataset. Supervised machine learning strategies group these reviews. In this paper, three distinctive machine learning calculations, for example, Support Vector Machine (SVM), Maximum Entropy (ME) and Naive Bayes (NB), have been considered for the arrangement of human conclusions. The exactness of various strategies is basically inspected keeping in mind the end goal to get to their execution on the premise of parameters, e.g. accuracy, review, f-measure, and precision.

Download Full-text

Eye-Tracking Analysis for Emotion Recognition

Computational Intelligence and Neuroscience ◽

10.1155/2020/2909267 ◽

2020 ◽

Vol 2020 ◽

pp. 1-13

Author(s):

Paweł Tarnowski ◽

Marcin Kołodziej ◽

Andrzej Majkowski ◽

Remigiusz Jan Rak

Keyword(s):

Support Vector Machine ◽

Eye Movements ◽

Eye Tracking ◽

Emotion Recognition ◽

Classification Accuracy ◽

Pupil Diameter ◽

Support Vector ◽

Svm Classifier ◽

High Arousal ◽

Validation Method

This article reports the results of the study related to emotion recognition by using eye-tracking. Emotions were evoked by presenting a dynamic movie material in the form of 21 video fragments. Eye-tracking signals recorded from 30 participants were used to calculate 18 features associated with eye movements (fixations and saccades) and pupil diameter. To ensure that the features were related to emotions, we investigated the influence of luminance and the dynamics of the presented movies. Three classes of emotions were considered: high arousal and low valence, low arousal and moderate valence, and high arousal and high valence. A maximum of 80% classification accuracy was obtained using the support vector machine (SVM) classifier and leave-one-subject-out validation method.

Download Full-text

Aspect Based Sentiment Analysis of Nepali Text Using Support Vector Machine and Naive Bayes

Technical Journal ◽

10.3126/tj.v2i1.32824 ◽

2020 ◽

Vol 2 (1) ◽

pp. 22-29

Author(s):

Sujan Tamrakar ◽

Bal Krishna Bal ◽

Rajendra Bahadur Thapa

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Svm Classifier ◽

Pos Tagging ◽

Part Of Speech ◽

Document Frequency ◽

Classifier Algorithms

Aspect-based Sentiment Analysis assists in understanding the opinion of the associated entities helping for a better quality of a service or a product. A model is developed to detect the aspect-based sentiment in Nepali text using Machine Learning (ML) classifier algorithms namely Support Vector Machine (SVM) and Naïve Bayes (NB). The system collects Nepali text data from various websites and Part of Speech (POS) tagging is applied to extract the desired features of aspect and sentiment. Manual labeling is done for each sentence to identify the sentiment of the sentence. Term Frequency – Inverse Document Frequency (TF-IDF) is applied to compute the importance of the words. The feature vectors thus produced are then applied to the Classifier algorithms to predict and classify the sentence. The accuracy obtained by the SVM classifier is 76.8% whereas Bernoulli NB is 77.5%.

Download Full-text

Aplikasi Mobile untuk Analisis Sentimen pada Google Play

IJCCS (Indonesian Journal of Computing and Cybernetics Systems) ◽

10.22146/ijccs.6640 ◽

2015 ◽

Vol 9 (1) ◽

pp. 53

Author(s):

Lutfi Budi Ilmawan ◽

Edi Winarko

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Mobile Device ◽

Naive Bayes ◽

Limited Resource ◽

Naïve Bayes ◽

Support Vector ◽

Svm Classifier ◽

Client Server ◽

Google Play

AbstrakGoogle dalam application store-nya, Google Play, saat ini telah menyediakan sekitar 1.200.000 aplikasi mobile. Dengan sejumlah aplikasi tersebut membuat pengguna memiliki banyak pilihan. Selain itu, pengembang aplikasi mengalami kesulitan dalam mencari tahu bagaimana meningkatkan kinerja aplikasinya. Dengan adanya permasalahan tersebut, maka dibutuhkan sebuah aplikasi analisis sentimen yang dapat mengolah sejumlah komentar untuk memperoleh informasi.Sistem yang dibangun memiliki tujuan untuk menentukan polaritas sentimen dari ulasan tekstual aplikasi pada Google Play yang dilakukan dari perangkat mobile. Perangkat mobile memiliki portabilitas yang tinggi dan sebagian dari perangkat tersebut memiliki resource yang terbatas. Hal tersebut diatasi dengan menggunakan arsitektur sistem berbasis client server, di mana server melakukan tugas-tugas yang berat sementara client-nya adalah perangkat mobile yang hanya mengerjakan tugas yang ringan. Dengan solusi tersebut maka Analisis sentimen dapat diaplikasikan pada mobile environment.Adapun metode klasifikasi yang digunakan adalah Naïve Bayes untuk aplikasi yang dikembangkan dan Support Vector Machine Linier sebagai pembanding. Nilai akurasi dari Naïve Bayes classifier dari aplikasi yang dibangun sebesar 83,87% lebih rendah jika dibandingkan dengan nilai akurasi dari SVM Linier classifier sebesar 89,49%. Adapun penggunaan semantic handling untuk mengatasi sinonim kata dapat mengurangi akurasi classifier. Kata kunci— analisis sentimen, google play, klasifikasi, naïve bayes, support vector machine AbstractGoogle's Google Play now providing approximately 1.200.000 mobile applications. With these number of applications, it makes the users have many options. In addition, application developers have difficulties in figuring out how to improve their application performance. Because of these problems, it is necessary to make a sentiment analysis applications that can process review comments to get valuable information.The purpose of this system is determining the polarity of sentiments from applications’s textual reviews on Google Play that can be performed on mobile devices. The mobile device has high portability and the majority of these devices have limited resource. That problem can be solved by using a client server based system architecture, where the server performs training and classification tasks while clients is a mobile device that perform some of sentiment analysis task. With this solution, the sentiment analysis can be applied to the mobile environment.The classification method that used are Naive Bayes for developed application and Linear Support Vector Machine that is used for comparing. Naïve Bayes classifier’s accuracy is 83.87%. The result is lower than the accuracy value of Linear SVM classifier that reach 89.49%. The use of semantic handling can reduce the accuracy of the classifier. Keywords—sentiment analysis, google play, classification, naïve bayes, support vector machine

Download Full-text

Kernel Parameter Selection for SVM Classification

Strategic Pervasive Computing Applications ◽

10.4018/978-1-61520-753-4.ch002 ◽

2011 ◽

pp. 44-55

Author(s):

Manju Bala ◽

R. K. Agrawal

Keyword(s):

Support Vector Machine ◽

Kernel Function ◽

Classification Accuracy ◽

Parameter Selection ◽

Kernel Functions ◽

Gaussian Kernel ◽

Support Vector ◽

Svm Classification ◽

Kernel Parameter ◽

Benchmark Datasets

The choice of kernel function and its parameter is very important for better performance of support vector machine. In this chapter, the authors proposed few new kernel functions which satisfy the Mercer’s conditions and a robust algorithm to automatically determine the suitable kernel function and its parameters based on AdaBoost to improve the performance of support vector machine. The performance of proposed algorithm is evaluated on several benchmark datasets from UCI repository. The experimental results for different datasets show that the Gaussian kernel is not always the best choice to achieve high generalization of support vector machine classifier. However, with the proper choice of kernel function and its parameters using proposed algorithm, it is possible to achieve maximum classification accuracy for all datasets.

Download Full-text

Spectral-Spatial Classification of Hyperspectral Image Based on Support Vector Machine

International Journal of Information Technology and Web Engineering ◽

10.4018/ijitwe.2021010103 ◽

2021 ◽

Vol 16 (1) ◽

pp. 56-74

Author(s):

Weiwei Yang ◽

Haifeng Song

Keyword(s):

Support Vector Machine ◽

Classification Accuracy ◽

Spatial Information ◽

Hyperspectral Image ◽

State Of The Art ◽

Support Vector ◽

Svm Classifier ◽

Spatial Classification ◽

Homogeneous Regions

Recent research has shown that integration of spatial information has emerged as a powerful tool in improving the classification accuracy of hyperspectral image (HSI). However, partitioning homogeneous regions of the HSI remains a challenging task. This paper proposes a novel spectral-spatial classification method inspired by the support vector machine (SVM). The model consists of spectral-spatial feature extraction channel (SSC) and SVM classifier. SSC is mainly used to extract spatial-spectral features of HSI. SVM is mainly used to classify the extracted features. The model can automatically extract the features of HSI and classify them. Experiments are conducted on benchmark HSI dataset (Indian Pines). It is found that the proposed method yields more accurate classification results compared to the state-of-the-art techniques.

Download Full-text