An Improved Intelligent Approach to Enhance the Sentiment Classifier for Knowledge Discovery Using Machine Learning

Author(s):  
Midde Venkateswarlu Naik ◽  
D. Vasumathi ◽  
A.P. Siva Kumar

Aims: The proposed research work is on an evolutionary enhanced method for sentiment or emotion classification on unstructured review text in the big data field. The sentiment analysis plays a vital role for current generation of people for extracting valid decision points about any aspect such as movie ratings, education institute or politics ratings, etc. The proposed hybrid approach combined the optimal feature selection using Particle Swarm Optimization (PSO) and sentiment classification through Support Vector Machine (SVM). The current approach performance is evaluated with statistical measures, such as precision, recall, sensitivity, specificity, and was compared with the existing approaches. The earlier authors have achieved an accuracy of sentiment classifier in the English text up to 94% as of now. In the proposed scheme, an average accuracy of sentiment classifier on distinguishing datasets outperformed as 99% by tuning various parameters of SVM, such as constant c value and kernel gamma value in association with PSO optimization technique. The proposed method utilized three datasets, such as airline sentiment data, weather, and global warming datasets, that are publically available. The current experiment produced results that are trained and tested based on 10- Fold Cross-Validations (FCV) and confusion matrix for predicting sentiment classifier accuracy. Background: The sentiment analysis plays a vital role for current generation people for extracting valid decisions about any aspect such as movie rating, education institute or even politics ratings, etc. Sentiment Analysis (SA) or opinion mining has become fascinated scientifically as a research domain for the present environment. The key area is sentiment classification on semi-structured or unstructured data in distinguish languages, which has become a major research aspect. User-Generated Content [UGC] from distinguishing sources has been hiked significantly with rapid growth in a web environment. The huge user-generated data over social media provides substantial value for discovering hidden knowledge or correlations, patterns, and trends or sentiment extraction about any specific entity. SA is a computational analysis to determine the actual opinion of an entity which is expressed in terms of text. SA is also called as computation of emotional polarity expressed over social media as natural text in miscellaneous languages. Usually, the automatic superlative sentiment classifier model depends on feature selection and classification algorithms. Methods: The proposed work used Support vector machine as classification technique and particle swarm optimization technique as feature selection purpose. In this methodology, we tune various permutations and combination parameters in order to obtain expected desired results with kernel and without kernel technique for sentiment classification on three datasets, including airline, global warming, weather sentiment datasets, that are freely hosted for research practices. Results: In the proposed scheme, The proposed method has outperformed with 99.2% of average accuracy to classify the sentiment on different datasets, among other machine learning techniques. The attained high accuracy in classifying sentiment or opinion about review text proves superior effectiveness over existing sentiment classifiers. The current experiment produced results that are trained and tested based on 10- Fold Cross-Validations (FCV) and confusion matrix for predicting sentiment classifier accuracy. Conclusion: The objective of the research issue sentiment classifier accuracy has been hiked with the help of Kernel-based Support Vector Machine (SVM) based on parameter optimization. The optimal feature selection to classify sentiment or opinion towards review documents has been determined with the help of a particle swarm optimization approach. The proposed method utilized three datasets to simulate the results, such as airline sentiment data, weather sentiment data, and global warming data that are freely available datasets.

2020 ◽  
Vol 8 (2) ◽  
pp. 91-100
Author(s):  
Muhamad Azhar ◽  
Noor Hafidz ◽  
Biktra Rudianto ◽  
Windu Gata

Abstract   Technology implementation in the marketplace world has attracted the attention of researchers to analyze the reviews from customers. The Klik Indomaret application page on GooglePlay is one application that can be used to get information on review data collection. However, getting information on consumer’s opinion or review is not an easy task and need a specific method in categorizing or grouping these reviews into certain groups, i.e. positive or negative reviews. The sentiment analysis study of a review application in GooglePlay is still rare. Therefore, this paper analysis the customer’s sentiment from klikindomaret app using Naive Bayes Classifier (NB) algorithm that is compared to Support Vector Machine (SVM) as well as optimizing the Feature Selection (FS) using the Particle Swarm Optimization method. The results for NB without using FS optimization were 69.74% for accuracy and 0.518 for Area Under Curve (AUC) and for SVM without using FS optimization were 81.21% for accuracy and 0.896 for AUC. While the results of cross-validation NB with FS are 75.21% for accuracy and 0.598 for AUC and cross-validation of SVM with FS is 81.84% for accuracy and 0.898 for AUC, while there is an increase when using the Feature Selection (FS) Particle Swarm Optimization and also the modeling algorithm SVM has a higher value compared to NB for the dataset used in this study.   Keywords: Naive Bayes, Particle Swarm Optimization, Support Vector Machine, Feature Selection, Consumer Review.


2020 ◽  
Vol 2 (3) ◽  
pp. 169-178
Author(s):  
Zulia Imami Alfianti ◽  
Deni Gunawan ◽  
Ahmad Fikri Amin

Sentiment analysis is an area of ​​approach that solves problems by using reviews from various relevant scientific perspectives. Reading a review before buying a product is very important to know the advantages and disadvantages of the products we will use, besides reading a cosmetic review can find out the quality of the cosmetic brand is feasible or not be used. Before consumers decide to buy cosmetics, consumers should know in detail the products to be purchased, this can be learned from the testimonials or the results of reviews from consumers who have bought and used the previous product. The number of reviews is certainly very much making consumers reluctant to read reviews. Eventually, the reviews become useless. For this reason, the authors classify based on positive and negative classes, so consumers can find product comparisons quickly and precisely. The implementation of Particle Swarm Optimization (PSO) optimization can improve the accuracy of the Support Vector Machine (SVM) and Naïve Bayes (NB) algorithm can improve accuracy and provide solutions to the review classification problem to be more accurate and optimal. Comparison of accuracy resulting from testing this data is an SVM algorithm of 89.20% and AUC of 0.973, then compared to SVM based on PSO with an accuracy of 94.60% and AUC of 0.985. The results of testing the data for the NB algorithm are 88.50% accuracy and AUC is 0.536, then the accuracy is compared with the PSO based NB for 0.692. In these calculations prove that the application of PSO optimization can improve accuracy and provide more accurate and optimal solutions


2021 ◽  
Vol 4 (2) ◽  
pp. 232-239
Author(s):  
Retno Sari ◽  
Ratih Yulia Hayuningtyas

Sentiment analysis is used to analyze reviews of a place or item from an application or website that then classified the review into positive reviews or negative reviews. reviews from users are considered very important because it contains information that can make it easier for new users who want to choose the right digital payment. Reviews about digital payment ovo are so much that it is difficult for prospective users of ovo digital payment applications to draw conclusions about ovo digital payment information. For this reason, a classification method is needed in this study using support vector machine and PSO methods. In this study, we used 400 data that were reduced to 200 positive reviews and 200 negative reviews. The accuracy obtained by using the support vector machine method of 76.50% is in the fair classification, while the accuracy obtained by using the support vector machine and Particle Swarm Optimization (PSO) method is 82.75% which is in good classification.


2019 ◽  
Vol 4 (1) ◽  
Author(s):  
Amos O Bajeh ◽  
Bukola O Funso ◽  
Fatima E Usman-Hamza

One of the key task in data mining is the selection of relevant features from datasets with high dimensionality. This is expected to reduce the time and space complexity, and consequently improve the performance of data mining algorithms for tasks such as classification. This study presents an empirical study of the effect of particle swarm optimization as a feature selection technique on the performance of classification algorithms. Two dataset from different domains were used: SMS spam detection and sentiment analysis datasets. Particle swarm optimization is applied on the datasets for feature selection. Both the reduced and raw dataset are separately classified using C4.5 decision tree, k-nearest neighbour and support vector machine. The result of the analysis showed that the improvement of classifier performance is case-dependent; some significant improvements are noticed in the sentiment analysis datasets and not in the SMS spam dataset. Although some marginal effect are observed on performance, it implies that with particle swarm optimization features selection the space complexity is reduced while maintaining the accuracy of the classifiers. Keywords—classification, feature selection, machine learning, particle swarm optimization, text mining   


2020 ◽  
Vol 16 (1) ◽  
pp. 111-116
Author(s):  
Dedi Aridarma ◽  
Rifki Sadikin ◽  
Bobby Suryo Prakoso ◽  
Heru Sukma Utama

Religious lectures are activities that are identical to the religious presentation, delivered verbally by a person who has religious knowledge and then delivered to the community with the aim of the knowledge delivered can be understood. Ustadz Abdul Somad was one of the preachers who had been known to various levels of society, but his lectures were not all acceptable to the people who liked or disliked those who came from various positive and negative comments on social media. To solve these problems, Sentiment Analysis was used by applying the Support Vector Machine Algorithm method. The purpose of this study is to compile using the selection of feature Particle Swarm Optimization and Information Gain. The results for Particle Swarm Optimization Selection Feature resulted in Accuracy of 80.57%, Precision of 85.45%, and Recall of 79.52%, Selection Feature Information Gain resulted in Accuracy of 79.78%, Precision of 78.47%, and Recall of 78, 43%, Based on the results of this study, it can be concluded that using the Particle Swarm Optimization selection feature is better at the level of accuracy when compared to using the Information Gain selection feature.


Sign in / Sign up

Export Citation Format

Share Document