scholarly journals A Comparative Performance Study of Classification Models for Opinion Mining

Author(s):  
S Raja Rajeswari ◽  
Dr. A. John Sanjeev Kumar

Opinion mining has become a major part in today's economy. People would want to know more about a product and the customers opinion before buying it. Companies would also want to know the opinions of the customers. Therefore, analyzing the customer’s opinion is important. A new customer would consider a product as good by analyzing the opinions of other customers. The opinions are collected from various areas, which include blogs, web forums, and product review sites. Classifying these large set of opinions requires a good classifier. In view of this, a comparative study of three classification techniques - Naive Bayes classifier with Kernel Density Estimation (KDE), Support Vector Machine (SVM), Decision Tree and KNN was made. To evaluate the classifier accuracy, precision, recall and F-measure techniques are used. Experimental results show that the Naive Bayes with Kernel Density Estimation (KDE) classifier achieved higher accuracy among others.

2019 ◽  
Vol 15 (2) ◽  
pp. 247-254
Author(s):  
Heru Sukma Utama ◽  
Didi Rosiyadi ◽  
Dedi Aridarma ◽  
Bobby Suryo Prakoso

Analysis of the odd even-numbered sentiment systems in Bekasi toll using the Naïve Bayes Algorithm, is a process of understanding, extracting, and processing textual data automatically from social media. The purpose of this study was to determine the level of accuracy, recall and precision of opinion mining generated using the Naïve Bayes algorithm to provide information community sentiment towards the effectiveness of the odd system of Bekasi tiolls on social media. The research method used in this study was to do text mining in comments-comments regarding posts regarding even odd oddities on Bekasi toll on Twitter, Instagram, Youtube and Facebook. The steps taken are starting from preprocessing, transformation, datamining and evaluation, followed by information gaon feature selection, select by weight and applying NB Algorithm model. The results obtained from the study using the NB model are obtained Confusion Matrix result, namely accuracy of 79,55%, Precision of 80,51%, and Sensitivity or Recall of 80,91%. Thus this study concludes that the use of Support Vector Machine Algorithms can analyze even odd sentiments on the Bekasi toll road.


2021 ◽  
Author(s):  
Sridhar Krishnan ◽  
Shanshan Yang ◽  
Fang Zheng ◽  
Xin Luo ◽  
Suxian Cai ◽  
...  

Detection of dysphonia is useful for monitoring the progression of phonatory impairment for patients with Parkinson’s disease (PD), and also helps assess the disease severity. This paper describes the statistical pattern analysis methods to study different vocal measurements of sustained phonations. The feature dimension reduction procedure was implemented by using the sequential forward selection (SFS) and kernel principal component analysis (KPCA) methods. Four selected vocal measures were projected by the KPCA onto the bivariate feature space, in which the class-conditional feature densities can be approximated with the nonparametric kernel density estimation technique. In the vocal pattern classification experiments, Fisher’s linear discriminant analysis (FLDA) was applied to perform the linear classification of voice records for healthy control subjects and PD patients, and the maximum a posteriori (MAP) decision rule and support vector machine (SVM) with radial basis function kernels were employed for the nonlinear classification tasks. Based on the KPCA-mapped feature densities, the MAP classifier successfully distinguished 91.8% voice records, with a sensitivity rate of 0.986, a specificity rate of 0.708, and an area value of 0.94 under the receiver operating characteristic (ROC) curve. The diagnostic performance provided by the MAP classifier was superior to those of the FLDA and SVM classifiers. In addition, the classification results indicated that gender is insensitive to dysphonia detection, and the sustained phonations of PD patients with minimal functional disability are more difficult to be correctly identified.


2019 ◽  
Vol 11 (24) ◽  
pp. 6954
Author(s):  
Fuqiang Li ◽  
Shiying Zhang ◽  
Wenxuan Li ◽  
Wei Zhao ◽  
Bingkang Li ◽  
...  

In comparison with traditional point forecasting method, probability density forecasting can reflect the load fluctuation more effectively and provides more information. This paper proposes a hybrid hourly power load forecasting model, which integrates K-means clustering algorithm, Salp Swarm Algorithm (SSA), Least Square Support Vector Machine (LSSVM), and kernel density estimation (KDE) method. Firstly, the loads at 24 times a day are grouped into three categories according to the K-means clustering algorithm, which correspond to the valley period, flat period, and peak period of the load, respectively. Secondly, the load point forecasting value is obtained by LSSVM method optimized by SSA algorithm. Furthermore, the kernel density estimation method is employed to fit the forecasting error of SSA-LSSVM in different time periods, and the probability density function of the error distribution is obtained. The final load probability density forecasting result is obtained by combining the point forecasting value and the error fitting result, and then the upper and lower limits of the confidence interval under the given confidence level are solved. In this paper, the performance of the model is evaluated by two indicators named interval coverage and interval average width. Meanwhile, in comparison with several other models, it can be concluded that the proposed model can effectively improve the forecasting effect.


Sign in / Sign up

Export Citation Format

Share Document