scholarly journals Word Embedding and Feature Reduction for Sentiment Analysis Using GA

Author(s):  
Prajakta P. Shelke ◽  
Ankita N. Korde

Sentiment analysis (SA), also called as opinion mining is the technique for the removal of opinions of a specific entity or feature from reviews dataset. The opinions of other users help in decision making process of people. This paper studies different methods that are aimed at SA. These approaches vary from semantic based methods, machine learning, neural networks, syntactical methods with each having its own strength. Although hybrid approach also exists where the idea is to combine strengths of two or more methods to increase the accuracy. A framework in which sentiment analysis is done by using word embedding and feature reduction techniques is also proposed. Word embedding is a technique in which low-dimensional vector representation of words is provided. Feature reduction method is used with Support Vector Machine (SVM) classifier. The framework will perform sentiment analysis of user opinions by using a machine learning approach and provides a recommendation system for the ease of decision making for users. The proposed system in this paper has solved the scalability problem and improved the accuracy.

2020 ◽  
Author(s):  
Nalika Ulapane ◽  
Karthick Thiyagarajan ◽  
sarath kodagoda

<div>Classification has become a vital task in modern machine learning and Artificial Intelligence applications, including smart sensing. Numerous machine learning techniques are available to perform classification. Similarly, numerous practices, such as feature selection (i.e., selection of a subset of descriptor variables that optimally describe the output), are available to improve classifier performance. In this paper, we consider the case of a given supervised learning classification task that has to be performed making use of continuous-valued features. It is assumed that an optimal subset of features has already been selected. Therefore, no further feature reduction, or feature addition, is to be carried out. Then, we attempt to improve the classification performance by passing the given feature set through a transformation that produces a new feature set which we have named the “Binary Spectrum”. Via a case study example done on some Pulsed Eddy Current sensor data captured from an infrastructure monitoring task, we demonstrate how the classification accuracy of a Support Vector Machine (SVM) classifier increases through the use of this Binary Spectrum feature, indicating the feature transformation’s potential for broader usage.</div><div><br></div>


Author(s):  
Vijender Kumar Solanki ◽  
Nguyen Ha Huy Cuong ◽  
Zonghyu (Joan) Lu

The machine learning is the emerging research domain, from which number of emerging trends are available, among them opinion mining is the one technology attraction through which the we could get analysis of the interested domain or we can say about the review from the customer towards any product or we can say any upcoming trending information. These two are the emerging words and we can say it's the buzz word in the information technology. As you will see that its widely use by the corporate sector to uplift the business next level. Before two decade you will not read any words e.g., Opinion mining or Sentiment analysis, but in the last two decade these words have given a new life to information technology domain as well as to the business. The important question which runs in the mind is why use sentiment analysis or opinion mining. The information technology has given number of new programming languages, new innovation and within that the data mining has given this trends to the users. The chapter is covering the three major concept's which comes under the machine learning e.g., Decision tree, Bayesian network and Support vector machine. The chapter is describing the basic inputs, and how it helps in supporting stakeholders by adopting these technologies.


Sentiment Analysis is individuals' opinions and feedbacks study towards a substance, which can be items, services, movies, people or events. The opinions are mostly expressed as remarks or reviews. With the social network, gatherings and websites, these reviews rose as a significant factor for the client’s decision to buy anything or not. These days, a vast scalable computing environment provides us with very sophisticated way of carrying out various data-intensive natural language processing (NLP) and machine-learning tasks to examine these reviews. One such example is text classification, a compelling method for predicting the clients' sentiment. In this paper, we attempt to center our work of sentiment analysis on movie review database. We look at the sentiment expression to order the extremity of the movie reviews on a size of 0(highly disliked) to 4(highly preferred) and perform feature extraction and ranking and utilize these features to prepare our multilabel classifier to group the movie review into its right rating. This paper incorporates sentiment analysis utilizing feature-based opinion mining and managed machine learning. The principle center is to decide the extremity of reviews utilizing nouns, verbs, and adjectives as opinion words. In addition, a comparative study on different classification approaches has been performed to determine the most appropriate classifier to suit our concern problem space. In our study, we utilized six distinctive machine learning algorithms – Naïve Bayes, Logistic Regression, SVM (Support Vector Machine), RF (Random Forest) KNN (K nearest neighbors) and SoftMax Regression.


Author(s):  
Qiaoman Yang ◽  
Chunyu Liu

Classification modeling is one of the key issues in sentiment analysis. Support vector machine (SVM) has been widely used in classification as an effective machine learning method. Generally, a common SVM is only for decision-making that sacrifices the distribution of data. In practice, sentiment data are big and mazy, which results in the deficiency of accuracy and stability when common SVM is used. The study investigates sentiment analysis by applying the twin objective function SVM, including nonparallel SVM(NPSVM) and twin SVM (TWSVM). From the experiments, we concluded that twin objective function SVMs are superior to NB and single objective function SVM in accuracy and stability.


2020 ◽  
Vol 4 (1) ◽  
pp. 11-20

The increasing use of the internet enables users to share their opinion about what they like and dislike regarding products and services. For efficient decision making, there is a need to analyze these reviews. Sentiment analysis or opinion mining is commonly used to detect polarity (positive or negative) of reviews. But, it does not show the aspect or orientation of the text. In this study, state-of-art approaches based on supervised machine learning employed to perform three tasks on the dataset provided by SemEval. Tasks A and B are related to predicting the aspect of the restaurant’s reviews, whereas task C shows their polarity. Additionally, this study aims to compare the performance of two feature engineering techniques and five machine learning algorithms to evaluate their performance on a publicly available dataset named SemEval-2015 Task 12. The experimental results showed that the word2vec features when used with the support vector machine algorithm outperformed by giving 76%, 72% and 79% off overall accuracies for Task A, Task B, and Task C respectively. Our comparative study holds practical significance and can be used as a baseline study in the domain of aspect-based sentiment analysis.


2020 ◽  
Vol 8 (6) ◽  
pp. 2862-2867

E-commerce is a website or mobile application platform that help people to buy products. Before purchasing the product, customer will decide to buy it or not by reading the review from previous buyer. There is a problem that there are a lot of review so it will take a long time for customer to read it all. This research will be using sentiment analysis method to classify the review data. Sentiment analysis or opinion mining is a machine learning approach to classify and analyse texts or documents about human’s sentiments, emotions, and opinions. In this research, sentiment analysis was used to classify product reviews from e-commerce websites into positive or negative classes. The results could be processed further and be used to summarize customers' opinions about a certain product without reading every single review. The goal of this research is to optimize classification performance by using feature selection technique. Terms Frequency-Inverse Document Frequency (TF-IDF) feature extraction, Backward Elimination feature selection, and five different classifiers (Naïve Bayes, Support Vector Machine, K-Nearest Neighbour, Decision Tree, Random Forest) were used in analysing the sentiment of the reviews. In this research, the dataset used are Indonesian language and classified into two classes(positive and negative). The best accuracy is achieved by using TF-IDF, Backward Elimination and Support Vector Machine (SVM) with a score of 85.97%, which increases by 7.91% if compared to the process without feature selection. Based on the results, Backward Elimination feature selection succeeded in improving all performance for all classifiers used in this research.


Sentiment analysis is an area of natural language processing (NLP) and machine learning where the text is to be categorized into predefined classes i.e. positive and negative. As the field of internet and social media, both are increasing day by day, the product of these two nowadays is having many more feedbacks from the customer than before. Text generated through social media, blogs, post, review on any product, etc. has become the bested suited cases for consumer sentiment, providing a best-suited idea for that particular product. Features are an important source for the classification task as more the features are optimized, the more accurate are results. Therefore, this research paper proposes a hybrid feature selection which is a combination of Particle swarm optimization (PSO) and cuckoo search. Due to the subjective nature of social media reviews, hybrid feature selection technique outperforms the traditional technique. The performance factors like f-measure, recall, precision, and accuracy tested on twitter dataset using Support Vector Machine (SVM) classifier and compared with convolution neural network. Experimental results of this paper on the basis of different parameters show that the proposed work outperforms the existing work


Author(s):  
Ganesh K. Shinde

Abstract: Sentiment Analysis has improvement in online shopping platforms, scientific surveys from political polls, business intelligence, etc. In this we trying to analyse the twitter posts about Hashtag like #MakeinIndia using Machine Learning approach. By doing opinion mining in a specific area, it is possible to identify the effect of area information in sentiment analysis. We put forth a feature vector for classifying the tweets as positive, negative and neutral. After that applied machine learning algorithms namely: MaxEnt and SVM. We utilised Unigram, Bigram and Trigram Features to generate a set of features to train a linear MaxEnt and SVM classifiers. In the end we have measured the performance of classifier in terms of overall accuracy. Keywords: Sentiment analysis, support vector machine, maximum entropy, N-gram, Machine Learning


Author(s):  
Venkateswarlu Naik Midde ◽  
Vasumathi D ◽  
A.P. Siva Kumar

Introduction: Extraction of distinguishing semantic level emotions posed in multi-languages over social media is an essential task in the field of sentiment analysis or opinion mining. The extraction of emotions expressed in Dravidian or local languages combining with multi-languages over social media has become an essential challenge in the field of big data sentiment analysis. Methods: In the proposed approach, an innovative framework to recognize the sentiments of users in multi-languages or Dravidian languages text data using scientific linguistic theories has been defined. The proposed method used machine learning techniques such as naïve Bayes, support vector machine for fine-grained classification of multilingual text with help of lexicon-based features groups. Results: The results obtained by the experiments conducted on collected benchmark datasets in the proposed approach are outperformed and better in comparison with corpus-based and world level, phrase-level sentiment analysis for multilanguages text. Conclusion: Machine learning technnique SVM has outperformed for sentiment and emotion extraction.


Author(s):  
Abhishek Sharma

Abstract: In today’s world social networking platforms like Facebook, YouTube, twitter etc. are a great source of communication for internet users and loaded with large number of emotions, views and opinions of the people. Sentiment analysis is the study of attitudes, emotions and opinions of the people and is also known as opinion mining. Sentiment analysis is used to find the opinion i.e. negative or positive about a particular subject. In this paper an Enhanced sentiment analysis approach is presented by using the Association rule mining i.e. Apriori and machine learning approach such as Support Vector Machine. The Enhanced approach is compared with the baseline approach, on accuracy, precision, recall, and F1-score measures. The Enhanced approach for sentiment analysis is implemented using the R programming language. The Enhanced approach shows better performance in comparison to the baseline approach. Keyword: Sentiment Analysis, Opinion Mining, Support Vector Machine, Association Rule Mining, Machine Learning


Sign in / Sign up

Export Citation Format

Share Document