A Machine Learning-Based Lexicon Approach for Sentiment Analysis

2020 ◽  
Vol 16 (2) ◽  
pp. 8-22
Author(s):  
Tirath Prasad Sahu ◽  
Sarang Khandekar

Sentiment analysis can be a very useful aspect for the extraction of useful information from text documents. The main idea for sentiment analysis is how people think for a particular online review, i.e. product reviews, movie reviews, etc. Sentiment analysis is the process where these reviews are classified as positive or negative. The web is enriched with huge amount of reviews which can be analyzed to make it meaningful. This article presents the use of lexicon resources for sentiment analysis of different publicly available reviews. First, the polarity shift of reviews is handled by negations. Intensifiers, punctuation and acronyms are also taken into consideration during the processing phase. Second, words are extracted which have some opinion; these words are then used for computing score. Third, machine learning algorithms are applied and the experimental results show that the proposed model is effective in identifying the sentiments of reviews and opinions.

The process of discovering and analyzing the customer feedback using Natural Language Processing (NLP) is said to be sentiment analysis. Based on the surge over the concept of rating level in sentiment analysis, sentiment is utilized as an attribute for certain aspects or features that get expressed and more attention are provided to the problem of detecting the customer reviews. Despite the wide use and popularity of some methods, a better technique for identifying the polarity of a text data is hard to find. Machine learning has recently attracted attention as an approach for sentiment analysis. This work extends the idea of evaluating the performance of various Machine Learning (ML) classifiers namely logistic regression, Naive Bayes, Support Vector Machine (SVM) and Neural Network (NN).To show their effectiveness in sentiment mining of customer product reviews, the customer feedback has been collected from Grocery and Gourmet Food. Nearly 90 thousands customers feedback reviews of various product related categories namely Product ID, rating, review test, review time reviewer ID and reviewer name are used in this analysis. The performance of the classifiers is measured in terms of accuracy, specificity and sensitivity. From the experimental results, the better machine learning classification algorithm is proposed for sentiment mining using online shopping customer review data.


2019 ◽  
Vol 32 (1) ◽  
pp. 203 ◽  
Author(s):  
Hayder Mahmood Salman

The huge amount of documents in the internet led to the rapid need of text classification (TC). TC is used to organize these text documents. In this research paper, a new model is based on Extreme Machine learning (EML) is used. The proposed model consists of many phases including: preprocessing, feature extraction, Multiple Linear Regression (MLR) and ELM. The basic idea of the proposed model is built upon the calculation of feature weights by using MLR. These feature weights with the extracted features introduced as an input to the ELM that produced weighted Extreme Learning Machine (WELM). The results showed   a great competence of the proposed WELM compared to the ELM. 


2020 ◽  
Vol 17 (9) ◽  
pp. 4535-4542
Author(s):  
Ramneet ◽  
Deepali Gupta ◽  
Mani Madhukar

For the past few years, sentiment analysis has been growing rapidly and with the abundance of computation power and plethora of machine learning algorithms, sentiment analysis has found numerous applications and acceptance as research area in machine learning. This paper covers analysis of sentiment analysis dealing with different aspects of its applications such as customer reviews, product reviews, film reviews, emotion detection, market research or many more such areas. To conduct sentiment analysis, data is extracted from various social media platforms like Twitter, Facebook etc. The data available on these social media platforms is primarily unstructured, therefore to analyze this data it must be pre-processed, feature vector identified and further implementation of models to trained and tested on different algorithms. There are several algorithms such as SVM, Naïve Bayes, K-means, KNN, decision tree, random forest and other algorithms, which are used to evaluate and hybrid to improve the efficiency and accuracy of the model.


2020 ◽  
Vol 38 (3) ◽  
pp. 633-657
Author(s):  
Ammara Zamir ◽  
Hikmat Ullah Khan ◽  
Waqar Mehmood ◽  
Tassawar Iqbal ◽  
Abubakker Usman Akram

Purpose This research study proposes a feature-centric spam email detection model (FSEDM) based on content, sentiment, semantic, user and spam-lexicon features set. The purpose of this study is to exploit the role of sentiment features along with other proposed features to evaluate the classification accuracy of machine learning algorithms for spam email detection. Design/methodology/approach Existing studies primarily exploits content-based feature engineering approach; however, a limited number of features is considered. In this regard, this research study proposed a feature-centric framework (FSEDM) based on existing and novel features of email data set, which are extracted after pre-processing. Afterwards, diverse supervised learning techniques are applied on the proposed features in conjunction with feature selection techniques such as information gain, gain ratio and Relief-F to rank most prominent features and classify the emails into spam or ham (not spam). Findings Analysis and experimental results indicated that the proposed model with sentiment analysis is competitive approach for spam email detection. Using the proposed model, deep neural network applied with sentiment features outperformed other classifiers in terms of classification accuracy up to 97.2%. Originality/value This research is novel in this regard that no previous research focuses on sentiment analysis in conjunction with other email features for detection of spam emails.


Sign in / Sign up

Export Citation Format

Share Document