A Novel Sentiment Analysis for Amazon Data with TSA based Feature Selection

Sentiment analysis of online product reviews has become a mainstream way for businesses on e-commerce platforms to promote their products and improve user satisfaction. Hence, it is necessary to construct an automatic sentiment analyser for automatic identification of sentiment polarity of the online product reviews. Traditional lexicon-based approaches used for sentiment analysis suffered from several accuracy issues while machine learning techniques require labelled training data. This paper introduces a hybrid sentiment analysis framework to bond the gap between both machine learning and lexicon-based approaches. A novel tunicate swarm algorithm (TSA) based feature reduction is integrated with the proposed hybrid method to solve the scalability issue that arises due to a large feature set. It reduces the feature set size to 43% without changing the accuracy (93%). Besides, it improves the scalability, reduces the computation time and enhances the overall performance of the proposed framework. From experimental analysis, it can be observed that TSA outperforms existing feature selection techniques such as particle swarm optimization and genetic algorithm. Moreover, the proposed approach is analysed with performance metrics such as recall, precision, F1-score, feature size and computation time.

Download Full-text

A hybrid sentiment analysis approach using black widow optimization based feature selection

Journal of Engineering Research ◽

10.36909/jer.12039 ◽

2021 ◽

Author(s):

Anand Joseph Daniel ◽

◽

M Janaki Meena ◽

Keyword(s):

Feature Selection ◽

Sentiment Analysis ◽

Performance Metrics ◽

Computation Time ◽

Online Reviews ◽

Reduction Technique ◽

Feature Reduction ◽

Analysis Approach ◽

Black Widow ◽

Feature Selection Technique

With the massive development of Internet technologies and e-commerce technology, people rely on the product reviews provided by users through web. Sentiment analysis of online reviews has become a mainstream way for businesses on e-commerce platforms to satisfy the customers. This paper proposes a novel hybrid framework with Black Widow Optimization (BWO) based feature reduction technique which combines the merits of both machine learning and lexicon-based approaches to attain better scalability and accuracy. The scalability problem arises due to noisy, irrelevant and unique features present in the extracted features from proposed approach, which can be eliminated by adopting an effective feature reduction technique. In our proposed BWO approach, without changing the accuracy (90%), the feature-set size is reduced up to 43%. The proposed feature selection technique outperforms other commonly used Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) based feature selection techniques with reduced computation time of 21 sec. Moreover, our sentiment analysis approach is analyzed using performance metrics such as precision, recall, F-measure, and computation time. Many organizations can use these online reviews to make well-informed decisions towards the users’ interests and preferences to enhance customer satisfaction, product quality and to find the aspects to improve the products, thereby to generate more profits.

Download Full-text

A machine learning-based sentiment analysis of online product reviews with a novel term weighting and feature selection approach

Information Processing & Management ◽

10.1016/j.ipm.2021.102656 ◽

2021 ◽

Vol 58 (5) ◽

pp. 102656

Author(s):

Huiliang Zhao ◽

Zhenghong Liu ◽

Xuemei Yao ◽

Qin Yang

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Sentiment Analysis ◽

Product Reviews ◽

Term Weighting ◽

Online Product Reviews ◽

Selection Approach ◽

Feature Selection Approach

Download Full-text

Sentiment Analysis on E-commerce Product using Machine Learning and Combination of TF-IDF and Backward Elimination

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.f7889.038620 ◽

2020 ◽

Vol 8 (6) ◽

pp. 2862-2867

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Feature Selection ◽

Sentiment Analysis ◽

Opinion Mining ◽

Classification Performance ◽

Support Vector ◽

Product Reviews ◽

Feature Selection Technique ◽

Backward Elimination

E-commerce is a website or mobile application platform that help people to buy products. Before purchasing the product, customer will decide to buy it or not by reading the review from previous buyer. There is a problem that there are a lot of review so it will take a long time for customer to read it all. This research will be using sentiment analysis method to classify the review data. Sentiment analysis or opinion mining is a machine learning approach to classify and analyse texts or documents about human’s sentiments, emotions, and opinions. In this research, sentiment analysis was used to classify product reviews from e-commerce websites into positive or negative classes. The results could be processed further and be used to summarize customers' opinions about a certain product without reading every single review. The goal of this research is to optimize classification performance by using feature selection technique. Terms Frequency-Inverse Document Frequency (TF-IDF) feature extraction, Backward Elimination feature selection, and five different classifiers (Naïve Bayes, Support Vector Machine, K-Nearest Neighbour, Decision Tree, Random Forest) were used in analysing the sentiment of the reviews. In this research, the dataset used are Indonesian language and classified into two classes(positive and negative). The best accuracy is achieved by using TF-IDF, Backward Elimination and Support Vector Machine (SVM) with a score of 85.97%, which increases by 7.91% if compared to the process without feature selection. Based on the results, Backward Elimination feature selection succeeded in improving all performance for all classifiers used in this research.

Download Full-text

A HYBRID SENTIMENT ANALYSIS APPROACH USING BLACK WIDOW OPTIMIZATION BASED FEATURE SELECTION

International Journal of Information Retrieval Research ◽

10.4018/ijirr.289955 ◽

2022 ◽

Vol 12 (1) ◽

pp. 0-0

Keyword(s):

Feature Selection ◽

Sentiment Analysis ◽

Computation Time ◽

Online Reviews ◽

Reduction Technique ◽

Feature Reduction ◽

Analysis Approach ◽

Feature Selection Technique ◽

Set Size ◽

Feature Selection Techniques

This paper proposes a novel hybrid framework with BWO based feature reduction technique which combines the merits of both machine learning and lexicon-based approaches to attain better scalability and accuracy. The scalability problem arises due to noisy, irrelevant and unique features present in the extracted features from proposed approach, which can be eliminated by adopting an effective feature reduction technique. In our proposed BWO approach, without changing the accuracy (90%), the feature-set size is reduced up to 43%. The proposed feature selection technique outperforms other commonly used PSO and GAbased feature selection techniques with reduced computation time of 21 sec. Moreover, our sentiment analysis approach is analysed using performance metrices such as precision, recall, F-measure, and computation time. Many organizations can use these online reviews to make well-informed decisions towards the users’ interests and preferences to enhance customer satisfaction, product quality and to find the aspects to improve the products, thereby to generate more profits.

Download Full-text

Fuzzy based Sentiment Analysis of Online Product Reviews using Machine Learning Techniques

International Journal of Computer Applications ◽

10.5120/17463-8243 ◽

2014 ◽

Vol 99 (17) ◽

pp. 9-16 ◽

Cited By ~ 3

Author(s):

Haseena RahmathP ◽

Tanvir Ahmad

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Machine Learning Techniques ◽

Product Reviews ◽

Online Product Reviews ◽

Learning Techniques

Download Full-text

Analysis of the Effect of Feature Reduction on Accuracy and Computational Time in Mushroom Dataset Classification

JELIKU (Jurnal Elektronik Ilmu Komputer Udayana) ◽

10.24843/jlk.2021.v10.i01.p15 ◽

2021 ◽

Vol 10 (1) ◽

pp. 117

Author(s):

Agus Prayogo ◽

I Gede Santi Astawa

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Computation Time ◽

Feature Reduction ◽

Computational Time ◽

Test Scenario ◽

Classification Result ◽

Distant Relationship ◽

Significant Difference ◽

Feature Values

Classification is a technique to mapping the class of a certain data from its attribute or feature values. One of things that affects the classification result is the correlation of its features to the class classification results. Research conducted to determine the effect of the reduction in features that are least correlated or have a distant relationship with the classification result class (dependent variable). Because features that do not have much correlation, have no effect on the classification results. From the research, the accuracy of the reduction of each feature per test scenario has a range between 83% -88% higher than the initial accuracy without feature selection at 82% accuracy. Meanwhile, the computation time obtained does not have a significant difference in changing compared to without feature reduction, in the range of 2.3-2.7. For the data used is the Mushroom dataset obtained from the UCI Machine Learning Repository

Download Full-text

Sentiment Analysis Of Product Reviews Using Machine Learning Algorithm

Journal of Environmental Science Computer Science and Engineering & Technology ◽

10.24214/jecet.b.7.1.04857 ◽

2017 ◽

Vol 7 (1) ◽

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Learning Algorithm ◽

Machine Learning Algorithm ◽

Product Reviews

Download Full-text

Sentiment Analysis of Movie Reviews: A Study of Machine Learning Algorithms with Various Feature Selection Methods

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v5i9.113121 ◽

2017 ◽

Vol 5 (9) ◽

Cited By ~ 1

Author(s):

Rajwinder Kaur

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Selection Methods

Download Full-text

Sentiment analysis of online product reviews using Lexical Semantic Corpus-Based technique

2021 IEEE 11th IEEE Symposium on Computer Applications & Industrial Electronics (ISCAIE) ◽

10.1109/iscaie51753.2021.9431818 ◽

2021 ◽

Author(s):

Raihah Aminuddin ◽

Aina Zuliana Zulkefli ◽

Nor Aiza Moketar ◽

Khyrina Airin Fariza Abu Samah

Keyword(s):

Sentiment Analysis ◽

Product Reviews ◽

Lexical Semantic ◽

Online Product Reviews

Download Full-text

A review: preprocessing techniques and data augmentation for sentiment analysis

Computational Social Networks ◽

10.1186/s40649-020-00080-x ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Huu-Thanh Duong ◽

Tram-Anh Nguyen-Thi

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Supervised Learning ◽

Data Augmentation ◽

Original Data ◽

Training Data ◽

Unseen Data ◽

Augmentation Techniques ◽

User Intervention

AbstractIn literature, the machine learning-based studies of sentiment analysis are usually supervised learning which must have pre-labeled datasets to be large enough in certain domains. Obviously, this task is tedious, expensive and time-consuming to build, and hard to handle unseen data. This paper has approached semi-supervised learning for Vietnamese sentiment analysis which has limited datasets. We have summarized many preprocessing techniques which were performed to clean and normalize data, negation handling, intensification handling to improve the performances. Moreover, data augmentation techniques, which generate new data from the original data to enrich training data without user intervention, have also been presented. In experiments, we have performed various aspects and obtained competitive results which may motivate the next propositions.

Download Full-text