An Enhanced Approach for Sentiment Analysis Using Association Rule Mining

Abhishek Sharma

doi:10.22214/ijraset.2021.39404

An Enhanced Approach for Sentiment Analysis Using Association Rule Mining

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.39404 ◽

2021 ◽

Vol 9 (12) ◽

pp. 913-918

Author(s):

Abhishek Sharma

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Association Rule ◽

Association Rule Mining ◽

Opinion Mining ◽

Support Vector ◽

Rule Mining ◽

The People ◽

Baseline Approach

Abstract: In today’s world social networking platforms like Facebook, YouTube, twitter etc. are a great source of communication for internet users and loaded with large number of emotions, views and opinions of the people. Sentiment analysis is the study of attitudes, emotions and opinions of the people and is also known as opinion mining. Sentiment analysis is used to find the opinion i.e. negative or positive about a particular subject. In this paper an Enhanced sentiment analysis approach is presented by using the Association rule mining i.e. Apriori and machine learning approach such as Support Vector Machine. The Enhanced approach is compared with the baseline approach, on accuracy, precision, recall, and F1-score measures. The Enhanced approach for sentiment analysis is implemented using the R programming language. The Enhanced approach shows better performance in comparison to the baseline approach. Keyword: Sentiment Analysis, Opinion Mining, Support Vector Machine, Association Rule Mining, Machine Learning

Download Full-text

Efficient time series data classification using sliding window technique based improved association rule mining with enhanced support vector machine

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.33.13890 ◽

2018 ◽

Vol 7 (3.3) ◽

pp. 218 ◽

Cited By ~ 2

Author(s):

D Senthil ◽

G Suseendran

Keyword(s):

Time Series ◽

Support Vector Machine ◽

Association Rule ◽

Association Rule Mining ◽

Time Series Data ◽

Sliding Window ◽

Series Data ◽

Support Vector ◽

Rule Mining ◽

Window Technique

Time series analysis is an important and complex problem in machine learning and statistics. In the existing system, Support Vector Machine (SVM) and Association Rule Mining (ARM) is introduced to implement the time series data. However it has issues with lower accuracy and higher time complexity. Also it has issue with optimal rules discovery and segmentation on time series data. To avoid the above mentioned issues, in the proposed research Sliding Window Technique based Improved ARM with Enhanced SVM (SWT-IARM with ESVM) is proposed. In the proposed system, the preprocessing is performed using Modified K-Means Clustering (MKMC). The indexing process is done by using R-tree which is used to provide faster results. Segmentation is performed by using SWT and it reduces the cost complexity by optimal segments. Then IARM is applied on efficient rule discovery process by generating the most frequent rules. By using ESVM classification approach, the rules are classified more accurately.

Download Full-text

Effective Classification by Integrating Support Vector Machine and Association Rule Mining

Intelligent Data Engineering and Automated Learning – IDEAL 2006 - Lecture Notes in Computer Science ◽

10.1007/11875581_110 ◽

2006 ◽

pp. 920-927 ◽

Cited By ~ 5

Author(s):

Keivan Kianmehr ◽

Reda Alhajj

Keyword(s):

Support Vector Machine ◽

Association Rule ◽

Association Rule Mining ◽

Support Vector ◽

Rule Mining

Download Full-text

Sentiment Analysis on E-commerce Product using Machine Learning and Combination of TF-IDF and Backward Elimination

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.f7889.038620 ◽

2020 ◽

Vol 8 (6) ◽

pp. 2862-2867

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Feature Selection ◽

Sentiment Analysis ◽

Opinion Mining ◽

Classification Performance ◽

Support Vector ◽

Product Reviews ◽

Feature Selection Technique ◽

Backward Elimination

E-commerce is a website or mobile application platform that help people to buy products. Before purchasing the product, customer will decide to buy it or not by reading the review from previous buyer. There is a problem that there are a lot of review so it will take a long time for customer to read it all. This research will be using sentiment analysis method to classify the review data. Sentiment analysis or opinion mining is a machine learning approach to classify and analyse texts or documents about human’s sentiments, emotions, and opinions. In this research, sentiment analysis was used to classify product reviews from e-commerce websites into positive or negative classes. The results could be processed further and be used to summarize customers' opinions about a certain product without reading every single review. The goal of this research is to optimize classification performance by using feature selection technique. Terms Frequency-Inverse Document Frequency (TF-IDF) feature extraction, Backward Elimination feature selection, and five different classifiers (Naïve Bayes, Support Vector Machine, K-Nearest Neighbour, Decision Tree, Random Forest) were used in analysing the sentiment of the reviews. In this research, the dataset used are Indonesian language and classified into two classes(positive and negative). The best accuracy is achieved by using TF-IDF, Backward Elimination and Support Vector Machine (SVM) with a score of 85.97%, which increases by 7.91% if compared to the process without feature selection. Based on the results, Backward Elimination feature selection succeeded in improving all performance for all classifiers used in this research.

Download Full-text

Understanding the Prediction of Transmembrane Proteins by Support Vector Machine using Association Rule Mining

2007 IEEE Symposium on Computational Intelligence and Bioinformatics and Computational Biology ◽

10.1109/cibcb.2007.4221252 ◽

2007 ◽

Author(s):

Hae-Jin Hu ◽

Hao Wang ◽

R. Harrison ◽

P.C. Tai ◽

Yi Pan

Keyword(s):

Support Vector Machine ◽

Association Rule ◽

Association Rule Mining ◽

Transmembrane Proteins ◽

Support Vector ◽

Rule Mining

Download Full-text

A Comparison of Machine Learning Techniques for Sentiment Analysis

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i3.999 ◽

2021 ◽

Vol 12 (3) ◽

pp. 1738-1744

Author(s):

Shahzad Qaiser Et.al

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Modern Method ◽

Machine Learning Techniques ◽

Support Vector ◽

The People ◽

Learning Techniques ◽

Social Media Platforms ◽

Single Dataset

The availability of the data has increased tremendously due to the excess usage of social media platforms like Twitter and Facebook. Due to the abundant availability of data, scientists, businesses, educationalists and other people working under different roles have started using Sentiment Analysis (SA) to get in-depth knowledge about the sentiments of the people regarding any topic of interest. There are many techniques to implement SA, and one of them is Machine Learning (ML). This study is focused on the comparison of ancient ML methods such as Naïve Bayes (NB), Decision Tree (DT), Support Vector Machine (SVM), and a modern method, i.e., Deep Learning (DL). The ML techniques are applied to a single dataset to compare their performance in terms of accuracy to understand how they perform against each other. The study found that DL performed the best with 96.41% accuracy followed by NB and SVM with 87.18% and 82.05% respectively. DT performed the poorest with 68.21% accuracy.

Download Full-text

Study of high yielding crops cultivation in India using data mining techniques

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i1.7.9589 ◽

2018 ◽

Vol 7 (1.7) ◽

pp. 121

Author(s):

M J Carmel Mary Belinda ◽

Umamaheswari R ◽

Alex David S

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Rough Set ◽

Association Rule ◽

Association Rule Mining ◽

Bayesian Belief Network ◽

Support Vector ◽

Apriori Algorithm ◽

Rule Mining ◽

Using Data

Data mining in agriculture is a modern and emerging research technique. Data mining provide many techniques like k means algorithm, support vector machine, association rule mining and Bayesian belief network [1]. This technique can be used in agriculture for various purposes. This paper describes about how association rules mining and apriori algorithm can be used in agriculture field. This paper also describes about soil, its types and crops grown in each type of soil. The technique that has been used here can be a rough set study, but like this many efficient techniques can be applied to solve many problems in agriculture.

Download Full-text

SENTIMENT ANALYSIS OF ENGLISH TWEETS USING BIGRAM COLLOCATION

EPRA International Journal of Research & Development (IJRD) ◽

10.36713/epra8524 ◽

2021 ◽

pp. 220-227

Author(s):

Sumaya Ishrat Moyeen ◽

Md. Sadiqur Rahman Mabud ◽

Zannatun Nayem ◽

Md. Al Mamun

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Maximum Entropy ◽

Opinion Mining ◽

Naive Bayes ◽

Daily Life ◽

Naïve Bayes ◽

Support Vector ◽

Huge Amount

Community and portal websites like Twitter, Facebook, Tumbler, Instagram, and LinkedIn etc. have significant impact in our day-to-day life. One of the most popular micro-blogging platforms is twitter that can provide a huge amount of data which in future can be used for various applications of opinion mining like predictions, reviews, elections, marketing etc. The users use this platform to share their views, express sentiments on various events of their daily life. Previously, many researchers have worked with twitter sentiment analysis and compared various classifiers and got the accuracy below 82%. In this work for classifying tweets into sentiments, we have used various classifiers such as Naïve Bayes, Support Vector Machine and Maximum Entropy that segregate the positive and negative tweets. Using Bigram Collocation with classifiers, we’ve acquired 88.42% accuracy. KEYWORDS: Twitter; Sentiment Classification; Machine Learning; NLTK; Python; Naïve Bayes; Support Vector Machine (SVM); Maximum Entropy

Download Full-text

Classification and Association Rule Mining Technique for Predicting Chronic Kidney Disease

Journal of Information & Knowledge Management ◽

10.1142/s0219649220400158 ◽

2020 ◽

Vol 19 (01) ◽

pp. 2040015

Author(s):

Ahmad Alaiad ◽

Hassan Najadat ◽

Belal Mohsen ◽

Khaled Balhaf

Keyword(s):

Chronic Kidney Disease ◽

Kidney Disease ◽

Association Rule ◽

Association Rule Mining ◽

Support Vector ◽

Classification Algorithms ◽

Nearest Neighbour ◽

Rule Mining ◽

Medical Field ◽

Efficient System

Background and objective: Chronic kidney disease (CKD) is one of the deadly diseases that can affect a lot of vital organs in the human body such as heart, liver, and lungs. Many individuals might be at early stage of kidney disease and not have any signs, which might lead to a sudden death. Previous research showed that early prediction of CKD is very important in the medical field for physicians’ decision-making and patients’ health and life. To this end, constructing an efficient prediction system for CKD, which is the goal of this paper, often reduces medical errors and overall healthcare cost. Methods: Classification and association rule mining techniques were integrated and utilised to construct an efficient system for predicting and diagnosing CKD and its causes using weka and SPSS as platform environments. In particular, five classification algorithms, namely, naive Bayes, decision tree, support vector machine, K-nearest neighbour, and JRip were used to achieve the research goal. In addition, Apriori algorithm was used to discover strong relationship rules between attributes. The experiments were conducted on real medical dataset collected from hospitals and patient monitoring systems. Results: The experiments achieved high accuracy of 98.50% for K-nearest neighbour (KNN) classifier and achieved 96.00% when using classier based on association rule (JRip). Conclusions: We conclude by showing that applying integrative approach by combining classification algorithms and association rule mining can significantly improve the classification accuracy and be more useful for CKD prediction. This research has also several theoretical and practical implications for the medical field and healthcare industry.

Download Full-text

Aspect Term Extraction for Aspect Based Opinion Mining

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.k2050.0981119 ◽

2019 ◽

Vol 8 (11) ◽

pp. 2228-2233

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Random Fields ◽

Opinion Mining ◽

Nearest Neighbor ◽

Conditional Random Fields ◽

International Workshop ◽

Support Vector ◽

K Nearest Neighbor ◽

Term Extraction

Opinion Mining (OM) is also called as Sentiment Analysis (SA). Aspect Based Opinion Mining (ABOM) is also called as Aspect Based Sentiment Analysis (ABSA). In this paper, three new features are proposed to extract the aspect term for Aspect Based Sentiment Analysis (ABSA). The influence of the proposed features is evaluated on five classifiers namely Decision Tree (DT), Naive Bayes (NB), K-Nearest Neighbor (KNN), Support Vector Machine (SVM) and Conditional Random Fields (CRF). The proposed features are evaluated on the Two datasets on Restaurant and Laptop domains available in International Workshop on Semantic Evaluation 2014 i.e. SemEval 2014. The influence of proposed features is evaluated using Precision, Recall and F1 measures. The proposed features are highly influencing for aspect term extraction on classifiers. The performance of SVM and CRF classifiers with proposed features is more influencing for aspect term extraction compared with NB, DT and KNN classifiers.

Download Full-text

Evaluating Annotated Dataset of Customer Reviews for Aspect Based Sentiment Analysis

Journal of Web Engineering ◽

10.13052/jwe1540-9589.2122 ◽

2021 ◽

Author(s):

Dimple Chehal ◽

Parul Gupta ◽

Payal Gulati

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Supervised Machine Learning ◽

Support Vector ◽

Product Reviews ◽

K Nearest Neighbor ◽

Customer Reviews ◽

Percent Accuracy

Sentiment analysis of product reviews on e-commerce platforms aids in determining the preferences of customers. Aspect-based sentiment analysis (ABSA) assists in identifying the contributing aspects and their corresponding polarity, thereby allowing for a more detailed analysis of the customer’s inclination toward product aspects. This analysis helps in the transition from the traditional rating-based recommendation process to an improved aspect-based process. To automate ABSA, a labelled dataset is required to train a supervised machine learning model. As the availability of such dataset is limited due to the involvement of human efforts, an annotated dataset has been provided here for performing ABSA on customer reviews of mobile phones. The dataset comprising of product reviews of Apple-iPhone11 has been manually annotated with predefined aspect categories and aspect sentiments. The dataset’s accuracy has been validated using state-of-the-art machine learning techniques such as Naïve Bayes, Support Vector Machine, Logistic Regression, Random Forest, K-Nearest Neighbor and Multi Layer Perceptron, a sequential model built with Keras API. The MLP model built through Keras Sequential API for classifying review text into aspect categories produced the most accurate result with 67.45 percent accuracy. K- nearest neighbor performed the worst with only 49.92 percent accuracy. The Support Vector Machine had the highest accuracy for classifying review text into aspect sentiments with an accuracy of 79.46 percent. The model built with Keras API had the lowest 76.30 percent accuracy. The contribution is beneficial as a benchmark dataset for ABSA of mobile phone reviews.

Download Full-text