Modified framework for sarcasm detection and classification in sentiment analysis

Aims: The proposed research work is on an evolutionary enhanced method for sentiment or emotion classification on unstructured review text in the big data field. The sentiment analysis plays a vital role for current generation of people for extracting valid decision points about any aspect such as movie ratings, education institute or politics ratings, etc. The proposed hybrid approach combined the optimal feature selection using Particle Swarm Optimization (PSO) and sentiment classification through Support Vector Machine (SVM). The current approach performance is evaluated with statistical measures, such as precision, recall, sensitivity, specificity, and was compared with the existing approaches. The earlier authors have achieved an accuracy of sentiment classifier in the English text up to 94% as of now. In the proposed scheme, an average accuracy of sentiment classifier on distinguishing datasets outperformed as 99% by tuning various parameters of SVM, such as constant c value and kernel gamma value in association with PSO optimization technique. The proposed method utilized three datasets, such as airline sentiment data, weather, and global warming datasets, that are publically available. The current experiment produced results that are trained and tested based on 10- Fold Cross-Validations (FCV) and confusion matrix for predicting sentiment classifier accuracy. Background: The sentiment analysis plays a vital role for current generation people for extracting valid decisions about any aspect such as movie rating, education institute or even politics ratings, etc. Sentiment Analysis (SA) or opinion mining has become fascinated scientifically as a research domain for the present environment. The key area is sentiment classification on semi-structured or unstructured data in distinguish languages, which has become a major research aspect. User-Generated Content [UGC] from distinguishing sources has been hiked significantly with rapid growth in a web environment. The huge user-generated data over social media provides substantial value for discovering hidden knowledge or correlations, patterns, and trends or sentiment extraction about any specific entity. SA is a computational analysis to determine the actual opinion of an entity which is expressed in terms of text. SA is also called as computation of emotional polarity expressed over social media as natural text in miscellaneous languages. Usually, the automatic superlative sentiment classifier model depends on feature selection and classification algorithms. Methods: The proposed work used Support vector machine as classification technique and particle swarm optimization technique as feature selection purpose. In this methodology, we tune various permutations and combination parameters in order to obtain expected desired results with kernel and without kernel technique for sentiment classification on three datasets, including airline, global warming, weather sentiment datasets, that are freely hosted for research practices. Results: In the proposed scheme, The proposed method has outperformed with 99.2% of average accuracy to classify the sentiment on different datasets, among other machine learning techniques. The attained high accuracy in classifying sentiment or opinion about review text proves superior effectiveness over existing sentiment classifiers. The current experiment produced results that are trained and tested based on 10- Fold Cross-Validations (FCV) and confusion matrix for predicting sentiment classifier accuracy. Conclusion: The objective of the research issue sentiment classifier accuracy has been hiked with the help of Kernel-based Support Vector Machine (SVM) based on parameter optimization. The optimal feature selection to classify sentiment or opinion towards review documents has been determined with the help of a particle swarm optimization approach. The proposed method utilized three datasets to simulate the results, such as airline sentiment data, weather sentiment data, and global warming data that are freely available datasets.

Download Full-text

Sentiment Classification

Advances in Linguistics and Communication Studies - Modern Computational Models of Semantic Discovery in Natural Language ◽

10.4018/978-1-4666-8690-8.ch001 ◽

2015 ◽

pp. 1-26

Author(s):

Jalel Akaichi

Keyword(s):

Support Vector Machine ◽

Text Mining ◽

Sentiment Analysis ◽

Training Model ◽

Sentiment Classification ◽

The Other ◽

Support Vector ◽

Analysis Techniques ◽

Sentiment Lexicon ◽

And Behavior

In this work, we focus on the application of text mining and sentiment analysis techniques for analyzing Tunisian users' statuses updates on Facebook. We aim to extract useful information, about their sentiment and behavior, especially during the “Arabic spring” era. To achieve this task, we describe a method for sentiment analysis using Support Vector Machine and Naïve Bayes algorithms, and applying a combination of more than two features. The output of this work consists, on one hand, on the construction of a sentiment lexicon based on the Emoticons and Acronyms' lexicons that we developed based on the extracted statuses updates; and on the other hand, it consists on the realization of detailed comparative experiments between the above algorithms by creating a training model for sentiment classification.

Download Full-text

Implementation of n-gram Methodology for Rotten Tomatoes Review Dataset Sentiment Analysis

International Journal of Knowledge Discovery in Bioinformatics ◽

10.4018/ijkdb.2017010103 ◽

2017 ◽

Vol 7 (1) ◽

pp. 30-41 ◽

Cited By ~ 12

Author(s):

Prayag Tiwari ◽

Brojo Kishore Mishra ◽

Sachin Kumar ◽

Vivek Kumar

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Maximum Entropy ◽

Learning Strategies ◽

Supervised Machine Learning ◽

Support Vector ◽

N Gram ◽

F Measure ◽

Blog Posts

Sentiment Analysis intends to get the basic perspective of the content, which may be anything that holds a subjective supposition, for example, an online audit, Comments on Blog posts, film rating and so forth. These surveys and websites might be characterized into various extremity gatherings, for example, negative, positive, and unbiased keeping in mind the end goal to concentrate data from the info dataset. Supervised machine learning strategies group these reviews. In this paper, three distinctive machine learning calculations, for example, Support Vector Machine (SVM), Maximum Entropy (ME) and Naive Bayes (NB), have been considered for the arrangement of human conclusions. The exactness of various strategies is basically inspected keeping in mind the end goal to get to their execution on the premise of parameters, e.g. accuracy, review, f-measure, and precision.

Download Full-text

Sentiment Analysis on E-commerce Product using Machine Learning and Combination of TF-IDF and Backward Elimination

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.f7889.038620 ◽

2020 ◽

Vol 8 (6) ◽

pp. 2862-2867

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Feature Selection ◽

Sentiment Analysis ◽

Opinion Mining ◽

Classification Performance ◽

Support Vector ◽

Product Reviews ◽

Feature Selection Technique ◽

Backward Elimination

E-commerce is a website or mobile application platform that help people to buy products. Before purchasing the product, customer will decide to buy it or not by reading the review from previous buyer. There is a problem that there are a lot of review so it will take a long time for customer to read it all. This research will be using sentiment analysis method to classify the review data. Sentiment analysis or opinion mining is a machine learning approach to classify and analyse texts or documents about human’s sentiments, emotions, and opinions. In this research, sentiment analysis was used to classify product reviews from e-commerce websites into positive or negative classes. The results could be processed further and be used to summarize customers' opinions about a certain product without reading every single review. The goal of this research is to optimize classification performance by using feature selection technique. Terms Frequency-Inverse Document Frequency (TF-IDF) feature extraction, Backward Elimination feature selection, and five different classifiers (Naïve Bayes, Support Vector Machine, K-Nearest Neighbour, Decision Tree, Random Forest) were used in analysing the sentiment of the reviews. In this research, the dataset used are Indonesian language and classified into two classes(positive and negative). The best accuracy is achieved by using TF-IDF, Backward Elimination and Support Vector Machine (SVM) with a score of 85.97%, which increases by 7.91% if compared to the process without feature selection. Based on the results, Backward Elimination feature selection succeeded in improving all performance for all classifiers used in this research.

Download Full-text

COVID-19 and Black Fungus: Analysis of the Public Perception through Machine Learning

10.1101/2021.07.08.21260188 ◽

2021 ◽

Author(s):

Muhammad Nazrul Islam ◽

Nafiz Imtiaz Khan ◽

Tahasin Mahmud

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Public Perception ◽

Public Perceptions ◽

Support Vector ◽

Machine Model ◽

The Public ◽

Class Labels ◽

Media Data

While COVID-19 is ravaging the lives of millions of people across the globe, a second pandemic 'black fungus' has surfaced robbing people of their lives especially people who are recovering from coronavirus. Again, the public perceptions regarding such pandemics can be investigated through sentiment analysis of social media data. Thus the objective of this study is to analyze public perceptions through sentiment analysis regarding black fungus during the time of the COVID-19 pandemic. To attain the objective, first, a Support Vector Machine model, with an average AUC of 82.75\%, was developed to classify user sentiments in terms of anger, fear, joy, and sad. Next, this Support Vector Machine is used to supervise the class labels of the public tweets (n = 6477) related to COVID-19 and black fungus. As outcome, this study found that public perceptions belong to sad (n = 2370, 36.59 \%), followed by joy ( n = 2095, 32.34\%), fear ( n = 1914, 29.55 \%) and anger ( n = 98, 1.51\%) towards black fungus during COVID-19 pandemic. This study also investigated public perceptions of some critical concerns (e.g., education, lockdown, hospital, oxygen, quarantine, and vaccine) and it was found that public perceptions of these issues varied. For example, for the most part, people exhibited fear in social media about education, hospital, vaccine while some people expressed joy about education, hospital, vaccine, and oxygen.

Download Full-text

Implementation of n-gram Methodology for Rotten Tomatoes Review Dataset Sentiment Analysis

Cognitive Analytics ◽

10.4018/978-1-7998-2460-2.ch036 ◽

2020 ◽

pp. 689-701

Author(s):

Prayag Tiwari ◽

Brojo Kishore Mishra ◽

Sachin Kumar ◽

Vivek Kumar

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Maximum Entropy ◽

Learning Strategies ◽

Naive Bayes ◽

Supervised Machine Learning ◽

Support Vector ◽

N Gram ◽

F Measure

Sentiment Analysis intends to get the basic perspective of the content, which may be anything that holds a subjective supposition, for example, an online audit, Comments on Blog posts, film rating and so forth. These surveys and websites might be characterized into various extremity gatherings, for example, negative, positive, and unbiased keeping in mind the end goal to concentrate data from the info dataset. Supervised machine learning strategies group these reviews. In this paper, three distinctive machine learning calculations, for example, Support Vector Machine (SVM), Maximum Entropy (ME) and Naive Bayes (NB), have been considered for the arrangement of human conclusions. The exactness of various strategies is basically inspected keeping in mind the end goal to get to their execution on the premise of parameters, e.g. accuracy, review, f-measure, and precision.

Download Full-text

ANALISIS SENTIMEN APLIKASI E-LEARNING SELAMA PANDEMI COVID-19 DENGAN MENGGUNAKAN METODE SUPPORT VECTOR MACHINE DAN CONVOLUTIONAL NEURAL NETWORK

SEMINASTIKA ◽

10.47002/seminastika.v3i1.236 ◽

2021 ◽

Vol 3 (1) ◽

pp. 16-25

Author(s):

Alek Sander Simbolon ◽

Nina Ismaya Pangaribuan ◽

Nenni Mona Aruan

Keyword(s):

Neural Network ◽

Support Vector Machine ◽

Natural Language ◽

Convolutional Neural Network ◽

Sentiment Analysis ◽

Support Vector ◽

E Learning ◽

F Measure

Aplikasi e-learning dibutuhkan masyarakat dalam meningkatkan pendidikan di mana e-learning yang menjadi objek penelitian adalah Ruangguru dan Zenius karena memiliki jumlah pengguna lebih dari 16 juta dengan kepuasan pengguna lebih dari 8.5/10 dan lebih dari 1 juta kali di download di play store. Aplikasi tersebut memberikan ruang bagi pengguna aplikasi untuk mendapatkan tingkat kepuasan dari pengguna aplikasi. Sentiment analysis merupakan natural language preprocessing yang dapat digunakan dalam melakukan ekstraksi opini dari data berupa teks di mana tujuan penelitian ini melakukan evaluasi pada peningkatan hal positif dan memperbaiki hal yang negatif. Data ulasan yang diambil dari Twitter dan play store memiliki promosi dan giveaway yang akan berpengaruh pada pengolahan data dalam penentuan opini dan bukan opini. Penulis menggunakan metode lexicon based dalam memberikan label atau nilai sentiment pada setiap data. Pendekatan yang digunakan algoritma Support Vector Machine (SVM) dan Convolutional Neural Network (CNN) dalam melakukan klasifikasi terhadap data test yang di uji dari model yang telah dibangun. Berdasarkan hasil klasifikasi opini menjadi tiga kelas yaitu kelas positif, negatif, dan netral dari ulasan aplikasi Ruangguru dan Zenius. Dari nilai akurasi dan F-measure diperoleh bahwa klasifikasi yang terbaik adalah menggunakan algoritma CNN dengan akurasi dan F-measure memiliki nilai 86%.

Download Full-text

Performance comparison of support vector machine (SVM) with linear kernel and polynomial kernel for multiclass sentiment analysis on twitter

ILKOM Jurnal Ilmiah ◽

10.33096/ilkom.v13i2.851.168-174 ◽

2021 ◽

Vol 13 (2) ◽

pp. 168-174

Author(s):

Rifqatul Mukarramah ◽

Dedy Atmajaya ◽

Lutfi Budi Ilmawan

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Confusion Matrix ◽

Classification Performance ◽

Performance Comparison ◽

Kernel Functions ◽

Polynomial Kernel ◽

Support Vector ◽

Linear Kernel ◽

Testing Method

Sentiment analysis is a technique to extract information of one’s perception, called sentiment, on an issue or event. This study employs sentiment analysis to classify society’s response on covid-19 virus posted at twitter into 4 polars, namely happy, sad, angry, and scared. Classification technique used is support vector machine (SVM) method which compares the classification performance figure of 2 linear kernel functions, linear and polynomial. There were 400 tweet data used where each sentiment class consists of 100 data. Using the testing method of k-fold cross validation, the result shows the accuracy value of linear kernel function is 0.28 for unigram feature and 0.36 for trigram feature. These figures are lower compared to accuracy value of kernel polynomial with 0.34 and 0.48 for unigram and trigram feature respectively. On the other hand, testing method of confusion matrix suggests the highest performance is obtained by using kernel polynomial with accuracy value of 0.51, precision of 0.43, recall of 0.45, and f-measure of 0.51.

Download Full-text

Linear Support Vector Machine (SVM) with Stochastic Gradient Descent (SGD) training and multinomial Nave Bayes (NB) in News Classification

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v7i4.360363 ◽

2019 ◽

Vol 7 (4) ◽

pp. 360-363

Author(s):

Feroz Ahmed ◽

Shabina Ghafir

Keyword(s):

Support Vector Machine ◽

Gradient Descent ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Support Vector ◽

Linear Support Vector Machine

Download Full-text