Sentiment Analysis Based Approaches for Understanding User Context in Web Content

<p class="0abstract">Today, the influence of the social media on different aspects of our lives is increasing, many scholars from various disciplines and majors looking at the social media networks as the ongoing revolution. In Social media networks, many bonds and connections can be established whether being direct or indirect ties. In fact, Social networks are used not only by people but also by companies. People usually create their own profiles and join communities to discuss different common issues that they have interest in. On the other hand, companies also can create their virtual presence on the social media networks to benefit from this media to understand the customers and gather richer information about them. With all of the benefits and advantages of social media networks, they should not always be seen as a safe place for communicating, sharing information and ideas, and establishing virtual communities. These information and ideas could carry with them hatred speeches that must be detected to avoid raising violence. Therefore, web content mining can be used to handle this issue. Web content mining is gaining more concern because of its importance for many businesses and institutions. Sentiment Analysis (SA) is an important sub-area of web content mining. The purpose of SA is to determine the overall sentiment attitude of writer towards a specific entity and classify these opinions automatically. There are two main approaches to build systems of sentiment analysis: the machine learning approach and the lexicon-based approach. This research presents the design and implementation for violence detection over social media using machine learning approach. Our system works on Jordanian Arabic dialect instead of Modern Standard Arabic (MSA). The data was collected from two popular social media websites (Facebook, Twitter) and has used native speakers to annotate the data. Moreover, different preprocessing techniques have been used to show their effect on our model accuracy. The Arabic lexicon was used for generating feature vectors and separate them to features set. Here, we have three well known machine learning algorithms: Support Vector Machine (SVM), Naive Bayes (NB) and k-Nearest Neighbors (KNN). Building on this view, Information Science Research Institute’s (ISRI) stemming and stop word file as a result of preprocessing were used to extract the features. Indeed, several features have been extracted; however, using the SVM classifier reveals that unigram and features extracted from lexicon are characterized by the highest accuracy to detect violence.</p>

Download Full-text

Web Content Classification based on Topic and Sentiment Analysis of Text

Proceedings of the International Conference on Knowledge Discovery and Information Retrieval ◽

10.5220/0005101803000307 ◽

2014 ◽

Author(s):

Shuhua Liu ◽

Thomas Forss

Keyword(s):

Sentiment Analysis ◽

Web Content ◽

Content Classification

Download Full-text

Effectiveness of Normalization Over Processing of Textual Data Using Hybrid Approach Sentiment Analysis

International Journal of Grid and High Performance Computing ◽

10.4018/ijghpc.2020070103 ◽

2020 ◽

Vol 12 (3) ◽

pp. 43-56 ◽

Cited By ~ 1

Author(s):

Sukhnandan Kaur Johal ◽

Rajni Mohana

Keyword(s):

Natural Language Processing ◽

Decision Support ◽

Natural Language ◽

Data Processing ◽

Sentiment Analysis ◽

Language Processing ◽

Hybrid Approach ◽

Web Content ◽

Process Data ◽

Computerized Decision Support

Various natural language processing tasks are carried out to feed into computerized decision support systems. Among these, sentiment analysis is gaining more attention. The majority of sentiment analysis relies on the social media content. This web content is highly un-normalized in nature. This hinders the performance of decision support system. To enhance the performance, it is required to process data efficiently. This article proposes a novel method of normalization of web data during the pre-processing phase. It is aimed to get better results for different natural language processing tasks. This research applies this technique on data for sentiment analysis. Performance of different learning models is analysed using precision, recall, f-measure, fallout for normalize and un-normalize sentiment analysis. Results shows after normalization, some documents shift their polarity i.e. negative to positive. Experimental results show normalized data processing outperforms un-normalized data processing with better accuracy.

Download Full-text

Semi-automated Methods for the Annotation and Design of a Semantic Network Designed for Sentiment Analysis of Social Web Content

2011 22nd International Workshop on Database and Expert Systems Applications ◽

10.1109/dexa.2011.75 ◽

2011 ◽

Cited By ~ 1

Author(s):

Donato Barbagallo ◽

Leonardo Bruni ◽

Chiara Francalanci ◽

Paolo Giacomazzi ◽

Francesco Merlo ◽

...

Keyword(s):

Sentiment Analysis ◽

Semantic Network ◽

Social Web ◽

Web Content ◽

Automated Methods

Download Full-text

A Hybrid CNN-LSTM: A Deep Learning Approach for Consumer Sentiment Analysis Using Qualitative User-Generated Contents

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3457206 ◽

2021 ◽

Vol 20 (5) ◽

pp. 1-15

Author(s):

Praphula Kumar Jain ◽

Vijayalakshmi Saravanan ◽

Rajendra Pamula

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Language Processing ◽

Short Term Memory ◽

Online Reviews ◽

Web Content ◽

Proposed Model ◽

Social Media Platforms ◽

Information And Communication ◽

Better Than

With the fastest growth of information and communication technology (ICT), the availability of web content on social media platforms is increasing day by day. Sentiment analysis from online reviews drawing researchers’ attention from various organizations such as academics, government, and private industries. Sentiment analysis has been a hot research topic in Machine Learning (ML) and Natural Language Processing (NLP). Currently, Deep Learning (DL) techniques are implemented in sentiment analysis to get excellent results. This study proposed a hybrid convolutional neural network-long short-term memory (CNN-LSTM) model for sentiment analysis. Our proposed model is being applied with dropout, max pooling, and batch normalization to get results. Experimental analysis carried out on Airlinequality and Twitter airline sentiment datasets. We employed the Keras word embedding approach, which converts texts into vectors of numeric values, where similar words have small vector distances between them. We calculated various parameters, such as accuracy, precision, recall, and F1-measure, to measure the model’s performance. These parameters for the proposed model are better than the classical ML models in sentiment analysis. Our results analysis demonstrates that the proposed model outperforms with 91.3% accuracy in sentiment analysis.

Download Full-text

Holistic User Context-Aware Recommender Algorithm

Mathematical Problems in Engineering ◽

10.1155/2019/3965845 ◽

2019 ◽

Vol 2019 ◽

pp. 1-15 ◽

Cited By ~ 1

Author(s):

Tatenda D. Kavu ◽

Kudakwashe Dube ◽

Peter G. Raeth

Keyword(s):

Social Network ◽

Collaborative Filtering ◽

Context Aware ◽

Web Content ◽

Web Content Mining ◽

User Context ◽

Novel Approach ◽

User Influence ◽

Content Mining ◽

Recommender Algorithm

Existing recommender algorithms lack dynamism, human focus, and serendipitous recommendations. The literature indicates that the context of a user influences user decisions, and when incorporated in recommender systems (RSs), novel and serendipitous recommendations can be realized. This article shows that social, cultural, psychological, and economic contexts of a user influence user traits or decisions. The article demonstrates a novel approach of incorporating holistic user context-aware knowledge in an algorithm to solve the highlighted problems. Web content mining and collaborative filtering approaches were used to develop a holistic user context-aware (HUC) algorithm. The algorithm was evaluated on a social network using online experimental evaluations. The algorithm demonstrated dynamism, novelty, and serendipity with an average of 84% novelty and 85% serendipity.

Download Full-text

A Rule Based Stemmer

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a9545.109119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 2026-2029

Keyword(s):

Sentiment Analysis ◽

Web Mining ◽

Web Content ◽

Rule Based ◽

Web Content Mining ◽

Digital World ◽

Stop Word ◽

Enormous Amount ◽

Content Mining ◽

Preprocessing Technique

The present digital world generates enormous amount of data instantaneously. The need to effectively mine knowledge seems to be the need of the hour. Sentiment Analysis, a part of web content mining which is a subpart of web mining has gained momentum in the field of research. It analyses the opinion of variety of people all over the world. Sentiment Analysis encompasses preprocessing, feature selection, classification and sentiment prediction. Preprocessing is an important process and it deals with many techniques. Stop word removal, punctuation removal, conversion of numbers to number names are some of the basic techniques. Stemming is yet another important preprocessing technique that reduces the different words form to its root. There are basically three types of stemmers namely truncating, statistical and hybrid. The aim of this paper is to propose a rule based stemmer that is a truncating stemmer. It deals with rules for truncation and replacement. The data given as input passes through a series of rules. If the condition specified gets satisfied then the associated rule gets executed otherwise the input is checked with the next rule and the process continues further. The result of execution is stemmed words. The performance of the proposed rule based stemmer is compared with the existing stemmers under the same rule based category namely Porter and Lancaster. Various metrics have been used for evaluation. The observations reveal the fact that the proposed stemmer out performs the Porter and Lancaster stemmers in terms of correctly stemmed words factor and shows a good average conflation factor and lesser over stemming and under stemming errors.

Download Full-text

Study of Sentiment of Governor's Election Opinion in 2018

International Journal of Scientific Research in Science Engineering and Technology ◽

10.32628/ijsrset21841124 ◽

2018 ◽

pp. 231-238

Author(s):

Agung Eddy Suryo Saputro ◽

Khairil Anwar Notodiputro ◽

Indahwati A

Keyword(s):

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Addition Method ◽

Sentiment Mining ◽

Positive Sentiment ◽

An historical analysis of species references in American English

Corpora ◽

10.3366/cor.2019.0177 ◽

2019 ◽

Vol 14 (3) ◽

pp. 327-349

Author(s):

Craig Frayne

Keyword(s):

Environmental Change ◽

Sentiment Analysis ◽

Quantitative Methods ◽

English Language ◽

Language Use ◽

American English ◽

Historical Analysis ◽

The Past ◽

Corpus Studies ◽

Google Books

This study uses the two largest available American English language corpora, Google Books and the Corpus of Historical American English (coha), to investigate relations between ecology and language. The paper introduces ecolinguistics as a promising theme for corpus research. While some previous ecolinguistic research has used corpus approaches, there is a case to be made for quantitative methods that draw on larger datasets. Building on other corpus studies that have made connections between language use and environmental change, this paper investigates whether linguistic references to other species have changed in the past two centuries and, if so, how. The methodology consists of two main parts: an examination of the frequency of common names of species followed by aspect-level sentiment analysis of concordance lines. Results point to both opportunities and challenges associated with applying corpus methods to ecolinguistc research.

Download Full-text

Sentiment Analysis Based Approaches for Understanding User Context in Web Content

Combining N-gram based Similarity Analysis with Sentiment Analysis in Web Content Classification

Violence Detection over Online Social Networks: An Arabic Sentiment Analysis Approach

Web Content Classification based on Topic and Sentiment Analysis of Text

Effectiveness of Normalization Over Processing of Textual Data Using Hybrid Approach Sentiment Analysis

Semi-automated Methods for the Annotation and Design of a Semantic Network Designed for Sentiment Analysis of Social Web Content

A Hybrid CNN-LSTM: A Deep Learning Approach for Consumer Sentiment Analysis Using Qualitative User-Generated Contents

Holistic User Context-Aware Recommender Algorithm

A Rule Based Stemmer

Study of Sentiment of Governor's Election Opinion in 2018

An historical analysis of species references in American English

Export Citation Format