Negation detection for sentiment analysis: A case study in Spanish

2020 ◽  
pp. 1-24
Author(s):  
Salud María Jiménez-Zafra ◽  
Noa P. Cruz-Díaz ◽  
Maite Taboada ◽  
María Teresa Martín-Valdivia

Abstract Accurate negation identification is one of the most important tasks in the context of sentiment analysis. In order to correctly interpret the sentiment value of a particular expression, we need to identify whether it is in the scope of negation. While much of the work on negation detection has focused on English, we have seen recent developments that provide accurate identification of negation in other languages. In this paper, we provide an overview of negation detection systems and describe an implementation of a Spanish system for negation cue detection and scope identification. We apply this system to the sentiment analysis task, confirming also for Spanish that improvements can be gained from accurate negation detection. The paper contributes an implementation of negation detection for sentiment analysis in Spanish and a detailed error analysis. This is the first work in Spanish in which a machine learning negation processing system is applied to the sentiment analysis task. Existing methods have used negation rules that have not been assessed, perhaps because the first Spanish corpus annotated with negation for sentiment analysis has only recently become available.

2022 ◽  
pp. 176-194
Author(s):  
Suania Acampa ◽  
Ciro Clemente De Falco ◽  
Domenico Trezza

The uncritical application of automatic analysis techniques can be insidious. For this reason, the scientific community is very interested in the supervised approach. Can this be enough? This chapter aims to these issues by comparing three machine learning approaches to measuring the sentiment. The case study is the analysis of the sentiment expressed by the Italians on Twitter during the first post-lockdown day. To start the supervised model, it has been necessary to build a stratified sample of tweets by daily and classifying them manually. The model to be test provides for further analysis at the end of the process useful for comparing the three models: index will be built on the tweets processed with the aim of detecting the goodness of the results produced. The comparison of the three algorithms helps the authors to understand not only which is the best approach for the Italian language but tries to understand which strategy is to verify the quality of the data obtained.


2020 ◽  
Vol 9 (4) ◽  
pp. 1411-1419
Author(s):  
Nashwan Dheyaa Zaki ◽  
Nada Yousif Hashim ◽  
Yasmin Makki Mohialden ◽  
Mostafa Abdulghafoor Mohammed ◽  
Tole Sutikno ◽  
...  

The scale of data streaming in social networks, such as Twitter, is increasing exponentially. Twitter is one of the most important and suitable big data sources for machine learning research in terms of analysis, prediction, extract knowledge, and opinions. People use Twitter platform daily to express their opinion which is a fundamental fact that influence their behaviors. In recent years, the flow of Iraqi dialect has been increased, especially on the Twitter platform. Sentiment analysis for different dialects and opinion mining has become a hot topic in data science researches. In this paper, we will attempt to develop a real-time analytic model for sentiment analysis and opinion mining to Iraqi tweets using spark streaming, also create a dataset for researcher in this field. The Twitter handle Bassam AlRawi is the case study here. The new method is more suitable in the current day machine learning applications and fast online prediction. 


Author(s):  
Georgios Alexandridis ◽  
John Aliprantis ◽  
Konstantinos Michalakis ◽  
Konstantinos Korovesis ◽  
Panagiotis Tsantilas ◽  
...  

The task of sentiment analysis tries to predict the affective state of a document by examining its content and metadata through the application of machine learning techniques. Recent advances in the field consider sentiment to be a multi-dimensional quantity that pertains to different interpretations (or aspects), rather than a single one. Based on earlier research, the current work examines the said task in the framework of a larger architecture that crawls documents from various online sources. Subsequently, the collected data are pre-processed, in order to extract useful features that assist the machine learning algorithms in the sentiment analysis task. More specifically, the words that comprise each text are mapped to a neural embedding space and are provided to a hybrid, bi-directional long short-term memory network, coupled with convolutional layers and an attention mechanism that outputs the final textual features. Additionally, a number of document metadata are extracted, including the number of a document’s repetitions in the collected corpus (i.e. number of reposts/retweets), the frequency and type of emoji ideograms and the presence of keywords, either extracted automatically or assigned manually, in the form of hashtags. The novelty of the proposed approach lies in the semantic annotation of the retrieved keywords, since an ontology-based knowledge management system is queried, with the purpose of retrieving the classes the aforementioned keywords belong to. Finally, all features are provided to a fully connected, multi-layered, feed-forward artificial neural network that performs the analysis task. The overall architecture is compared, on a manually collected corpus of documents, with two other state-of-the-art approaches, achieving optimal results in identifying negative sentiment, which is of particular interest to certain parties (like for example, companies) that are interested in measuring their online reputation.


2021 ◽  
Vol 13 (9) ◽  
pp. 4986
Author(s):  
Imatitikua D. Aiyanyo ◽  
Hamman Samuel ◽  
Heuiseok Lim

In this study, we qualitatively and quantitatively examine the effects of COVID-19 on classrooms, students, and educators. Using a new Twitter dataset specific to South Korea during the pandemic, we sample the sentiment and strain on students and educators using applied machine learning techniques in order to identify various topical pain points emerging during the pandemic. Our contributions include a novel and open source geo-fenced dataset on student and educator opinion within South Korea that we are making available to other researchers as well. We also identify trends in sentiment and polarity over the pandemic timeline, as well as key drivers behind the sentiments. Moreover, we provide a comparative analysis of two widely used pre-trained sentiment analysis approaches with TextBlob and VADER using statistical significance tests. Ultimately, we analyze how public opinion shifted on the pandemic in terms of positive sentiments about accessing course materials, online support communities, access to classes, and creativity, to negative sentiments about mental fatigue, job loss, student concerns, and overwhelmed institutions. We also initiate initial discussions about the concept of actionable sentiment analysis by overlapping polarity with the concept of trigger management to assist users in coping with negative emotions. We hope that insights from this preliminary study can promote further utilization of social media datasets to evaluate government messaging, population sentiment, and multi-dimensional analysis of pandemics.


Author(s):  
G.Y. Fan ◽  
J.M. Cowley

In recent developments, the ASU HB5 has been modified so that the timing, positioning, and scanning of the finely focused electron probe can be entirely controlled by a host computer. This made the asynchronized handshake possible between the HB5 STEM and the image processing system which consists of host computer (PDP 11/34), DeAnza image processor (IP 5000) which is interfaced with a low-light level TV camera, array processor (AP 400) and various peripheral devices. This greatly facilitates the pattern recognition technique initiated by Monosmith and Cowley. Software called NANHB5 is under development which, instead of employing a set of photo-diodes to detect strong spots on a TV screen, uses various software techniques including on-line fast Fourier transform (FFT) to recognize patterns of greater complexity, taking advantage of the sophistication of our image processing system and the flexibility of computer software.


Sign in / Sign up

Export Citation Format

Share Document