Sentiment Mining Approaches for Big Data Classification and Clustering

Author(s):  
Ashok Kumar J ◽  
Abirami S ◽  
Tina Esther Trueman

Sentiment analysis is one of the most important applications in the field of text mining. It computes people's opinions, comments, posts, reviews, evaluations, and emotions which are expressed on products, sales, services, individuals, organizations, etc. Nowadays, large amounts of structured and unstructured data are being produced on the web. The categorizing and grouping of these data become a real-world problem. In this chapter, the authors address the current research in this field, issues and the problem of sentiment analysis on Big Data for classification and clustering. It suggests new methods, applications, algorithm extensions of classification and clustering and software tools in the field of sentiment analysis.

2017 ◽  
Vol 13 (3) ◽  
pp. 47-67 ◽  
Author(s):  
Carina Sofia Andrade ◽  
Maribel Yasmina Santos

The evolution of technology, along with the common use of different devices connected to the Internet, provides a vast growth in the volume and variety of data that are daily generated at high velocity, phenomenon commonly denominated as Big Data. Related with this, several Text Mining techniques make possible the extraction of useful insights from that data, benefiting the decision-making process across multiple areas, using the information, models, patterns or tendencies that these techniques are able to identify. With Sentiment Analysis, it is possible to understand which sentiments and opinions are implicit in this data. This paper proposes an architecture for Sentiment Analysis that uses data from the Twitter, which is able to collect, store, process and analyse data on a real-time fashion. To demonstrate its utility, practical applications are developed using real world examples where Sentiment Analysis brings benefits when applied. With the presented demonstration case, it is possible to verify the role of each used technology and the techniques adopted for Sentiment Analysis.


Author(s):  
Emrah Inan ◽  
Burak Yonyul ◽  
Fatih Tekbacak

Most of the data on the web is non-structural, and it is required that the data should be transformed into a machine operable structure. Therefore, it is appropriate to convert the unstructured data into a structured form according to the requirements and to store those data in different data models by considering use cases. As requirements and their types increase, it fails using one approach to perform on all. Thus, it is not suitable to use a single storage technology to carry out all storage requirements. Managing stores with various type of schemas in a joint and an integrated manner is named as 'multistore' and 'polystore' in the database literature. In this paper, Entity Linking task is leveraged to transform texts into wellformed data and this data is managed by an integrated environment of different data models. Finally, this integrated big data environment will be queried and be examined by presenting the method.


Author(s):  
Ricardo Baeza-Yates ◽  
Roi Blanco ◽  
Malú Castellanos

Web search has become a ubiquitous commodity for Internet users. This fact puts a large number of documents with plenty of text content at our fingertips. To make good use of this data, we need to mine web text. This triggers the two problems covered here: sentiment analysis and entity retrieval in the context of the Web. The first problem answers the question of what people think about a given product or a topic, in particular sentiment analysis in social media. The second problem addresses the issue of solving certain enquiries precisely by returning a particular object: for instance, where the next concert of my favourite band will be or who the best cooks are in a particular region. Where to find these objects and how to retrieve, rank, and display them are tasks related to the entity retrieval problem.


Author(s):  
Wafaa A. Al-Rabayah ◽  
Ahmad Al-Zyoud

Sentiment analysis is a process of determining the polarity (i.e. positive, negative or neutral) of a given text. The extremely increased amount of information available on the web, especially social media, create a challenge to be retrieved and analyzed on time, timely analyzed of unstructured data provide businesses a competitive advantage by better understanding their customers' needs and preferences. This literature review will cover a number of studies about sentiment analysis and finds the connection between sentiment analysis of social network content and customers retention; we will focus on sentiment analysis and discuss concepts related to this field, most important relevant studies and its results, its methods of applications, where it can be applied and its business applications, finally, we will discuss how can sentiment analysis improve the customer retention based on retrieved data.


Author(s):  
Wafaa A. Al-Rabayah ◽  
Ahmad Al-Zyoud

Sentiment analysis is a process of determining the polarity (i.e. positive, negative or neutral) of a given text. The extremely increased amount of information available on the web, especially social media, create a challenge to be retrieved and analyzed on time, timely analyzed of unstructured data provide businesses a competitive advantage by better understanding their customers' needs and preferences. This literature review will cover a number of studies about sentiment analysis and finds the connection between sentiment analysis of social network content and customers retention; we will focus on sentiment analysis and discuss concepts related to this field, most important relevant studies and its results, its methods of applications, where it can be applied and its business applications, finally, we will discuss how can sentiment analysis improve the customer retention based on retrieved data.


Author(s):  
Caio Saraiva Coneglian ◽  
Elvis Fusco

The data available on the Web is growing exponentially, providing information of high added value to organizations. Such information can be arranged in diverse bases and in varied formats, like videos and photos in social media. However, unstructured data present great difficulty for the information retrieval, not efficiently meeting the informational needs of the users, because there are problems in understanding the meaning of documents stored on the Web. In the context of an Information Retrieval architecture, this research aims to The implementation of a semantic extraction agent in the context of the Web that allows the location, treatment and retrieval of information in the context of Big Data in the most varied informational sources that serves as the basis for the implementation of informational environments that aid the Information Retrieval process , Using ontology to add semantics to the process of retrieval and presentation of results obtained to users, thus being able to meet their needs.


2021 ◽  
Vol 54 (7) ◽  
pp. 1-36
Author(s):  
Luciano Ignaczak ◽  
Guilherme Goldschmidt ◽  
Cristiano André Da Costa ◽  
Rodrigo Da Rosa Righi

The growth of data volume has changed cybersecurity activities, demanding a higher level of automation. In this new cybersecurity landscape, text mining emerged as an alternative to improve the efficiency of the activities involving unstructured data. This article proposes a Systematic Literature Review ( SLR ) to present the application of text mining in the cybersecurity domain. Using a systematic protocol, we identified 2,196 studies, out of which 83 were summarized. As a contribution, we propose a taxonomy to demonstrate the different activities in the cybersecurity domain supported by text mining. We also detail the strategies evaluated in the application of text mining tasks and the use of neural networks to support activities involving unstructured data. The work also discusses text classification performance aiming its application in real-world solutions. The SLR also highlights open gaps for future research, such as the analysis of non-English content and the intensification in the usage of neural networks.


Sign in / Sign up

Export Citation Format

Share Document