Sentiment Mining Approaches for Big Data Classification and Clustering

The evolution of technology, along with the common use of different devices connected to the Internet, provides a vast growth in the volume and variety of data that are daily generated at high velocity, phenomenon commonly denominated as Big Data. Related with this, several Text Mining techniques make possible the extraction of useful insights from that data, benefiting the decision-making process across multiple areas, using the information, models, patterns or tendencies that these techniques are able to identify. With Sentiment Analysis, it is possible to understand which sentiments and opinions are implicit in this data. This paper proposes an architecture for Sentiment Analysis that uses data from the Twitter, which is able to collect, store, process and analyse data on a real-time fashion. To demonstrate its utility, practical applications are developed using real world examples where Sentiment Analysis brings benefits when applied. With the presented demonstration case, it is possible to verify the role of each used technology and the techniques adopted for Sentiment Analysis.

Download Full-text

A Domain Specific Entity Linking Approach Consuming Multistore Environment

Journal of Intelligent Systems with Applications ◽

10.54856/jiswa.201805016 ◽

2018 ◽

pp. 46-52

Author(s):

Emrah Inan ◽

Burak Yonyul ◽

Fatih Tekbacak

Keyword(s):

Big Data ◽

Data Models ◽

Use Cases ◽

Unstructured Data ◽

Entity Linking ◽

Domain Specific ◽

Storage Technology ◽

Integrated Environment ◽

Data Environment ◽

The Web

Most of the data on the web is non-structural, and it is required that the data should be transformed into a machine operable structure. Therefore, it is appropriate to convert the unstructured data into a structured form according to the requirements and to store those data in different data models by considering use cases. As requirements and their types increase, it fails using one approach to perform on all. Thus, it is not suitable to use a single storage technology to carry out all storage requirements. Managing stores with various type of schemas in a joint and an integrated manner is named as 'multistore' and 'polystore' in the database literature. In this paper, Entity Linking task is leveraged to transform texts into wellformed data and this data is managed by an integrated environment of different data models. Finally, this integrated big data environment will be queried and be examined by presenting the method.

Download Full-text

Web Text Mining

The Oxford Handbook of Computational Linguistics 2nd edition ◽

10.1093/oxfordhb/9780199573691.013.27 ◽

2016 ◽

Author(s):

Ricardo Baeza-Yates ◽

Roi Blanco ◽

Malú Castellanos

Keyword(s):

Social Media ◽

Text Mining ◽

Sentiment Analysis ◽

Web Search ◽

Internet Users ◽

Entity Retrieval ◽

Web Text Mining ◽

Text Content ◽

The Web

Web search has become a ubiquitous commodity for Internet users. This fact puts a large number of documents with plenty of text content at our fingertips. To make good use of this data, we need to mine web text. This triggers the two problems covered here: sentiment analysis and entity retrieval in the context of the Web. The first problem answers the question of what people think about a given product or a topic, in particular sentiment analysis in social media. The second problem addresses the issue of solving certain enquiries precisely by returning a particular object: for instance, where the next concert of my favourite band will be or who the best cooks are in a particular region. Where to find these objects and how to retrieve, rank, and display them are tasks related to the entity retrieval problem.

Download Full-text

Sentiment Analysis of Social Media as Tool to Improve Customer Retention

Digital Marketing and Consumer Engagement ◽

10.4018/978-1-5225-5187-4.ch032 ◽

2017 ◽

pp. 635-648

Author(s):

Wafaa A. Al-Rabayah ◽

Ahmad Al-Zyoud

Keyword(s):

Social Media ◽

Social Network ◽

Literature Review ◽

Competitive Advantage ◽

Sentiment Analysis ◽

Customer Retention ◽

Unstructured Data ◽

Amount Of Information ◽

Business Applications ◽

The Web

Sentiment analysis is a process of determining the polarity (i.e. positive, negative or neutral) of a given text. The extremely increased amount of information available on the web, especially social media, create a challenge to be retrieved and analyzed on time, timely analyzed of unstructured data provide businesses a competitive advantage by better understanding their customers' needs and preferences. This literature review will cover a number of studies about sentiment analysis and finds the connection between sentiment analysis of social network content and customers retention; we will focus on sentiment analysis and discuss concepts related to this field, most important relevant studies and its results, its methods of applications, where it can be applied and its business applications, finally, we will discuss how can sentiment analysis improve the customer retention based on retrieved data.

Download Full-text

Sentiment Analysis of Social Media as Tool to Improve Customer Retention

Advances in Marketing, Customer Relationship Management, and E-Services - Strategic Uses of Social Media for Improved Customer Retention ◽

10.4018/978-1-5225-1686-6.ch011 ◽

2017 ◽

pp. 207-223

Author(s):

Wafaa A. Al-Rabayah ◽

Ahmad Al-Zyoud

Keyword(s):

Social Media ◽

Social Network ◽

Literature Review ◽

Competitive Advantage ◽

Sentiment Analysis ◽

Customer Retention ◽

Unstructured Data ◽

Amount Of Information ◽

Business Applications ◽

The Web

Sentiment analysis is a process of determining the polarity (i.e. positive, negative or neutral) of a given text. The extremely increased amount of information available on the web, especially social media, create a challenge to be retrieved and analyzed on time, timely analyzed of unstructured data provide businesses a competitive advantage by better understanding their customers' needs and preferences. This literature review will cover a number of studies about sentiment analysis and finds the connection between sentiment analysis of social network content and customers retention; we will focus on sentiment analysis and discuss concepts related to this field, most important relevant studies and its results, its methods of applications, where it can be applied and its business applications, finally, we will discuss how can sentiment analysis improve the customer retention based on retrieved data.

Download Full-text

Recuperação da Informação em Ambientes Semânticos: uma ferramenta aplicada à publicações científicas

Journal on Advances in Theoretical and Applied Informatics ◽

10.26729/jadi.v1i1.1042 ◽

2015 ◽

Vol 1 (1) ◽

pp. 30 ◽

Cited By ~ 1

Author(s):

Caio Saraiva Coneglian ◽

Elvis Fusco

Keyword(s):

Social Media ◽

Big Data ◽

Information Retrieval ◽

Great Difficulty ◽

Unstructured Data ◽

Added Value ◽

Informational Needs ◽

Retrieval Process ◽

Extraction Agent ◽

The Web

The data available on the Web is growing exponentially, providing information of high added value to organizations. Such information can be arranged in diverse bases and in varied formats, like videos and photos in social media. However, unstructured data present great difficulty for the information retrieval, not efficiently meeting the informational needs of the users, because there are problems in understanding the meaning of documents stored on the Web. In the context of an Information Retrieval architecture, this research aims to The implementation of a semantic extraction agent in the context of the Web that allows the location, treatment and retrieval of information in the context of Big Data in the most varied informational sources that serves as the basis for the implementation of informational environments that aid the Information Retrieval process , Using ontology to add semantics to the process of retrieval and presentation of results obtained to users, thus being able to meet their needs.

Download Full-text

An Analysis on the Validity of Audit Opinion Using Unstructured Data from Audit Reports and Audit Fees and Hours : Emphasis on Utilizing Text Mining and Sentiment Analysis

Korean Accounting Review ◽

10.24056/kar.2019.08.002 ◽

2019 ◽

Vol 44 (4) ◽

pp. 175-214

Author(s):

Hyung Jong Na ◽

Kun Chang Lee ◽

Seung Uk Choi

Keyword(s):

Text Mining ◽

Sentiment Analysis ◽

Audit Fees ◽

Unstructured Data ◽

Audit Opinion ◽

Audit Reports

Download Full-text

Text Mining in Cybersecurity

ACM Computing Surveys ◽

10.1145/3462477 ◽

2021 ◽

Vol 54 (7) ◽

pp. 1-36

Author(s):

Luciano Ignaczak ◽

Guilherme Goldschmidt ◽

Cristiano André Da Costa ◽

Rodrigo Da Rosa Righi

Keyword(s):

Neural Networks ◽

Text Mining ◽

Literature Review ◽

Text Classification ◽

Real World ◽

Classification Performance ◽

Unstructured Data ◽

Future Research ◽

Level Of Automation ◽

Data Volume

The growth of data volume has changed cybersecurity activities, demanding a higher level of automation. In this new cybersecurity landscape, text mining emerged as an alternative to improve the efficiency of the activities involving unstructured data. This article proposes a Systematic Literature Review ( SLR ) to present the application of text mining in the cybersecurity domain. Using a systematic protocol, we identified 2,196 studies, out of which 83 were summarized. As a contribution, we propose a taxonomy to demonstrate the different activities in the cybersecurity domain supported by text mining. We also detail the strategies evaluated in the application of text mining tasks and the use of neural networks to support activities involving unstructured data. The work also discusses text classification performance aiming its application in real-world solutions. The SLR also highlights open gaps for future research, such as the analysis of non-English content and the intensification in the usage of neural networks.

Download Full-text