Schema Extraction of Document-Oriented Database for Data Warehouse

Author(s):  
A. Nurul Istiqamah ◽  
Kemas Rahmat Saleh Wiharja

The data warehouse is a well-known solution for analyzing business data from heterogeneous sources. Unfortunately, a data warehouse can only analyze structured data, yet nowadays, thanks to the popularity of social media and the ease of creating data on the web, we are experiencing a flood of unstructured data. Therefore, we need an approach that can "structure" unstructured data into structured data that a data warehouse can process. To do this, we propose a schema extraction approach using Google Cloud Platform that creates a schema from unstructured data. Based on our experiment, our approach successfully produces a schema from unstructured data. To the best of our knowledge, we are the first to use Google Cloud Platform for schema extraction. We also show that our approach helps database developers understand unstructured data better.
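As an illustration of the general idea only (the paper's Google Cloud Platform pipeline is not detailed in the abstract), a minimal Python sketch can infer a field-to-type schema from a collection of JSON-like documents; all names and data below are illustrative.

```python
from collections import defaultdict

def extract_schema(documents):
    """Infer a field -> set-of-type-names schema from JSON-like documents.

    A toy stand-in for a schema extraction pipeline: document stores have
    optional and inconsistently typed fields, so each field maps to every
    type observed across the collection.
    """
    schema = defaultdict(set)
    for doc in documents:
        for field, value in doc.items():
            schema[field].add(type(value).__name__)
    return dict(schema)

# Example: social media records with inconsistent structure.
docs = [
    {"user": "alice", "likes": 3, "geo": None},
    {"user": "bob", "likes": "12"},  # same field, different type
]
print(extract_schema(docs))
# {'user': {'str'}, 'likes': {'int', 'str'}, 'geo': {'NoneType'}}
```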

2021 ◽  
Vol 7 ◽  
pp. e347
Author(s):  
Bhavana R. Bhamare ◽  
Jeyanthi Prabhu

Due to the massive growth of the Web, people post reviews of products, movies, and places they visit on social media. These reviews help customers as well as product owners evaluate products. Structured data is easier to analyze than unstructured data, and reviews are available in an unstructured format. Aspect-Based Sentiment Analysis mines the aspects of a product from reviews and then determines the sentiment for each aspect. In this work, two methods for aspect extraction are proposed, evaluated on the SemEval restaurant review dataset and the Yelp and Kaggle datasets. The first method is a multivariate filter-based approach to feature selection that selects significant features while reducing redundancy among them; it improves F1-score compared to a method that uses only relevant features selected by Term Frequency weight. The second method extracts features using selective dependency relations obtained with the Stanford NLP parser; features extracted by selective dependency rules outperform features extracted using all dependency rules. A hybrid approach combines lemma features with selective dependency relation features, and with this hybrid feature set 94.78% accuracy and an 85.24% F1-score are achieved in the aspect category prediction task.
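A minimal sketch of the selective-dependency idea, using spaCy in place of the Stanford NLP parser the authors used; the chosen relations (amod, acomp/nsubj) and the example sentence are illustrative, not the paper's configuration.

```python
# pip install spacy && python -m spacy download en_core_web_sm
import spacy

nlp = spacy.load("en_core_web_sm")

def aspect_candidates(review):
    """Return (aspect, opinion) pairs found via selected dependency arcs."""
    pairs = []
    for tok in nlp(review):
        if tok.dep_ == "amod" and tok.head.pos_ == "NOUN":
            # Adjectival modifier: "great pizza" -> (pizza, great)
            pairs.append((tok.head.text, tok.text))
        elif tok.dep_ == "acomp":
            # Copular predicate: "the service was slow" -> (service, slow)
            subjects = [c for c in tok.head.children if c.dep_ == "nsubj"]
            if subjects:
                pairs.append((subjects[0].text, tok.text))
    return pairs

print(aspect_candidates("The pizza was great but the service was painfully slow."))
# [('pizza', 'great'), ('service', 'slow')]
```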


Author(s):  
Wafaa A. Al-Rabayah ◽  
Ahmad Al-Zyoud

Sentiment analysis is the process of determining the polarity (i.e., positive, negative, or neutral) of a given text. The enormous growth of information on the web, especially on social media, makes it challenging to retrieve and analyze that information on time; timely analysis of unstructured data gives businesses a competitive advantage by helping them better understand their customers' needs and preferences. This literature review covers a number of studies on sentiment analysis and examines the connection between sentiment analysis of social network content and customer retention. We focus on sentiment analysis and discuss concepts related to the field, the most important relevant studies and their results, its methods and areas of application, and its business applications; finally, we discuss how sentiment analysis can improve customer retention based on retrieved data.
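The review does not name a specific tool, but a minimal polarity classifier of the kind described can be sketched with NLTK's VADER analyzer; the 0.05 cutoff is a common convention for VADER's compound score, not a value from the paper.

```python
# pip install nltk; then run nltk.download("vader_lexicon") once.
from nltk.sentiment import SentimentIntensityAnalyzer

sia = SentimentIntensityAnalyzer()

def polarity(text, threshold=0.05):
    """Map VADER's compound score to positive/negative/neutral."""
    score = sia.polarity_scores(text)["compound"]
    if score >= threshold:
        return "positive"
    if score <= -threshold:
        return "negative"
    return "neutral"

print(polarity("The support team resolved my issue quickly, very happy!"))  # positive
print(polarity("Still waiting after two weeks. Terrible."))                 # negative
```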


Big Data ◽  
2016 ◽  
pp. 1495-1518
Author(s):  
Mohammad Alaa Hussain Al-Hamami

Big Data comprises the systems, and the techniques emerging around them, that organizations use to remain competitive. Big Data includes structured, semi-structured, and unstructured data. Structured data are data formatted for use in a database management system, while semi-structured and unstructured data include all types of unformatted data, including multimedia and social media content. Among practitioners and applied researchers, the reaction to data available through blogs, Twitter, Facebook, and other social media can be described as a "data rush" promising new insights about consumers' choices, behavior, and many other issues. In the past, Big Data was used only by governments and very large enterprises with the ability to build their own infrastructure for hosting and mining large amounts of data. This chapter shows the requirements for protecting Big Data environments with the same rigorous security strategies applied to traditional database systems.


Author(s):  
Caio Saraiva Coneglian ◽  
Elvis Fusco

The data available on the Web is growing exponentially, providing information of high added value to organizations. Such information is spread across diverse sources and varied formats, such as videos and photos on social media. However, unstructured data makes information retrieval difficult and fails to meet users' informational needs efficiently, because the meaning of documents stored on the Web is hard to understand. In the context of an Information Retrieval architecture, this research aims to implement a semantic extraction agent for the Web that locates, processes, and retrieves information from the most varied Big Data sources, serving as the basis for informational environments that aid the Information Retrieval process. The agent uses an ontology to add semantics to retrieval and to the presentation of results to users, thereby meeting their needs.
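A rough sketch of what such a semantic extraction agent might look like, assuming spaCy for entity extraction and rdflib for the ontology triples; the namespace and property names are invented for illustration and are not the authors' design.

```python
# pip install spacy rdflib && python -m spacy download en_core_web_sm
import spacy
from rdflib import Graph, Literal, Namespace, URIRef
from rdflib.namespace import RDF

nlp = spacy.load("en_core_web_sm")
EX = Namespace("http://example.org/onto/")  # illustrative ontology namespace

def annotate(text, doc_uri):
    """Extract named entities and record them as ontology-typed triples."""
    g = Graph()
    page = URIRef(doc_uri)
    g.add((page, RDF.type, EX.WebResource))
    for ent in nlp(text).ents:
        entity = URIRef(EX[ent.text.replace(" ", "_")])
        g.add((entity, RDF.type, EX[ent.label_]))  # e.g. ex:ORG, ex:GPE
        g.add((page, EX.mentions, entity))
        g.add((entity, EX.surfaceForm, Literal(ent.text)))
    return g

g = annotate("Google announced a partnership with NASA in California.",
             "http://example.org/page/42")
print(g.serialize(format="turtle"))
```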


The Semantic Web is not just a matter of translating HTML into RDF/OWL languages; it is a matter of understanding the content of the web through knowledge graphs, in which entities are connected by relationships. This content is composed of resources (web pages) that contain, for example, text, images, and audio, so entities must be extracted from these resources. Currently, most web content is in HTML5, a W3C recommendation that only marginally describes document structure through annotations. The main challenge is to transform unstructured data in plain HTML files into structured data (e.g., RDF or OWL). The current work provides first-hand guidance for dealing with unstructured, heterogeneous data residing on the web using Twinkle, a Java tool for executing SPARQL queries against FOAF (Friend Of A Friend) documents.
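Twinkle itself is a Java GUI, but the same kind of SPARQL query over a FOAF document can be sketched in Python with rdflib; the inline FOAF snippet and the query are illustrative stand-ins for a document fetched from the web.

```python
# pip install rdflib
from rdflib import Graph

# A tiny inline FOAF document standing in for one fetched from the web.
foaf_doc = """
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
_:a foaf:name "Alice" ; foaf:knows _:b .
_:b foaf:name "Bob" .
"""

g = Graph()
g.parse(data=foaf_doc, format="turtle")

# The same kind of query one would run in Twinkle against a FOAF file.
query = """
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?name ?friend WHERE {
    ?p foaf:name ?name .
    OPTIONAL { ?p foaf:knows/foaf:name ?friend }
}
"""
for row in g.query(query):
    print(row.name, row.friend)
# Alice Bob
# Bob  None
```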


Author(s):  
Sanjeev Kumar Punia ◽  
Manoj Kumar ◽  
Thompson Stephan ◽  
Ganesh Gopal Deverajan ◽  
Rizwan Patan

Broadly, three classes of machine learning classification algorithms are used to discover correlations, hidden patterns, and other useful information in the large data sets known as big data. Today, Twitter, Facebook, Instagram, and many other social media networks are used to collect unstructured data. Converting unstructured data into structured data or meaningful information is a tedious task, and machine learning classification algorithms are used to perform it. In this paper, the authors first collect unstructured research data from a frequently used social media network (i.e., Twitter) using a Twitter application program interface (API) stream. Second, they apply machine learning classification algorithms from the three classes (supervised, unsupervised, and reinforcement), such as decision trees (DT), neural networks (NN), support vector machines (SVM), naive Bayes (NB), linear regression (LR), and k-nearest neighbor (K-NN), to the collected data set. The paper concludes with a comparison of these algorithms.
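A hedged sketch of such a comparison with scikit-learn, using a toy labeled set in place of streamed tweets; LogisticRegression stands in for the paper's "linear regression (LR)" since the task here is classification, and the reinforcement class is omitted.

```python
# pip install scikit-learn
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import MultinomialNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Toy stand-in for tweets pulled via the Twitter streaming API.
texts = ["love this phone", "worst service ever", "great battery life",
         "totally broken on arrival", "absolutely fantastic", "do not buy"] * 5
labels = [1, 0, 1, 0, 1, 0] * 5

models = {
    "DT":   DecisionTreeClassifier(),
    "NN":   MLPClassifier(max_iter=500),
    "SVM":  SVC(),
    "NB":   MultinomialNB(),
    "LR":   LogisticRegression(),  # classification analogue of the paper's LR
    "K-NN": KNeighborsClassifier(n_neighbors=3),
}

for name, model in models.items():
    pipe = make_pipeline(TfidfVectorizer(), model)
    scores = cross_val_score(pipe, texts, labels, cv=5)
    print(f"{name}: mean accuracy {scores.mean():.2f}")
```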


Author(s):  
Cate Dowd

Trigonometry in NLP (Natural Language Processing) algorithms can sort word connotations. The triple structure of RDF grammar also extends to semantics in machine learning and big-data processing, but ontologies and a metamodel are essential for meaningful relations across data, and they should inform the design of new journalism systems. Major processing platforms used by Facebook and Yahoo are distributed systems like Hadoop, with resource negotiation features and computations applied to text. Google's NLP likewise uses cosine vectors for the connotations of words. Data processing already works across structured data, such as online news tags, and unstructured data, such as social media tags with folksonomy characteristics, although social media also uses structured data. However, journalism has yet to build semantic systems from an ontological base. To that end, ontologies spanning journalism, social media, and public relations, together with a little OWL to reason about resources, can inform AI sub-systems and wider system perspectives.
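The "cosine vectors" idea reduces to cosine similarity between word-embedding vectors: words with similar connotations point in similar directions. A minimal NumPy sketch, using toy 3-dimensional vectors in place of learned embeddings.

```python
import numpy as np

def cosine_similarity(u, v):
    """cos(theta) = (u . v) / (|u| |v|): near 1 = similar direction, near 0 = unrelated."""
    return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

# Toy 3-d embeddings; real systems use hundreds of dimensions learned from text.
vectors = {
    "happy":  np.array([0.9, 0.8, 0.1]),
    "joyful": np.array([0.85, 0.75, 0.2]),
    "tax":    np.array([0.1, 0.2, 0.9]),
}
print(cosine_similarity(vectors["happy"], vectors["joyful"]))  # close to 1
print(cosine_similarity(vectors["happy"], vectors["tax"]))     # much smaller
```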

