An analytical study of information extraction from unstructured and multidimensional big data

Abstract Process of information extraction (IE) is used to extract useful information from unstructured or semi-structured data. Big data arise new challenges for IE techniques with the rapid growth of multifaceted also called as multidimensional unstructured data. Traditional IE systems are inefficient to deal with this huge deluge of unstructured big data. The volume and variety of big data demand to improve the computational capabilities of these IE systems. It is necessary to understand the competency and limitations of the existing IE techniques related to data pre-processing, data extraction and transformation, and representations for huge volumes of multidimensional unstructured data. Numerous studies have been conducted on IE, addressing the challenges and issues for different data types such as text, image, audio and video. Very limited consolidated research work have been conducted to investigate the task-dependent and task-independent limitations of IE covering all data types in a single study. This research work address this limitation and present a systematic literature review of state-of-the-art techniques for a variety of big data, consolidating all data types. Recent challenges of IE are also identified and summarized. Potential solutions are proposed giving future research directions in big data IE. The research is significant in terms of recent trends and challenges related to big data analytics. The outcome of the research and recommendations will help to improve the big data analytics by making it more productive.

Download Full-text

An experimental investigation of the impact of using big data analytics on customers’ performance measurement

Accounting Research Journal ◽

10.1108/arj-04-2020-0080 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Marwa Rabe Mohamed Elkmash ◽

Magdy Gamal Abdel-Kader ◽

Bassant Badr El Din

Keyword(s):

Big Data ◽

Performance Measurement ◽

Data Analytics ◽

Positive Impact ◽

Big Data Analytics ◽

Unstructured Data ◽

Future Research ◽

Web Based ◽

Content Type ◽

The Impact

Purpose This study aims to investigate and explore the impact of big data analytics (BDA) as a mechanism that could develop the ability to measure customers’ performance. To accomplish the research aim, the theoretical discussion was developed through the combination of the diffusion of innovation theory with the technology acceptance model (TAM) that is less developed for the research field of this study. Design/methodology/approach Empirical data was obtained using Web-based quasi-experiments with 104 Egyptian accounting professionals. Further, the Wilcoxon signed-rank test and the chi-square goodness-of-fit test were used to analyze data. Findings The empirical results indicate that measuring customers’ performance based on BDA increase the organizations’ ability to analyze the customers’ unstructured data, decrease the cost of customers’ unstructured data analysis, increase the ability to handle the customers’ problems quickly, minimize the time spent to analyze the customers’ data and obtaining the customers’ performance reports and control managers’ bias when they measure customer satisfaction. The study findings supported the accounting professionals’ acceptance of BDA through the TAM elements: the intention to use (R), perceived usefulness (U) and the perceived ease of use (E). Research limitations/implications This study has several limitations that could be addressed in future research. First, this study focuses on customers’ performance measurement (CPM) only and ignores other performance measurements such as employees’ performance measurement and financial performance measurement. Future research can examine these areas. Second, this study conducts a Web-based experiment with Master of Business Administration students as a study’s participants, researchers could conduct a laboratory experiment and report if there are differences. Third, owing to the novelty of the topic, there was a lack of theoretical evidence in developing the study’s hypotheses. Practical implications This study succeeds to provide the much-needed empirical evidence for BDA positive impact in improving CPM efficiency through the proposed framework (i.e. CPM and BDA framework). Furthermore, this study contributes to the improvement of the performance measurement process, thus, the decision-making process with meaningful and proper insights through the capability of collecting and analyzing the customers’ unstructured data. On a practical level, the company could eventually use this study’s results and the new insights to make better decisions and develop its policies. Originality/value This study holds significance as it provides the much-needed empirical evidence for BDA positive impact in improving CPM efficiency. The study findings will contribute to the enhancement of the performance measurement process through the ability of gathering and analyzing the customers’ unstructured data.

Download Full-text

Semantic Technologies and Big Data Analytics for Cyber Defence

Web Services ◽

10.4018/978-1-5225-7501-6.ch074 ◽

2019 ◽

pp. 1430-1443

Author(s):

Louise Leenen ◽

Thomas Meyer

Keyword(s):

Decision Making ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Future Research ◽

Data Sets ◽

Semantic Technologies ◽

Data Types ◽

Intelligent Decision Making ◽

Big Data Technologies

The Governments, military forces and other organisations responsible for cybersecurity deal with vast amounts of data that has to be understood in order to lead to intelligent decision making. Due to the vast amounts of information pertinent to cybersecurity, automation is required for processing and decision making, specifically to present advance warning of possible threats. The ability to detect patterns in vast data sets, and being able to understanding the significance of detected patterns are essential in the cyber defence domain. Big data technologies supported by semantic technologies can improve cybersecurity, and thus cyber defence by providing support for the processing and understanding of the huge amounts of information in the cyber environment. The term big data analytics refers to advanced analytic techniques such as machine learning, predictive analysis, and other intelligent processing techniques applied to large data sets that contain different data types. The purpose is to detect patterns, correlations, trends and other useful information. Semantic technologies is a knowledge representation paradigm where the meaning of data is encoded separately from the data itself. The use of semantic technologies such as logic-based systems to support decision making is becoming increasingly popular. However, most automated systems are currently based on syntactic rules. These rules are generally not sophisticated enough to deal with the complexity of decisions required to be made. The incorporation of semantic information allows for increased understanding and sophistication in cyber defence systems. This paper argues that both big data analytics and semantic technologies are necessary to provide counter measures against cyber threats. An overview of the use of semantic technologies and big data technologies in cyber defence is provided, and important areas for future research in the combined domains are discussed.

Download Full-text

Big Data Analytics Driven Supply Chain Transformation

Advances in E-Business Research - Business Transformations in the Era of Digitalization ◽

10.4018/978-1-5225-7262-6.ch007 ◽

2019 ◽

pp. 106-124 ◽

Cited By ~ 2

Author(s):

Mondher Feki

Keyword(s):

Big Data ◽

Supply Chain ◽

Data Analytics ◽

Big Data Analytics ◽

Future Research ◽

Research Directions ◽

Digital Platforms ◽

New Frontier ◽

Future Research Directions ◽

Chain Transformation

Big data has emerged as the new frontier in supply chain management; however, few firms know how to embrace big data and capitalize on its value. The non-stop production of massive amounts of data on various digital platforms has prompted academics and practitioners to focus on the data economy. Companies must rethink how to harness big data and take full advantage of its possibilities. Big data analytics can help them in giving valuable insights. This chapter provides an overview of big data analytics use in the supply chain field and underlines its potential role in the supply chain transformation. The results show that big data analytics techniques can be categorized into three types: descriptive, predictive, and prescriptive. These techniques influence supply chain processes and create business value. This study sets out future research directions.

Download Full-text

Computational and Data Mining Perspectives on HIV/AIDS in Big Data Era

10.4018/978-1-6684-3662-2.ch072 ◽

2022 ◽

pp. 1477-1503

Author(s):

Ali Al Mazari

Keyword(s):

Data Mining ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Healthcare Sector ◽

Future Research ◽

Research Directions ◽

Scientific Disciplines ◽

Future Research Directions ◽

Hiv Aids

HIV/AIDS big data analytics evolved as a potential initiative enabling the connection between three major scientific disciplines: (1) the HIV biology emergence and evolution; (2) the clinical and medical complex problems and practices associated with the infections and diseases; and (3) the computational methods for the mining of HIV/AIDS biological, medical, and clinical big data. This chapter provides a review on the computational and data mining perspectives on HIV/AIDS in big data era. The chapter focuses on the research opportunities in this domain, identifies the challenges facing the development of big data analytics in HIV/AIDS domain, and then highlights the future research directions of big data in the healthcare sector.

Download Full-text

Limitations of information extraction methods and techniques for heterogeneous unstructured big data

International Journal of Engineering Business Management ◽

10.1177/1847979019890771 ◽

2019 ◽

Vol 11 ◽

pp. 184797901989077 ◽

Cited By ~ 1

Author(s):

Kiran Adnan ◽

Rehan Akbar

Keyword(s):

Big Data ◽

Information Extraction ◽

Extraction Methods ◽

Unstructured Data ◽

Future Research ◽

Data Sets ◽

Data Types ◽

Efficiency And Effectiveness ◽

Single Data ◽

The Impact

During the recent era of big data, a huge volume of unstructured data are being produced in various forms of audio, video, images, text, and animation. Effective use of these unstructured big data is a laborious and tedious task. Information extraction (IE) systems help to extract useful information from this large variety of unstructured data. Several techniques and methods have been presented for IE from unstructured data. However, numerous studies conducted on IE from a variety of unstructured data are limited to single data types such as text, image, audio, or video. This article reviews the existing IE techniques along with its subtasks, limitations, and challenges for the variety of unstructured data highlighting the impact of unstructured big data on IE techniques. To the best of our knowledge, there is no comprehensive study conducted to investigate the limitations of existing IE techniques for the variety of unstructured big data. The objective of the structured review presented in this article is twofold. First, it presents the overview of IE techniques from a variety of unstructured data such as text, image, audio, and video at one platform. Second, it investigates the limitations of these existing IE techniques due to the heterogeneity, dimensionality, and volume of unstructured big data. The review finds that advanced techniques for IE, particularly for multifaceted unstructured big data sets, are the utmost requirement of the organizations to manage big data and derive strategic information. Further, potential solutions are also presented to improve the unstructured big data IE systems for future research. These solutions will help to increase the efficiency and effectiveness of the data analytics process in terms of context-aware analytics systems, data-driven decision-making, and knowledge management.

Download Full-text

COVID‐19 Pandemic in the New Era of Big Data Analytics: Methodological Innovations and Future Research Directions

British Journal of Management ◽

10.1111/1467-8551.12441 ◽

2020 ◽

Author(s):

Jie Sheng ◽

Joseph Amankwah‐Amoah ◽

Zaheer Khan ◽

Xiaojun Wang

Keyword(s):

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Future Research ◽

Research Directions ◽

New Era ◽

Future Research Directions

Download Full-text

Integrated Data Architecture for Business

Advanced Methodologies and Technologies in Business Operations and Management - Advances in Logistics, Operations, and Management Science ◽

10.4018/978-1-5225-7362-3.ch035 ◽

2019 ◽

pp. 478-490

Author(s):

Richard Kumaradjaja

Keyword(s):

Big Data ◽

Data Integration ◽

Data Analytics ◽

Big Data Analytics ◽

Point Of View ◽

Future Research ◽

Integration Framework ◽

Architecture Framework ◽

Data Architecture ◽

Future Research Directions

This chapter describes data integration issues in big data analytics and proposes an integrated data integration framework for big data analytics. The main focus of this chapter is to address the issues of data integration from the architectural point of view. Addressing the issues of data integration from the architectural point of view will lead to a better understanding of the current situation and better construction of proposed solutions to those issues since architectural approach can give us a holistic and comprehensive view of the problems. The chapter also discusses future research directions of the proposed integrated data architecture framework.

Download Full-text

Information Extraction from Multifaceted Unstructured Big Data

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.b1074.0882s819 ◽

2019 ◽

Vol 8 (2S8) ◽

pp. 1398-1404

Keyword(s):

Big Data ◽

Information Extraction ◽

Data Analytics ◽

Extraction Methods ◽

Vital Role ◽

Unstructured Data ◽

Digital Data ◽

High Rate ◽

Smart Manufacturing ◽

Future Research

In the era of digital globalization, huge volume and variety of data are being produced at a very high rate. Every day, the world is producing around 2.5 quintillion bytes of data. According to IDC, by 2020, over 40 zettabytes of data will be generated and reproduced. Digital data have become a deluge, overwhelming in every field of information technology (IT), business, science and engineering. These fields are shifting to smart and advanced technologies such as smart manufacturing industries, data-aware medical sciences, and other smart applications. These applications are facilitating the industries in context of data-driven decision making, big data storage, and complex analysis of large data sets. Also, these applications are contributing to generate big data deluge where a variety of data necessitate the industries to use advanced IT approaches. 95% of the digital universe is unstructured data. It is rich data as it contains information that can play a vital role to improve big data analytics. The heterogeneity, complexity, lack of structured information, poor quality and scalability of unstructured data generates difficulties in adapting traditional information extraction techniques. Information extraction can play a vital role in transformation of unstructured data into useful information. A multistep pipeline with data preprocessing steps, extraction methods and representation are utmost requirement to improve the unstructured data analytics. In this regard, this paper presents a short review of information extraction process w.r.t. input data type, extraction methods with their corresponding techniques, and representation of extracted information. The issues with unstructured data and the challenges to information extraction from multifaceted unstructured big data as well as the future research directions have also been discussed

Download Full-text

Artificial Intelligence and Big Data Analytics in Support of Cyber Defense

Research Anthology on Artificial Intelligence Applications in Security ◽

10.4018/978-1-7998-7705-9.ch076 ◽

2021 ◽

pp. 1738-1753

Author(s):

Louise Leenen ◽

Thomas Meyer

Keyword(s):

Artificial Intelligence ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Future Research ◽

Data Sets ◽

Data Types ◽

Cyber Defense ◽

Intelligent Decision Making ◽

Big Data Technologies

Cybersecurity analysts rely on vast volumes of security event data to predict, identify, characterize, and deal with security threats. These analysts must understand and make sense of these huge datasets in order to discover patterns which lead to intelligent decision making and advance warnings of possible threats, and this ability requires automation. Big data analytics and artificial intelligence can improve cyber defense. Big data analytics methods are applied to large data sets that contain different data types. The purpose is to detect patterns, correlations, trends, and other useful information. Artificial intelligence provides algorithms that can reason or learn and improve their behavior, and includes semantic technologies. A large number of automated systems are currently based on syntactic rules which are generally not sophisticated enough to deal with the level of complexity in this domain. An overview of artificial intelligence and big data technologies in cyber defense is provided, and important areas for future research are identified and discussed.

Download Full-text