Transformation in Data Storing Technique- Big Data: A Literature Review

Author(s):  
Adarsh Neema

Abstract: Loss of data implies loss of valuable information. An appropriate gathering of data and finding hidden patterns out of it is key for any business organization to thrive fiscally. With exponential rise in the internet users from the early 2000’s, traditional databases fell short to collect the enormous amount of unstructured data/ semi-structured data, which contained extremely insightful information. Today, data accumulated is not only enormous, but also collected with high speed, having variety, which necessitated special database/software for data gathering and making key decisions based on that. These gigantic amounts of data generated can advocate companies to examine the market trends, market demands, and customer expectation, which endorses them to make relevant foremost decisions. This study discusses the stymie in conventional databases to process the immense data and entailment of advanced databases/software. Furthermore, a case study is presented later to understand the role of big data analytics in business and technical firms. Keywords: Big Data, structured data, unstructured data, NoSQL, Hadoop.

2017 ◽  
Vol 2 (6) ◽  
pp. 570
Author(s):  
Cungki Kusdarjito

The advancement of big data analytics is paving the way for knowledge creation based on very huge and unstructured data. Currently, information is scattered and growth tremendously, containing many information but difficult to be interpreted. Consequently, traditional approaches are no longer suitable for unstructured data but very rich in information. This situation is different from the role of previous information technology in which information is based on structured data, stored in the local storage, and in more advanced form, information can be retrieved through internet. Meanwhile, in Indonesia data are collected by many institutions with different measurement standard. The nature of the data collection is top-down, carried out by survey which is expensive yet unreliable and stored exclusively by respective institution. SIDeKa (Sistem Informasi Desa dan Kawasan/Village and Regional Information System), which are connected nationally, is proposed as a system of data collection in the village level and prepared by local people. Using SIDeKa, data reliability and readiness can be improved at the local level. The goals of the SIDeKa is not only local people have information in their hand such as poverty level, production, commodity price, the area of cultivated land, and the outbreak of diseases in their village, but also they have information from the neighboring villages or event at the national level. For government, data reliability will improve the policy effectiveness. This paper discusses the implementation and role of SIDeKa for knowledge creation in the village level, especially for the agricultural activities which has been initiated in 2015.Keywords: big data analytics; SIDeKa;  unstructured data.


Author(s):  
Andreas Schmidt ◽  
Martin Atzmueller ◽  
Martin Hollender

This chapter provides an overview of methods for preprocessing structured and unstructured data in the scope of Big Data. Specifically, this chapter summarizes according methods in the context of a real-world dataset in a petro-chemical production setting. The chapter describes state-of-the-art methods for data preparation for Big Data Analytics. Furthermore, the chapter discusses experiences and first insights in a specific project setting with respect to a real-world case study. Furthermore, interesting directions for future research are outlined.


2017 ◽  
Vol 31 (3) ◽  
pp. 63-79 ◽  
Author(s):  
Greg Richins ◽  
Andrea Stapleton ◽  
Theophanis C. Stratopoulos ◽  
Christopher Wong

ABSTRACT Contrary to Frey and Osborne's (2013) prediction that the accounting profession faces extinction, we argue that accountants can still create value in a world of Big Data analytics. To advance this position, we provide a conceptual framework based on structured/unstructured data and problem-driven/exploratory analysis. We argue that accountants already excel at problem-driven analysis of structured data, are well positioned to play a leading role in the problem-driven analysis of unstructured data, and can support data scientists performing exploratory analysis on Big Data. Our argument rests on two pillars: accountants are familiar with structured datasets, easing the transition to working with unstructured data, and possess knowledge of business fundamentals. Thus, rather than replacing accountants, we argue that Big Data analytics complements accountants' skills and knowledge. However, educators, standard setters, and professional bodies must adjust their curricula, standards, and frameworks to accommodate the challenges of Big Data analytics.


2019 ◽  
Vol 19 (1) ◽  
pp. 24-47 ◽  
Author(s):  
Matteo Golfarelli ◽  
Stefano Rizzi

In big data analytics, advanced analytic techniques operate on big datasets aimed at complementing the role of traditional OLAP for decision making. To enable companies to take benefit of these techniques despite the lack of in-house technical skills, the H2020 TOREADOR Project adopts a model-driven architecture for streamlining analysis processes, from data preparation to their visualization. In this article, we propose a new approach named SkyViz focused on the visualization area, in particular on (1) how to specify the user’s objectives and describe the dataset to be visualized, (2) how to translate this specification into a platform-independent visualization type, and (3) how to concretely implement this visualization type on the target execution platform. To support step (1), we define a visualization context based on seven prioritizable coordinates for assessing the user’s objectives and conceptually describing the data to be visualized. To automate step (2), we propose a skyline-based technique that translates a visualization context into a set of most suitable visualization types. Finally, to automate step (3), we propose a skyline-based technique that, with reference to a specific platform, finds the best bindings between the columns of the dataset and the graphical coordinates used by the visualization type chosen by the user. SkyViz can be transparently extended to include more visualization types on one hand, more visualization coordinates on the other. The article is completed by an evaluation of SkyViz based on a case study excerpted from the pilot applications of the TOREADOR Project.


2021 ◽  
Vol 13 ◽  
pp. 175628722199813
Author(s):  
B. M. Zeeshan Hameed ◽  
Aiswarya V. L. S. Dhavileswarapu ◽  
Nithesh Naik ◽  
Hadis Karimi ◽  
Padmaraj Hegde ◽  
...  

Artificial intelligence (AI) has a proven record of application in the field of medicine and is used in various urological conditions such as oncology, urolithiasis, paediatric urology, urogynaecology, infertility and reconstruction. Data is the driving force of AI and the past decades have undoubtedly witnessed an upsurge in healthcare data. Urology is a specialty that has always been at the forefront of innovation and research and has rapidly embraced technologies to improve patient outcomes and experience. Advancements made in Big Data Analytics raised the expectations about the future of urology. This review aims to investigate the role of big data and its blend with AI for trends and use in urology. We explore the different sources of big data in urology and explicate their current and future applications. A positive trend has been exhibited by the advent and implementation of AI in urology with data available from several databases. The extensive use of big data for the diagnosis and treatment of urological disorders is still in its early stage and under validation. In future however, big data will no doubt play a major role in the management of urological conditions.


2021 ◽  
pp. 097226292110225
Author(s):  
Shobhana Chandra ◽  
Sanjeev Verma

Big data (BD) is making advances in promoting sustainable consumption behaviour and has attracted the attention of researchers worldwide. Despite the increased focus, the findings of studies on this topic are fragmented, and future researchers need a systematic understanding of the existing literature for identification of the research scope. This study offers a systematic review of the role of BD in promoting sustainable-consumption behaviour with the help of a bibliometric analysis, followed by a thematic analysis. The findings suggest that businesses deploy BD to create sustainable consumer experiences, predict consumer buying patterns, design and alter business models and create nudges for sustainable consumption, while consumers are forcing businesses to develop green operations and supply chains to reduce the latter’s carbon footprint. The major research gaps for future researchers are in the following areas: the impact of big data analytics (BDA) on consumerism, the role of BD in the formation of sustainable habits and consumer knowledge creation for sustainable consumption and prediction of green consumer behaviour.


IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 54595-54614 ◽  
Author(s):  
Syed Attique Shah ◽  
Dursun Zafer Seker ◽  
Sufian Hameed ◽  
Dirk Draheim

2020 ◽  
Vol 98 ◽  
pp. 68-78 ◽  
Author(s):  
Aseem Kinra ◽  
Samaneh Beheshti-Kashi ◽  
Rasmus Buch ◽  
Thomas Alexander Sick Nielsen ◽  
Francisco Pereira

2015 ◽  
Vol 2015 ◽  
pp. 1-16 ◽  
Author(s):  
Ashwin Belle ◽  
Raghuram Thiagarajan ◽  
S. M. Reza Soroushmehr ◽  
Fatemeh Navidi ◽  
Daniel A. Beard ◽  
...  

The rapidly expanding field of big data analytics has started to play a pivotal role in the evolution of healthcare practices and research. It has provided tools to accumulate, manage, analyze, and assimilate large volumes of disparate, structured, and unstructured data produced by current healthcare systems. Big data analytics has been recently applied towards aiding the process of care delivery and disease exploration. However, the adoption rate and research development in this space is still hindered by some fundamental problems inherent within the big data paradigm. In this paper, we discuss some of these major challenges with a focus on three upcoming and promising areas of medical research: image, signal, and genomics based analytics. Recent research which targets utilization of large volumes of medical data while combining multimodal data from disparate sources is discussed. Potential areas of research within this field which have the ability to provide meaningful impact on healthcare delivery are also examined.


Sign in / Sign up

Export Citation Format

Share Document