Data Literacy

Author(s):  
Dimitar Christozov ◽  
Katia Rasheva-Yordanova

The article shares the authors' experiences in training bachelor-level students to explore Big Data applications in solving nowadays problems. The article discusses curriculum issues and pedagogical techniques connected to developing Big Data competencies. The following objectives are targeted: The importance and impact of making rational, data driven decisions in the Big Data era; Complexity of developing and exploring a Big Data Application in solving real life problems; Learning skills to adopt and explore emerging technologies; and Knowledge and skills to interpret and communicate results of data analysis via combining domain knowledge with system expertise. The curriculum covers: The two general uses of Big Data Analytics Applications, which are well distinguished from the point of view of end-user's objectives (presenting and visualizing data via aggregation and summarization [data warehousing: data cubes, dash boards, etc.] and learning from Data [data mining techniques]); Organization of Data Sources: distinction of Master Data from Operational Data, in particular; Extract-Transform-Load (ETL) process; and Informing vs. Misinforming, including the issue of over-trust vs. under-trust of obtained analytical results.

2021 ◽  
Vol 5 (2) ◽  
pp. 121-134
Author(s):  
Babek Erdebilli ◽  
Emine Nur NACAR

Aim: The purpose of this article is to present the latest advances in big data applications in the industries of the transportation sector such as airline, highway, and railway. It is difficult to analyze data in transportation because there is continuous real-time data flow. Since the improvements made are fast with the same logic, it is necessary to catch up with the new developments. Data should be analyzed with the big data concept because data stacks highly contain non-structural data types in transportation data. Although the mentioned industries are complementary to each other, the applications differ depending on the needs of the industry. Thus, solutions to specific problems in different industries using big data applications should be addressed. Design / Research methods: In accordance with the purpose of the study, big data studies that provide added value to the transportation sector were examined. Studies have been filtered through some criteria which are whether the application is adaptable to the industry, the study is available online in full-text, and its references are from respectable sources.   Conclusions / findings: All the big data application studies in the academy are not adaptable in real-life problems or suitable for all situations. For this reason, trying all of the applications will lead to moral and material losses for firms. This study is a guideline for companies to follow the developments in the big data concept and to choose the one that suits their problems. Thus, the gap between academia and industry was tried to close. Originality / value of the article: Although studies are referring to big data applications in the transportation sector, this study differs from others in terms of specifically analyzing big data applications in different industries such as airline, highway, and railway in the transportation sector


2021 ◽  
Vol 11 (5) ◽  
pp. 2340
Author(s):  
Sanjay Mathrani ◽  
Xusheng Lai

Web data have grown exponentially to reach zettabyte scales. Mountains of data come from several online applications, such as e-commerce, social media, web and sensor-based devices, business web sites, and other information types posted by users. Big data analytics (BDA) can help to derive new insights from this huge and fast-growing data source. The core advantage of BDA technology is in its ability to mine these data and provide information on underlying trends. BDA, however, faces innate difficulty in optimizing the process and capabilities that require merging of diverse data assets to generate viable information. This paper explores the BDA process and capabilities in leveraging data via three case studies who are prime users of BDA tools. Findings emphasize four key components of the BDA process framework: system coordination, data sourcing, big data application service, and end users. Further building blocks are data security, privacy, and management that represent services for providing functionality to the four components of the BDA process across information and technology value chains.


Author(s):  
Bernard Tuffour Atuahene ◽  
Sittimont Kanjanabootra ◽  
Thayaparan Gajendran

Big data applications consist of i) data collection using big data sources, ii) storing and processing the data, and iii) analysing data to gain insights for creating organisational benefit. The influx of digital technologies and digitization in the construction process includes big data as one newly emerging digital technology adopted in the construction industry. Big data application is in a nascent stage in construction, and there is a need to understand the tangible benefit(s) that big data can offer the construction industry. This study explores the benefits of big data in the construction industry. Using a qualitative case study design, construction professionals in an Australian Construction firm were interviewed. The research highlights that the benefits of big data include reduction of litigation amongst projects stakeholders, enablement of near to real-time communication, and facilitation of effective subcontractor selection. By implication, on a broader scale, these benefits can improve contract management, procurement, and management of construction projects. This study contributes to an ongoing discourse on big data application, and more generally, digitization in the construction industry.


Author(s):  
Aakriti Shukla ◽  
◽  
Dr Damodar Prasad Tiwari ◽  

Dimension reduction or feature selection is thought to be the backbone of big data applications in order to improve performance. Many scholars have shifted their attention in recent years to data science and analysis for real-time applications using big data integration. It takes a long time for humans to interact with big data. As a result, while handling high workload in a distributed system, it is necessary to make feature selection elastic and scalable. In this study, a survey of alternative optimizing techniques for feature selection are presented, as well as an analytical result analysis of their limits. This study contributes to the development of a method for improving the efficiency of feature selection in big complicated data sets.


Author(s):  
Jing Yang ◽  
Quan Zhang ◽  
Kunpeng Liu ◽  
Peng Jin ◽  
Guoyi Zhao

In recent years, electricity big data has extensive applications in the grid companies across the provinces. However, certain problems are encountered including, the inability to generate an ideal model using the isolated data possessed by each company, and the priority concerns for data privacy and safety during big data application and sharing. In this pursuit, the present research envisaged the application of federated learning to protect the local data, and to build a uniform model for different companies affiliated to the State Grid. Federated learning can serve as an essential means for realizing the grid-wide promotion of the achievements of big data applications, while ensuring the data safety.


Author(s):  
M. Asif Naeem ◽  
Gillian Dobbie ◽  
Gerald Weber

In order to make timely and effective decisions, businesses need the latest information from big data warehouse repositories. To keep these repositories up to date, real-time data integration is required. An important phase in real-time data integration is data transformation where a stream of updates, which is huge in volume and infinite, is joined with large disk-based master data. Stream processing is an important concept in Big Data, since large volumes of data are often best processed immediately. A well-known algorithm called Mesh Join (MESHJOIN) was proposed to process stream data with disk-based master data, which uses limited memory. MESHJOIN is a candidate for a resource-aware system setup. The problem that the authors consider in this chapter is that MESHJOIN is not very selective. In particular, the performance of the algorithm is always inversely proportional to the size of the master data table. As a consequence, the resource consumption is in some scenarios suboptimal. They present an algorithm called Cache Join (CACHEJOIN), which performs asymptotically at least as well as MESHJOIN but performs better in realistic scenarios, particularly if parts of the master data are used with different frequencies. In order to quantify the performance differences, the authors compare both algorithms with a synthetic dataset of a known skewed distribution as well as TPC-H and real-life datasets.


Author(s):  
Emmanuel N. A. Tetteh

The equilibration that underscores the internet of things (IoT) and big data analytics (BDA) cannot be underestimated at the behest of real-life social challenges and significant policy data generated to redress the concerns of epistemic communities, such as political policy actors, stakeholders, and the citizenry. The cognitive balancing of new information gathered by BDA and assimilated across the IoT is at the crossroads of ascertaining how the growing increases of such BDA can be better managed to transition from the big data state of disequilibration to reach a more stable equilibrium of policy data usefulness. In the quest for explicating the equilibration of policy data usefulness, an account of the curriculum-based MPA policy analysis and analytics concentration program at Norwich University is described as a case example of big data policy-analytic epistemology. The case study offers a symbolic ideology of an IoT action-learning solution model as a recommendation for fostering the stable equilibration of policy data usefulness.


Author(s):  
Sreenu G. ◽  
M.A. Saleem Durai

Advances in recent hardware technology have permitted to document transactions and other pieces of information of everyday life at an express pace. In addition of speed up and storage capacity, real-life perceptions tend to transform over time. However, there are so much prospective and highly functional values unseen in the vast volume of data. For this kind of applications conventional data mining is not suitable, so they should be tuned and changed or designed with new algorithms. Big data computing is inflowing to the category of most hopeful technologies that shows the way to new ways of thinking and decision making. This epoch of big data helps users to take benefit out of all available data to gain more precise systematic results or determine latent information, and then make best possible decisions. Depiction from a broad set of workloads, the author establishes a set of classifying measures based on the storage architecture, processing types, processing techniques and the tools and technologies used.


Author(s):  
Karthiga Shankar ◽  
Suganya R.

Consumers are spending more and more time on the web to search information and receive e-services. E-commerce, e-government, e-business, e-learning, e-science, etc. reflect the growing importance of the web in all aspects of our lives. Along with the tremendous growth of online information, the use of big data has become a vital force in growing revenues. Consumers are today shopping multiple products across multiple channels online. This transformation is substantial and many of the e-commerce companies have now turned to big data analytics for focused customer group targeting using opinion mining for evaluating campaign strategies and maintaining a competitive advantage, especially during the festive shopping season. So, the role of intelligent techniques in e-servicing is massive. This chapter focuses on the importance of big data (since there is a large volume of data online) and big data analytics in the field of e-servicing and explains the various applications, platforms to implement the big data applications, and security issues in the era of big data and e-servicing.


Sign in / Sign up

Export Citation Format

Share Document