PerTract: Model Extraction and Specification of Big Data Systems for Performance Prediction by the Example of Apache Spark and Hadoop



An Empirical Research on Service-Oriented Architecture (SOA) for Data Exchange in BIGDATA Systems

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽ 10.35940/ijitee.c1087.0193s20 ◽ 2020 ◽ Vol 9 (3S) ◽ pp. 407-409

Keyword(s): Big Data ◽ Data Exchange ◽ Success Factors ◽ Service Oriented Architecture ◽ Data Gathering ◽ Value Added ◽ Data Systems ◽ Big Data Applications ◽ Service Oriented ◽ Big Data Systems

Service-Oriented Architecture (SOA) is an approach to designing, developing, and managing systems that represent reusable business functionality. The objective of this study is to identify the critical success factors needed to implement SOA in Big Data systems. The study also aims to classify erroneous practices in SOA implementation. The adoption of SOA has prompted authors to document its uses and applications. The analysed results should be useful for researchers who would like to implement SOA with Big Data systems. SOA brings numerous advantages, such as added flexibility and better alignment among processes, as well as reduced integration and maintenance costs. Generally, Big Data concerns large-volume, complex, growing data sets with multiple, autonomous sources. Big Data applications are those where data collection has grown tremendously and is beyond the ability of commonly used software tools to capture, manage, and process within a tolerable elapsed time [1]. The most fundamental challenge for Big Data applications is to explore the large volumes of data and extract useful information or knowledge for future actions. Zhang and Yang suggest a reengineering approach that restructures legacy systems toward SOA by taking the organization into account. This paper also describes various challenges of SOA and identifies the problems that must be addressed to improve SOA-based services for data exchange in Big Data systems.


Security and Privacy Issues of Big Data

Web Services ◽ 10.4018/978-1-5225-7501-6.ch114 ◽ 2019 ◽ pp. 2197-2229

Author(s): José Moura ◽ Carlos Serrão

Keyword(s): Social Networks ◽ Big Data ◽ Personal Data ◽ Security And Privacy ◽ Data Systems ◽ Computing Systems ◽ Big Data Applications ◽ Big Data Systems ◽ Privacy Issues

This chapter reviews the most important aspects of how computing infrastructures should be configured and intelligently managed to fulfill the most notable security requirements of Big Data applications. One of these is privacy, a pertinent issue because users share more and more personal data and content from their devices and computers with social networks and public clouds. A secure framework for social networks is therefore a very active research topic, and it is addressed in one of the chapter's two case-study sections. In addition, traditional security mechanisms such as firewalls and demilitarized zones are not well suited to the computing systems that support Big Data. Software-defined networking (SDN) is an emerging management solution that could become a convenient mechanism for implementing security in Big Data systems, as we show through a second case study at the end of the chapter. The chapter also discusses current relevant work and identifies open issues.


Big Data Management Canvas: A Reference Model for Value Creation from Data

Big Data and Cognitive Computing ◽ 10.3390/bdcc3010019 ◽ 2019 ◽ Vol 3 (1) ◽ pp. 19 ◽ Cited By ~ 5

Author(s): Michael Kaufmann

Keyword(s): Big Data ◽ Data Management ◽ Value Creation ◽ Reference Model ◽ Solution Space ◽ Use Cases ◽ Data Systems ◽ Big Data Applications ◽ Big Data Systems ◽ Map Data

Many big data projects are technology-driven and thus expensive and inefficient. It is often unclear how to exploit existing data resources and map data, systems and analytics results to actual use cases. Existing big data reference models are mostly either technological or business-oriented in nature, but do not consistently align both aspects. To address this issue, a reference model for big data management is proposed that operationalizes value creation from big data by linking business targets with technical implementation. The purpose of this model is to provide a goal- and value-oriented framework to effectively map and plan purposeful big data systems aligned with a clear value proposition. Based on an epistemic model that conceptualizes big data management as a cognitive system, the solution space of data value creation is divided into five layers: preparation, analysis, interaction, effectuation, and intelligence. To operationalize the model, each of these layers is subdivided into corresponding business and IT aspects to create a link from use cases to technological implementation. The resulting reference model, the big data management canvas, can be applied to classify and extend existing big data applications and to derive and plan new big data solutions, visions, and strategies for future projects. To validate the model in the context of existing information systems, the paper describes three cases of big data management in existing companies.


Time Estimation and Resource Minimization Scheme for Apache Spark and Hadoop Big Data Systems With Failures

IEEE Access ◽ 10.1109/access.2019.2891001 ◽ 2019 ◽ Vol 7 ◽ pp. 9658-9666 ◽ Cited By ~ 3

Author(s): Jinbae Lee ◽ Bobae Kim ◽ Jong-Moon Chung

Keyword(s): Big Data ◽ Time Estimation ◽ Apache Spark ◽ Data Systems ◽ Resource Minimization ◽ Big Data Systems ◽ Minimization Scheme


A Survey of Data Mining Implementation in Smart City Applications

Qubahan Academic Journal ◽ 10.48161/qaj.v1n2a52 ◽ 2021 ◽ Vol 1 (2) ◽ pp. 91-99

Author(s): Zainab Salih Ageed ◽ Subhi R. M. Zeebaree ◽ Mohammed Mohammed Sadeeq ◽ Shakir Fattah Kak ◽ Zryan Najat Rashid ◽ ...

Keyword(s): Big Data ◽ Smart Cities ◽ Modern Technology ◽ Data Systems ◽ Community Model ◽ Multiple Data ◽ Big Data Applications ◽ Intelligent City ◽ Intelligent Cities ◽ Big Data Systems

Many policymakers envisage using a community model and Big Data technology to achieve the sustainability demanded by intelligent city components and to raise living standards. Smart cities use a range of technologies to improve health, housing, electricity, education, and water services for their residents. This involves reducing costs and resource consumption, and communicating more effectively and creatively with residents. Big data analytics is a comparatively recent technology that is capable of extending intelligent urban services. Because digitalization is an essential part of daily life, data collection has produced large volumes of data that can be used in several valuable areas. In many business and utility domains, including the intelligent urban domain, the successful exploitation and use of multiple data sources is critical. This paper examines how big data can be used to support smarter cities. It explores the possibilities, challenges, and benefits of applying big data systems in intelligent cities, and compares and contrasts different concepts of intelligent cities and big data. It also seeks to define criteria for the creation of big data applications for smart city services.


Security and Privacy Issues of Big Data

Handbook of Research on Trends and Future Directions in Big Data and Web Intelligence - Advances in Data Mining and Database Management ◽ 10.4018/978-1-4666-8505-5.ch002 ◽ 2015 ◽ pp. 20-52 ◽ Cited By ~ 8

Author(s): José Moura ◽ Carlos Serrão

Keyword(s): Social Networks ◽ Big Data ◽ Personal Data ◽ Security And Privacy ◽ Data Systems ◽ Computing Systems ◽ Big Data Applications ◽ Big Data Systems ◽ Privacy Issues

This chapter reviews the most important aspects of how computing infrastructures should be configured and intelligently managed to fulfill the most notable security requirements of Big Data applications. One of these is privacy, a pertinent issue because users share more and more personal data and content from their devices and computers with social networks and public clouds. A secure framework for social networks is therefore a very active research topic, and it is addressed in one of the chapter's two case-study sections. In addition, traditional security mechanisms such as firewalls and demilitarized zones are not well suited to the computing systems that support Big Data. Software-defined networking (SDN) is an emerging management solution that could become a convenient mechanism for implementing security in Big Data systems, as we show through a second case study at the end of the chapter. The chapter also discusses current relevant work and identifies open issues.


The Need to Consider Hardware Selection when Designing Big Data Applications Supported by Metadata

Big Data Management, Technologies, and Applications - Advances in Data Mining and Database Management ◽ 10.4018/978-1-4666-4699-5.ch015 ◽ 2013 ◽ pp. 381-396 ◽ Cited By ~ 2

Author(s): Nathan Regola ◽ David A. Cieslak ◽ Nitesh V. Chawla

Keyword(s): Cloud Computing ◽ Big Data ◽ Large Volume ◽ Virtual Machines ◽ Large Datasets ◽ Data Systems ◽ Big Data Applications ◽ Component Systems ◽ Big Data Systems ◽ Selection Of

The selection of hardware to support big data systems is complex. Even defining the term “big data” is difficult. “Big data” can mean a large volume of data in a database, a MapReduce cluster that processes data, analytics and reporting applications that must access large datasets to operate, algorithms that can effectively operate on large datasets, or even basic scripts that produce a needed result by leveraging data. Big data systems can be composed of many component systems. For these reasons, it appears difficult to create a universal, representative benchmark that approximates a “big data” workload. Along with the trend to utilize large datasets and sophisticated tools to analyze data, the trend of cloud computing has emerged as an effective method of leasing compute time. This chapter explores some of the issues at the intersection of virtualized computing (since cloud computing often uses virtual machines), metadata stores, and big data. Metadata is important because it enables many applications and users to access datasets and effectively use them without relying on extensive knowledge from humans about the data.


Security and Privacy Issues of Big Data

Cloud Security ◽ 10.4018/978-1-5225-8176-5.ch080 ◽ 2019 ◽ pp. 1598-1630

Author(s): José Moura ◽ Carlos Serrão

Keyword(s): Social Networks ◽ Big Data ◽ Personal Data ◽ Security And Privacy ◽ Data Systems ◽ Computing Systems ◽ Big Data Applications ◽ Big Data Systems ◽ Privacy Issues

This chapter reviews the most important aspects of how computing infrastructures should be configured and intelligently managed to fulfill the most notable security requirements of Big Data applications. One of these is privacy, a pertinent issue because users share more and more personal data and content from their devices and computers with social networks and public clouds. A secure framework for social networks is therefore a very active research topic, and it is addressed in one of the chapter's two case-study sections. In addition, traditional security mechanisms such as firewalls and demilitarized zones are not well suited to the computing systems that support Big Data. Software-defined networking (SDN) is an emerging management solution that could become a convenient mechanism for implementing security in Big Data systems, as we show through a second case study at the end of the chapter. The chapter also discusses current relevant work and identifies open issues.


Toward Efficient Ranked-key Algorithm for the Web notification of Big Data Systems

Proceedings of the 2nd international Conference on Big Data, Cloud and Applications ◽ 10.1145/3090354.3090386 ◽ 2017

Author(s): Mohamedou Cheikh Tourad ◽ Abdelmounaim Abdali

Keyword(s): Big Data ◽ Data Systems ◽ Big Data Systems ◽ The Web
