A Case Study on Effective Technique of Distributed Data Storage for Big Data Processing in the Wireless Internet Environment

NoSQL databases are designed to meet the huge data storage requirements of cloud computing and big data processing. They offer many advanced features in addition to conventional RDBMS features; hence, “NoSQL” databases are popularly known as “Not only SQL” databases. A variety of NoSQL databases, with different features for handling exponentially growing data-intensive applications, are available under open-source and proprietary options. This chapter discusses some of the popular NoSQL databases and their features in light of the CAP theorem.


A case study of ICT used by big data processing in education

Proceedings of the 6th International Conference on Information and Education Technology - ICIET '18 ◽

10.1145/3178158.3178190 ◽

2018 ◽

Cited By ~ 2

Author(s):

Keiko Tsujioka

Keyword(s):

Big Data ◽

Data Processing ◽

Big Data Processing


Cost Minimization for Big Data Processing in Geo-Distributed Data Centres

Asia-pacific Journal of Convergent Research Interchange ◽

10.21742/apjcri.2016.12.05 ◽

2016 ◽

Vol 2 (4) ◽

pp. 35-43

Author(s):

T. Sai Raaga Sowmya

Keyword(s):

Big Data ◽

Data Processing ◽

Cost Minimization ◽

Distributed Data ◽

Big Data Processing ◽

Data Centres


Resource and Cost Aware Glowworm Mapreduce Optimization Based Big Data Processing in Geo Distributed Data Center

Wireless Personal Communications ◽

10.1007/s11277-020-07050-6 ◽

2020 ◽

Author(s):

S. Nithyanantham ◽

G. Singaravel

Keyword(s):

Big Data ◽

Data Processing ◽

Data Center ◽

Distributed Data ◽

Big Data Processing


On the Use of Hyperparameter Optimization in Big Data Processing Pipelines: A Case Study

2019 Innovations in Intelligent Systems and Applications Conference (ASYU) ◽

10.1109/asyu48272.2019.8946352 ◽

2019 ◽

Author(s):

Jasser Dhaouadi ◽

Mehmet S. Aktas ◽

Oya Kalipsiz ◽

Erman Balcik

Keyword(s):

Big Data ◽

Data Processing ◽

Hyperparameter Optimization ◽

Big Data Processing


Data Mining Library for Big Data Processing Platforms: A Case Study-Sparkling Water Platform

2018 3rd International Conference on Computer Science and Engineering (UBMK) ◽

10.1109/ubmk.2018.8566278 ◽

2018 ◽

Author(s):

Elif Cansu Yildiz ◽

Mehmet S. Aktas ◽

Oya Kalipsiz ◽

Alper Nebi Kanli ◽

Umut Orcun Turgut

Keyword(s):

Data Mining ◽

Big Data ◽

Data Processing ◽

Big Data Processing


“Saksham Model” Performance Improvisation Using Node Capability Evaluation in Apache Hadoop

Big Data Analytics for Sustainable Computing - Advances in Data Mining and Database Management ◽

10.4018/978-1-5225-9750-6.ch012 ◽

2020 ◽

pp. 206-230

Author(s):

Ankit Shah ◽

Mamta C. Padole

Keyword(s):

Big Data ◽

Distributed Computing ◽

Data Processing ◽

Data Storage ◽

Model Performance ◽

Big Data Processing ◽

Apache Hadoop ◽

Processing Capability ◽

Proposed Model ◽

Capability Evaluation

Big Data processing and analysis require tremendous processing capability. Distributed computing brings many commodity systems under a common platform to answer the need for Big Data processing and analysis. Apache Hadoop is the most suitable set of tools for Big Data storage, processing, and analysis. However, Hadoop is found to be inefficient on a heterogeneous set of computers with different processing capabilities. In this research, we propose the Saksham model, which optimizes processing time through efficient use of node processing capability and file management. To achieve better performance, the Saksham model uses two vital aspects of heterogeneous distributed computing: an effective block rearrangement policy and the use of node processing capability. The results demonstrate that the proposed model achieves better job execution time and improves data locality.
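
The idea of matching data placement to node capability can be illustrated with a minimal sketch. It is not the Saksham model itself, whose policy is not given in the abstract; it only assumes a hypothetical per-node capability score and assigns blocks in proportion to it. The function name `assign_blocks` and the scores are illustrative assumptions.

```python
def assign_blocks(num_blocks, capabilities):
    """Split num_blocks across nodes in proportion to a capability score.

    capabilities: dict of node name -> positive score (hypothetical metric).
    Returns a dict of node name -> number of blocks assigned.
    """
    total = sum(capabilities.values())
    # Ideal fractional share of blocks for each node.
    shares = {n: num_blocks * c / total for n, c in capabilities.items()}
    # Start from the whole-number part of each share.
    plan = {n: int(s) for n, s in shares.items()}
    # Give the remaining blocks to the nodes with the largest remainders.
    leftover = num_blocks - sum(plan.values())
    by_remainder = sorted(shares, key=lambda n: shares[n] - plan[n], reverse=True)
    for n in by_remainder[:leftover]:
        plan[n] += 1
    return plan


plan = assign_blocks(10, {"fast": 4.0, "medium": 2.0, "slow": 1.0})
print(plan)
```

With these assumed scores, the faster node receives most of the blocks, so more map tasks can run where their data already resides.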


Design of Big Data Processing System Architecture Based on Hadoop under the Cloud Computing

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.556-562.6302 ◽

2014 ◽

Vol 556-562 ◽

pp. 6302-6306 ◽

Cited By ~ 3

Author(s):

Chun Mei Duan

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Processing ◽

Data Storage ◽

System Architecture ◽

Processing System ◽

System Stability ◽

Security Model ◽

Data Processing System ◽

Big Data Processing

To address the limitations of traditional data processing technology in big data processing, a big data processing system architecture based on Hadoop is designed, using the quantification, unstructured, and dynamic characteristics of cloud computing. HDFS is responsible for big data storage, MapReduce for big data calculation, and HBase serves as the database for unstructured data storage. At the same time, a storage system and a cloud computing security model are designed in order to implement efficient storage, management, and retrieval of data. The design can thus save construction cost and guarantee system stability, reliability, and security.
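
The division of labour that MapReduce provides for calculation can be sketched in a few lines of plain Python. This is an in-memory illustration of the map, shuffle, and reduce phases under the assumption of a word-count job; it does not use Hadoop, and the function names are illustrative rather than taken from the paper.

```python
from collections import defaultdict


def map_phase(line):
    # Emit a (word, 1) pair for every word in one input record.
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    # Group values by key, as the framework does between map and reduce.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    # Combine all values for one key into a single result.
    return key, sum(values)


def word_count(lines):
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = word_count(["big data storage", "big data calculation"])
print(counts)
```

In a real deployment the input records would be read from HDFS blocks and the map and reduce calls would run on different nodes; here everything runs in one process to keep the data flow visible.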


A Case Study of Using Big Data Processing in Education: Method of Matching Members by Optimizing Collaborative Learning Environment

Social Media and Machine Learning ◽

10.5772/intechopen.85526 ◽

2020 ◽

Cited By ~ 3

Author(s):

Keiko Tsujioka

Keyword(s):

Big Data ◽

Collaborative Learning ◽

Learning Environment ◽

Data Processing ◽

Big Data Processing ◽

Collaborative Learning Environment


Employing Vertical Elasticity for Efficient Big Data Processing in Container-Based Cloud Environments

Applied Sciences ◽

10.3390/app11136200 ◽

2021 ◽

Vol 11 (13) ◽

pp. 6200

Author(s):

Jin-young Choi ◽

Minkyoung Cho ◽

Jik-Soo Kim

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Processing ◽

Data Storage ◽

Resource Utilization ◽

System Throughput ◽

Big Data Processing ◽

Cloud Environments ◽

Utilization Scheme ◽

Adaptive Resource

Recently, “Big Data” platform technologies have become crucial for distributed processing of diverse unstructured or semi-structured data as the amount of data generated increases rapidly. In order to effectively manage these Big Data, Cloud Computing has been playing an important role by providing scalable data storage and computing resources for competitive and economical Big Data processing. Accordingly, server virtualization technologies that are the cornerstone of Cloud Computing have attracted considerable research interest. However, conventional hypervisor-based virtualization can cause performance degradation problems due to its heavily loaded guest operating systems and rigid resource allocations. On the other hand, container-based virtualization technology can provide the same level of service faster with a lightweight capacity by effectively eliminating the guest OS layers. In addition, container-based virtualization enables efficient cloud resource management by dynamically adjusting the allocated computing resources (e.g., CPU and memory) at runtime through “Vertical Elasticity”. In this paper, we present our practice and experience of employing an adaptive resource utilization scheme for Big Data workloads in container-based cloud environments by leveraging the vertical elasticity of Docker, a representative container-based virtualization technique. We perform extensive experiments running several Big Data workloads on representative Big Data platforms: Apache Hadoop and Spark. During the workload executions, our adaptive resource utilization scheme periodically monitors the resource usage patterns of running containers and dynamically adjusts allocated computing resources, which can result in substantial improvements in the overall system throughput.
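
The monitor-and-adjust loop described above can be sketched as a simple threshold rule. This is not the authors' scheme, whose details the abstract does not give; the thresholds, step size, and limits below are assumed values, and `next_cpu_limit` is a hypothetical helper.

```python
def next_cpu_limit(current_limit, usage, low=0.3, high=0.8, step=0.5,
                   min_limit=0.5, max_limit=4.0):
    """Return a new CPU limit (in cores) from one usage sample.

    usage is the CPU actually consumed, in cores. All thresholds and
    bounds are illustrative assumptions, not values from the paper.
    """
    utilization = usage / current_limit
    if utilization > high:
        # Container is close to its limit: grow the allocation.
        return min(current_limit + step, max_limit)
    if utilization < low:
        # Container is mostly idle: shrink the allocation.
        return max(current_limit - step, min_limit)
    return current_limit


# Simulated usage samples from periodic monitoring of one container.
limit = 2.0
for usage in [1.9, 2.4, 0.5, 0.4]:
    limit = next_cpu_limit(limit, usage)
print(limit)
```

On Docker, a limit computed this way could be applied to a running container with `docker update --cpus`, although the abstract does not state how the authors apply their adjustments.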
