Analysing Big Data in VANET via HADOOP Framework

Rahul Kumar Chawda; Ghanshyam Thakur

doi:10.4172/jcsb.1000281

Big Data Security Challenges and Solution of Distributed Computing in Hadoop Environment: A Security Framework

Recent Advances in Computer Science and Communications ◽

10.2174/2213275912666190822095422 ◽

2020 ◽

Vol 13 (4) ◽

pp. 790-797

Author(s):

Gurjit Singh Bhathal ◽

Amardeep Singh Dhiman

Keyword(s):

Big Data ◽

Data Security ◽

Data Sets ◽

Security Framework ◽

Hadoop Distributed File System ◽

Current Scenario ◽

Hadoop Cluster ◽

Ciphertext Policy ◽

In Transit ◽

Hadoop Framework

Background: In current scenario of internet, large amounts of data are generated and processed. Hadoop framework is widely used to store and process big data in a highly distributed manner. It is argued that Hadoop Framework is not mature enough to deal with the current cyberattacks on the data. Objective: The main objective of the proposed work is to provide a complete security approach comprising of authorisation and authentication for the user and the Hadoop cluster nodes and to secure the data at rest as well as in transit. Methods: The proposed algorithm uses Kerberos network authentication protocol for authorisation and authentication and to validate the users and the cluster nodes. The Ciphertext-Policy Attribute- Based Encryption (CP-ABE) is used for data at rest and data in transit. User encrypts the file with their own set of attributes and stores on Hadoop Distributed File System. Only intended users can decrypt that file with matching parameters. Results: The proposed algorithm was implemented with data sets of different sizes. The data was processed with and without encryption. The results show little difference in processing time. The performance was affected in range of 0.8% to 3.1%, which includes impact of other factors also, like system configuration, the number of parallel jobs running and virtual environment. Conclusion: The solutions available for handling the big data security problems faced in Hadoop framework are inefficient or incomplete. A complete security framework is proposed for Hadoop Environment. The solution is experimentally proven to have little effect on the performance of the system for datasets of different sizes.

Download Full-text

Early detection of diabetic retinopathy from big data in hadoop framework

Displays ◽

10.1016/j.displa.2021.102061 ◽

2021 ◽

Vol 70 ◽

pp. 102061

Author(s):

Amartya Hatua ◽

Badri Narayan Subudhi ◽

Veerakumar T. ◽

Ashish Ghosh

Keyword(s):

Big Data ◽

Diabetic Retinopathy ◽

Early Detection ◽

Hadoop Framework

Download Full-text

Leveraging Big Data Analytics Utilizing Hadoop Framework in Sports Science

Smart Computational Strategies: Theoretical and Practical Aspects ◽

10.1007/978-981-13-6295-8_22 ◽

2019 ◽

pp. 259-272

Author(s):

Gagandeep Jagdev ◽

Sarabjeet Kaur

Keyword(s):

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Sports Science ◽

Hadoop Framework

Download Full-text

Big Data Security Problem Based on Hadoop Framework

2019 4th International Conference on Computer Science and Engineering (UBMK) ◽

10.1109/ubmk.2019.8907074 ◽

2019 ◽

Author(s):

Refik Samet ◽

Ayhan Aydin ◽

Feridun Toy

Keyword(s):

Big Data ◽

Data Security ◽

Security Problem ◽

Hadoop Framework

Download Full-text

A Hierarchical Hadoop Framework to Handle Big Data in Geo-Distributed Computing Environments

International Journal of Information Technologies and Systems Approach ◽

10.4018/ijitsa.2018010102 ◽

2018 ◽

Vol 11 (1) ◽

pp. 16-47 ◽

Cited By ~ 3

Author(s):

Orazio Tomarchio ◽

Giuseppe Di Modica ◽

Marco Cavallo ◽

Carmelo Polito

Keyword(s):

Big Data ◽

Data Centers ◽

Poor Performance ◽

Communication Technologies ◽

Sources Of Information ◽

The Poor ◽

The Social ◽

Computing Environments ◽

Definition Of ◽

Hadoop Framework

Advances in the communication technologies, along with the birth of new communication paradigms leveraging on the power of the social, has fostered the production of huge amounts of data. Old-fashioned computing paradigms are unfit to handle the dimensions of the data daily produced by the countless, worldwide distributed sources of information. So far, the MapReduce has been able to keep the promise of speeding up the computation over Big Data within a cluster. This article focuses on scenarios of worldwide distributed Big Data. While stigmatizing the poor performance of the Hadoop framework when deployed in such scenarios, it proposes the definition of a Hierarchical Hadoop Framework (H2F) to cope with the issues arising when Big Data are scattered over geographically distant data centers. The article highlights the novelty introduced by the H2F with respect to other hierarchical approaches. Tests run on a software prototype are also reported to show the increase of performance that H2F is able to achieve in geographical scenarios over a plain Hadoop approach.

Download Full-text

Research and Practice of Big Data Analysis Process Based on Hadoop Framework

2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC) ◽

10.1109/itnec.2019.8729522 ◽

2019 ◽

Author(s):

Hui Jiang

Keyword(s):

Big Data ◽

Data Analysis ◽

Big Data Analysis ◽

Research And Practice ◽

Analysis Process ◽

Hadoop Framework

Download Full-text

Big Data: Hadoop framework vulnerabilities, security issues and attacks

Array ◽

10.1016/j.array.2019.100002 ◽

2019 ◽

Vol 1-2 ◽

pp. 100002 ◽

Cited By ~ 6

Author(s):

Gurjit Singh Bhathal ◽

Amardeep Singh

Keyword(s):

Big Data ◽

Security Issues ◽

Hadoop Framework

Download Full-text

Evaluation and Analysis of Capacity Scheduler and Fair Scheduler in Hadoop Framework on Big Data Technology

Proceedings of the 2018 International Conference on Artificial Intelligence and Virtual Reality - AIVR 2018 ◽

10.1145/3293663.3293680 ◽

2018 ◽

Author(s):

Muhammad Salman ◽

Diyanatul Husna ◽

Adhitya Wicaksono ◽

Anak Agung Putri Ratna

Keyword(s):

Big Data ◽

Fair Scheduler ◽

Hadoop Framework ◽

Big Data Technology

Download Full-text

Analyzing and scripting indian election strategies using big data via Apache Hadoop framework

2016 5th International Conference on Wireless Networks and Embedded Systems (WECON) ◽

10.1109/wecon.2016.7993431 ◽

2016 ◽

Author(s):

Gagandeep Jagdev ◽

Amandeep Kaur

Keyword(s):

Big Data ◽

Apache Hadoop ◽

Election Strategies ◽

Hadoop Framework

Download Full-text

Optimizing Hadoop Performance for Big Data Analytics in Smart Grid

Mathematical Problems in Engineering ◽

10.1155/2017/2198262 ◽

2017 ◽

Vol 2017 ◽

pp. 1-11 ◽

Cited By ~ 2

Author(s):

Mukhtaj Khan ◽

Zhengwen Huang ◽

Maozhen Li ◽

Gareth A. Taylor ◽

Phillip M. Ashton ◽

...

Keyword(s):

Big Data ◽

Smart Grid ◽

Power Systems ◽

Data Analytics ◽

High Performance ◽

Gene Expression Programming ◽

Big Data Analytics ◽

Fluctuation Analysis ◽

Grid Applications ◽

Hadoop Framework

The rapid deployment of Phasor Measurement Units (PMUs) in power systems globally is leading to Big Data challenges. New high performance computing techniques are now required to process an ever increasing volume of data from PMUs. To that extent the Hadoop framework, an open source implementation of the MapReduce computing model, is gaining momentum for Big Data analytics in smart grid applications. However, Hadoop has over 190 configuration parameters, which can have a significant impact on the performance of the Hadoop framework. This paper presents an Enhanced Parallel Detrended Fluctuation Analysis (EPDFA) algorithm for scalable analytics on massive volumes of PMU data. The novel EPDFA algorithm builds on an enhanced Hadoop platform whose configuration parameters are optimized by Gene Expression Programming. Experimental results show that the EPDFA is 29 times faster than the sequential DFA in processing PMU data and 1.87 times faster than a parallel DFA, which utilizes the default Hadoop configuration settings.

Download Full-text