Map Reduce
Recently Published Documents


TOTAL DOCUMENTS

731
(FIVE YEARS 179)

H-INDEX

24
(FIVE YEARS 2)

Author(s):  
Alexandros Gazis ◽  
Eleftheria Katsiri

Map-Reduce is a programming model and an associated implementation for processing and generating large data sets. The model has a single point of failure: the master node, which coordinates the work in a cluster. In contrast, wireless sensor networks (WSNs) are distributed systems that scale to large numbers of small, computationally limited, low-power, unreliable nodes. In this article, we take a top-down approach to explaining the architecture, implementation and rationale of a distributed, fault-tolerant IoT middleware. Specifically, the middleware consists of multiple mini-computing devices (Raspberry Pi) connected in a WSN that implement the Map-Reduce algorithm. First, we explain the tools used to develop the system. Second, we focus on the Map-Reduce implementation, which overcomes common network connectivity issues and enhances availability and reliability. Lastly, we benchmark the middleware as a crowd-tracking application for a preserved building in Greece (M. Hatzidakis' residence). The results show that IoT middleware built from low-power, low-cost components is a viable solution for medium-sized distributed and parallel computing centres. Potential uses include monitoring buildings and indoor structures, as well as crowd tracking to prevent the spread of COVID-19.
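The abstract describes the middleware at a high level only. As a minimal sketch of the Map-Reduce pattern it builds on, applied to the crowd-tracking use case: the sensor readings, room names and function names below are hypothetical illustrations, not taken from the paper.

from collections import defaultdict

# Minimal sketch of the Map-Reduce pattern: map emits key-value pairs,
# reduce aggregates them per key. Sensor data here is invented for
# illustration only.

def map_phase(readings):
    """Emit (room, 1) for each motion-sensor reading that detected presence."""
    for room, detected in readings:
        if detected:
            yield room, 1

def reduce_phase(pairs):
    """Sum the emitted counts per room to estimate crowd presence."""
    totals = defaultdict(int)
    for room, count in pairs:
        totals[room] += count
    return dict(totals)

if __name__ == "__main__":
    readings = [("hall", True), ("hall", True), ("atrium", False), ("atrium", True)]
    print(reduce_phase(map_phase(readings)))  # {'hall': 2, 'atrium': 1}

In the paper's setting, each WSN node would run the map step over its own readings, and the reduce step would aggregate results across nodes; the sketch above collapses both onto one machine for clarity.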


2021 ◽  
pp. 24-67
Author(s):  
Ba Le Huy ◽  
Hoan Nguyen Xuan ◽  
Thanh Le Minh

Author(s):  
Pinjari Vali Basha

With the rapid transformation of technology, a huge amount of data (structured and unstructured) is generated every day. With the aid of 5G and IoT, the volume of data generated and processed daily is very large: approximately 2.5 quintillion bytes. This big data is stored and processed with the help of the Hadoop framework, which has two components for storing and retrieving data in the network:

- Hadoop Distributed File System (HDFS)
- MapReduce algorithm

The native Hadoop framework has a limitation in the MapReduce algorithm: if the same job is submitted again, all the steps are re-executed and we must wait for the results, which wastes time and resources. Improving the capabilities of the NameNode by maintaining a Common Job Block Table (CJBT) there improves performance, at the cost of maintaining the table. The CJBT holds the metadata of files that are processed repeatedly, which avoids recomputation, reduces the number of computations, saves resources and speeds up processing. Since the table would otherwise keep growing, its size must be bounded by an algorithm that keeps track of the jobs; the optimal CJBT is derived by employing an optimal algorithm at the NameNode.
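The abstract specifies no interface for the CJBT, so the sketch below only shows one plausible shape for such a table, assuming a job is identified by its name plus its input blocks. The class name, method names and FIFO eviction policy are assumptions made for illustration, not the paper's design.

import hashlib

# Hedged sketch of a Common Job Block Table: cache the result metadata of
# previously seen (job, input blocks) combinations so a repeated job can be
# answered from the cache instead of re-running MapReduce. All names and
# the eviction policy below are assumptions for illustration.

class CommonJobBlockTable:
    def __init__(self, max_entries=1024):
        self.max_entries = max_entries
        self.table = {}   # job signature -> cached result metadata
        self.order = []   # insertion order, for a simple FIFO size bound

    def _signature(self, job_name, input_blocks):
        key = job_name + "|" + "|".join(sorted(input_blocks))
        return hashlib.sha256(key.encode()).hexdigest()

    def lookup(self, job_name, input_blocks):
        """Return cached result metadata, or None if the job is unseen."""
        return self.table.get(self._signature(job_name, input_blocks))

    def store(self, job_name, input_blocks, result_meta):
        """Record a finished job, evicting the oldest entry when full."""
        sig = self._signature(job_name, input_blocks)
        if sig not in self.table and len(self.order) >= self.max_entries:
            self.table.pop(self.order.pop(0))
        if sig not in self.table:
            self.order.append(sig)
        self.table[sig] = result_meta

if __name__ == "__main__":
    cjbt = CommonJobBlockTable()
    if cjbt.lookup("wordcount", ["blk_1", "blk_2"]) is None:
        cjbt.store("wordcount", ["blk_1", "blk_2"], {"output": "/results/wc_1"})
    print(cjbt.lookup("wordcount", ["blk_1", "blk_2"]))

A repeated job would first call lookup; a hit returns the cached result metadata and the whole MapReduce run is skipped, which is the recomputation saving the abstract describes.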


Author(s):  
Sudhakar Yadav N ◽  
Sagar Yeruva ◽  
T Sunil Kumar ◽  
Talluri Susan

2021 ◽  
Vol 10 (4) ◽  
pp. 0-0

Big data analytics is an approach for extracting value from high-volume data warehouse systems; it compresses large volumes of data into clusters using MapReduce and HDFS. However, data processing takes considerable time to extract and store data in Hadoop clusters. The proposed system addresses the time delay in the shuffle phase of MapReduce caused by scheduling and sequencing. To improve the speed of big data processing, this work uses a Compressed Elastic Search Index (CESI) and a MapReduce-Based Next Generation Sequencing Approach (MRBNGSA). The approach speeds up data retrieval from HDFS clusters because only the metadata is stored in HDFS, which takes less memory at runtime than the full data set. This reduces the CPU utilization and memory allocation of the resource manager in the Hadoop framework and improves data processing speed, so that the time delay is reduced with minimum latency.
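The abstract does not define CESI concretely. The following sketch only illustrates the underlying idea of keeping a small, compressed metadata index instead of scanning the full data set; the record layout, field names and use of zlib/JSON are assumptions for illustration, not the paper's implementation.

import json
import zlib

# Hedged sketch of a compressed metadata index: store a compact mapping
# from record key to record location, compressed, and consult it at query
# time instead of scanning the data. Layout and names are invented here.

def build_index(records):
    """Map each record key to its position, then compress the index."""
    index = {rec["key"]: offset for offset, rec in enumerate(records)}
    return zlib.compress(json.dumps(index).encode())

def lookup(compressed_index, key):
    """Decompress the index and return the record's position, or None."""
    index = json.loads(zlib.decompress(compressed_index))
    return index.get(key)

if __name__ == "__main__":
    records = [{"key": "read_001"}, {"key": "read_002"}]
    idx = build_index(records)
    print(lookup(idx, "read_002"))  # 1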


2021 ◽  
Vol 13 (4) ◽  
pp. 9-25
Author(s):  
Mamadou Diarra ◽  
Telesphore Tiendrebeogo

Author(s):  
Sukhwant Kour Siledar ◽  
Bhagyashree Deogaonkar ◽  
Nutan Panpatte ◽  
Jayshri Pagare
