Comparative Analysis of Hadoop MapReduce and Spark Based on People’s Livelihood Appeal Data

Author(s):  
Lixin Liang ◽  
Heng Zhao ◽  
Yongan Shen
2019 ◽  
Vol 2 (1) ◽  
pp. 61-73
Author(s):  
Pankaj Lathar ◽  
K. G. Srinivasa

With the advancements in science and technology, data is being generated at a staggering rate. The raw data generated is generally of high value and may conceal important information with the potential to solve several real-world problems. In order to extract this information, the raw data available must be processed and analysed efficiently. It has however been observed, that such raw data is generated at a rate faster than it can be processed by traditional methods. This has led to the emergence of the popular parallel processing programming model – MapReduce. In this study, the authors perform a comparative analysis of two popular data processing engines – Apache Flink and Hadoop MapReduce. The analysis is based on the parameters of scalability, reliability and efficiency. The results reveal that Flink unambiguously outperformance Hadoop's MapReduce. Flink's edge over MapReduce can be attributed to following features – Active Memory Management, Dataflow Pipelining and an Inline Optimizer. It can be concluded that as the complexity and magnitude of real time raw data is continuously increasing, it is essential to explore newer platforms that are adequately and efficiently capable of processing such data.


2007 ◽  
Vol 177 (4S) ◽  
pp. 398-398
Author(s):  
Luis H. Braga ◽  
Joao L. Pippi Salle ◽  
Sumit Dave ◽  
Sean Skeldon ◽  
Armando J. Lorenzo ◽  
...  
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document