MapReduce Scheduling: Latest Research Papers

Based on the classical MapReduce concept, we propose an extended MapReduce scheduling model. In the extended MapReduce scheduling problem, we assume that each job contains an open-map task (a map task that can be divided into multiple operations that cannot run in parallel) and series-reduce tasks (each reduce task consists of only one operation). Unlike the classical MapReduce scheduling problem, we also assume that none of the operations can be processed in parallel and that the machines are unrelated. To solve the extended MapReduce scheduling problem, we establish a mixed-integer programming model with minimum makespan as the objective function. We then propose a genetic algorithm, a simulated annealing algorithm, and an L-F algorithm to solve this problem. Numerical experiments show that the L-F algorithm performs better in solving this problem.
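To make the problem setting in the abstract above concrete, here is a minimal Python sketch of a plain greedy list-scheduling heuristic on unrelated machines. It is not the paper's genetic, simulated annealing, or L-F algorithm, none of which are described on this page. The names `Job` and `greedy_makespan` are hypothetical, and the sketch assumes one reading of the model: the operations of a job run one at a time (map operations in any order, then reduce operations in series), and each machine handles one operation at a time.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Job:
    # Hypothetical container. Each operation is a list of processing times,
    # one entry per machine (unrelated machines: times vary freely by machine).
    map_ops: List[List[float]]      # open-map task: order is free, no overlap
    reduce_ops: List[List[float]]   # series-reduce tasks: fixed order


def greedy_makespan(jobs: List[Job], num_machines: int) -> Tuple[float, list]:
    """Assign each operation to the machine where it would finish earliest.

    Assumption: operations of one job never overlap, so each operation
    starts no earlier than the end of the job's previous operation.
    """
    machine_free = [0.0] * num_machines   # time at which each machine is idle
    schedule = []                         # (job, kind, op index, machine, start, end)
    for j, job in enumerate(jobs):
        job_ready = 0.0                   # end time of this job's previous operation
        for kind, ops in (("map", job.map_ops), ("reduce", job.reduce_ops)):
            for k, times in enumerate(ops):
                # Pick the machine with the earliest completion time.
                best = min(
                    range(num_machines),
                    key=lambda m: max(machine_free[m], job_ready) + times[m],
                )
                start = max(machine_free[best], job_ready)
                end = start + times[best]
                machine_free[best] = end
                job_ready = end
                schedule.append((j, kind, k, best, start, end))
    return max(machine_free), schedule


if __name__ == "__main__":
    demo = [
        Job(map_ops=[[2, 4], [3, 1]], reduce_ops=[[2, 2]]),
        Job(map_ops=[[1, 5]], reduce_ops=[[4, 2]]),
    ]
    makespan, plan = greedy_makespan(demo, num_machines=2)
    print(makespan)
    for row in plan:
        print(row)
```

The makespan returned by such a heuristic is only an upper bound on the optimum; a mixed-integer programming model or a metaheuristic like those named in the abstract would search over job orders and machine assignments rather than fixing them greedily.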

A heuristic method towards deadline-aware energy-efficient mapreduce scheduling problem in Hadoop YARN

Cluster Computing ◽

10.1007/s10586-020-03146-7 ◽

2020 ◽

Author(s):

Vaibhav Pandey ◽

Poonam Saini

Keyword(s):

Energy Efficient ◽

Heuristic Method ◽

Scheduling Problem ◽

Mapreduce Scheduling

Joint Optimization of MapReduce Scheduling and Network Policy in Hierarchical Data Centers

IEEE Transactions on Cloud Computing ◽

10.1109/tcc.2019.2961653 ◽

2020 ◽

pp. 1-1

Author(s):

Donglin Yang ◽

Dazhao Cheng ◽

Wei Rang ◽

Yu Wang

Keyword(s):

Data Centers ◽

Joint Optimization ◽

Hierarchical Data ◽

Mapreduce Scheduling

A systematic literature review on MapReduce scheduling methods

Intelligent Decision Technologies ◽

10.3233/idt-190363 ◽

2019 ◽

Vol 13 (1) ◽

pp. 1-21

Author(s):

Majid Rahimi ◽

Seyed Mohammad Hossein Hasheminejad

Keyword(s):

Literature Review ◽

Systematic Literature Review ◽

Mapreduce Scheduling

Jargon of Hadoop MapReduce scheduling techniques: a scientific categorization

The Knowledge Engineering Review ◽

10.1017/s0269888918000371 ◽

2019 ◽

Vol 34 ◽

Author(s):

Muhammad Hanif ◽

Choonhwa Lee

Keyword(s):

Critical Role ◽

Data Locality ◽

Research Community ◽

Process Data ◽

Data Set ◽

Apache Hadoop ◽

Hadoop Mapreduce ◽

Operating Environments ◽

Commodity Clusters ◽

Mapreduce Scheduling

Abstract: Recently, the valuable knowledge that can be retrieved from huge volumes of data (called Big Data) has set in motion the development of frameworks for processing data based on parallel and distributed computing, including Apache Hadoop, Facebook Corona, and Microsoft Dryad. Apache Hadoop is an open-source implementation of Google MapReduce that has attracted strong attention from the research community in both academia and industry. Hadoop MapReduce scheduling algorithms play a critical role in the management of large commodity clusters, controlling QoS requirements by supervising the execution of users, jobs, and tasks. Hadoop MapReduce comprises three schedulers: FIFO, Fair, and Capacity. However, the research community has developed new optimizations to account for advances and dynamic changes in hardware and operating environments. Numerous efforts have been made in the literature to address network congestion, straggling, data locality, heterogeneity, resource under-utilization, and skew mitigation in Hadoop scheduling. The volume of research published in journals and conferences on Hadoop scheduling has consistently increased, which makes it difficult for researchers to grasp the overall state of research and the areas that require further investigation. This study conducts a scientific literature review to assess preceding research contributions to the Apache Hadoop scheduling mechanism. We classify and quantify the main issues addressed in the literature based on their jargon and the areas addressed. Moreover, we explain and discuss the various challenges and open issues in Hadoop scheduling optimizations.

Optimal online algorithms for MapReduce scheduling on two uniform machines

Optimization Letters ◽

10.1007/s11590-018-01384-8 ◽

2019 ◽

Vol 13 (7) ◽

pp. 1663-1676 ◽

Cited By ~ 3

Author(s):

Yiwei Jiang ◽

Ping Zhou ◽

T. C. E. Cheng ◽

Min Ji

Keyword(s):

Online Algorithms ◽

Uniform Machines ◽

Mapreduce Scheduling

MapReduce Scheduling: Recently Published Documents

Minimizing total job completion time in MapReduce scheduling

SPO: A Secure and Performance-aware Optimization for MapReduce Scheduling

Constraint programming versus heuristic approach to MapReduce scheduling problem in Hadoop YARN for energy minimization

Analysis of hadoop MapReduce scheduling in heterogeneous environment

Heuristic Algorithms for MapReduce Scheduling Problem with Open-Map Task and Series-Reduce Tasks

A heuristic method towards deadline-aware energy-efficient mapreduce scheduling problem in Hadoop YARN

Joint Optimization of MapReduce Scheduling and Network Policy in Hierarchical Data Centers

A systematic literature review on MapReduce scheduling methods

Jargon of Hadoop MapReduce scheduling techniques: a scientific categorization

Optimal online algorithms for MapReduce scheduling on two uniform machines
