Joint scheduling of processing and Shuffle phases in MapReduce systems

Fangfei Chen, Murali Kodialam, T. V. Lakshman
2012 Proceedings IEEE INFOCOM
MapReduce has emerged as an important paradigm for processing data in large data centers. MapReduce is a three-phase algorithm comprising Map, Shuffle and Reduce phases. Due to its widespread deployment, there have been several recent papers outlining practical schemes to improve the performance of MapReduce systems. All these efforts focus on one of the three phases to obtain performance improvement. In this paper, we consider the problem of jointly scheduling all three phases of the MapReduce process with a view to understanding the theoretical complexity of the joint scheduling and working towards practical heuristics for scheduling the tasks. We give guaranteed approximation algorithms and outline several heuristics to solve the joint scheduling problem.
doi:10.1109/infcom.2012.6195473 dblp:conf/infocom/ChenKL12 fatcat:5jjlo6ndbfh3lhlsr3fzz2ve5m