Dynamic energy efficient data placement and cluster reconfiguration algorithm for MapReduce framework

Nitesh Maheshwari, Radheshyam Nanduri, Vasudeva Varma
2012 Future generations computer systems  
With the recent emergence of cloud computing based services on the Internet, MapReduce and distributed file systems like HDFS have emerged as the paradigm of choice for developing large scale data intensive applications. Given the scale at which these applications are deployed, minimizing power consumption of these clusters can significantly cut down operational costs and reduce their carbon footprint -thereby increasing the utility from a provider's point of view. This paper addresses energy
more » ... nservation for clusters of nodes that run MapReduce jobs. The algorithm dynamically reconfigures the cluster based on the current workload and turns cluster nodes on or off when the average cluster utilization rises above or falls below administrator specified thresholds, respectively. We evaluate our algorithm using the GridSim toolkit and our results show that the proposed algorithm achieves an energy reduction of 33% under average workloads and up to 54% under low workloads.
doi:10.1016/j.future.2011.07.001 fatcat:d7ffbaxy7rb5rj62ituy63zrmu