MRBench: A Benchmark for MapReduce Framework

Kiyoung Kim, Kyungho Jeon, Hyuck Han, Shin-gyu Kim, Hyungsoo Jung, Heon Y. Yeom
2008 2008 14th IEEE International Conference on Parallel and Distributed Systems  
MapReduce is Google's programming model for easy development of scalable parallel applications which process huge quantity of data on many clusters. Due to its conveniency and efficiency, MapReduce is used in various applications (e.g., web search services and online analytical processing.) However, there are only few good benchmarks to evaluate MapReduce implementations by realistic testsets. In this paper, we present MRBench that is a benchmark for evaluating MapReduce systems. MRBench
more » ... on processing business oriented queries and concurrent data modifications. To this end, we build MR-Bench to deal with large volumes of relational data and execute highly complex queries. By MRBench, users can evaluate the performance of MapReduce systems while varying environmental parameters such as data size and the number of (Map/Reduce) tasks. Our extensive experimental results show that MRBench is a useful tool to benchmark the capability of answering critical business questions.
doi:10.1109/icpads.2008.70 dblp:conf/icpads/KimJHKJY08 fatcat:jbtt63l4brapfmqo4makwoep7i