Debugging Big Data Analytics in Spark with BigDebug

Muhammad Ali Gulzar, Matteo Interlandi, Tyson Condie, Miryung Kim
2017 Proceedings of the 2017 ACM International Conference on Management of Data - SIGMOD '17  
To process massive quantities of data, developers leverage Data-Intensive Scalable Computing (DISC) systems such as Apache Spark. In terms of debugging, DISC systems support only postmortem log analysis and do not provide any debugging functionality. This demonstration paper showcases BIGDEBUG: a tool enhancing Apache Spark with a set of interactive debugging features that can help users in debug their Big Data Applications.
doi:10.1145/3035918.3058737 dblp:conf/sigmod/GulzarICK17 fatcat:o4jwkovkwndhblayxmifzeonge