Distributed and interactive cube exploration

Niranjan Kamat, Prasanth Jayachandran, Karthik Tunga, Arnab Nandi
2014 2014 IEEE 30th International Conference on Data Engineering  
Interactive ad-hoc analytics over large datasets has become an increasingly popular use case. We detail the challenges encountered when building a distributed system that allows the interactive exploration of a data cube. We introduce DICE, a distributed system that uses a novel session-oriented model for data cube exploration, designed to provide the user with interactive sub-second latencies for specified accuracy levels. A novel framework is provided that combines three concepts: faceted
more » ... oration of data cubes, speculative execution of queries and query execution over subsets of data. We discuss design considerations, implementation details and optimizations of our system. Experiments demonstrate that DICE provides a subsecond interactive cube exploration experience at the billion-tuple scale that is at least 33% faster than current approaches.
doi:10.1109/icde.2014.6816674 dblp:conf/icde/KamatJTN14 fatcat:cgniwdhebvdkff2ubv6qytboh4