Optimizing Parallel Recursive Datalog Evaluation on Multicore Machines

Jiacheng Wu, Jin Wang, Carlo Zaniolo
2022 Proceedings of the 2022 International Conference on Management of Data  
Over the past years, there has been a resurgence of interest in Datalog due to its superior ability of expressing applications that require recursive computations. However, in addition to expressive power, supporting analytical tasks with ever-increasing volume of data requires high performance and scalability. In this paper, we present DCDatalog, an in-memory Datalog engine specifically designed for modern shared-memory multicore architectures. Our key contribution is a novel system
more » ... e that supports a wide scope of Datalog applications with a light-weight coordination scheme during parallel evaluation. To this end, we propose a dynamic scheduling strategy that can generate the parallel execution plan on-the-fly while reducing concurrent accesses to the shared memory. Experimental results on several large datasets show that our system significantly outperforms existing parallel Datalog engines and also scales well with increasing amount of data. CCS CONCEPTS • Information systems → Relational parallel and distributed DBMSs.
doi:10.1145/3514221.3517853 fatcat:lgv7q6jdojabjbcerxjneawhla