Scaling Out Continuous Multi-Way Theta-Joins

Manuel Hoffmann, Sebastian Michel
2017 Proceedings of the 4th Algorithms and Systems on MapReduce and Beyond - BeyondMR'17  
In this paper, we propose generic tuple routing schemes that allow the computation of distributed multi-way thetajoins over streaming data. We present an architecture which compiles query plans in form of logical operators into Apache Storm topologies and report on first results of evaluating TPC-H data using Amazon EC2 instances running these topologies.
doi:10.1145/3070607.3070611 dblp:conf/sigmod/HoffmannM17 fatcat:65kafp4hifegzff3fkwfgrzafy