TPC-H Analyzed: Hidden Messages and Lessons Learned from an Influential Benchmark [chapter]

Peter Boncz, Thomas Neumann, Orri Erling
2014 Lecture Notes in Computer Science  
The TPC-D benchmark was developed almost 20 years ago, and even though its current existence as TPC-H could be considered superseded by TPC-DS, one can still learn from it. We focus on the technical level, summarizing the challenges posed by the TPC-H workload as we now understand them, which we call "choke points". We identify 28 different such choke points, grouped into six categories: Aggregation Performance, Join Performance, Data Access Locality, Expression Calculation, Correlated
more » ... s and Parallel Execution. On the meta-level, we make the point that the rich set of choke-points found in TPC-H sets an example on how to design future DBMS benchmarks. 4
doi:10.1007/978-3-319-04936-6_5 fatcat:dy5nfvl6avbfvgq5yt6adjrnva