A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
PIP: A database system for great and small expectations
2010
2010 IEEE 26th International Conference on Data Engineering (ICDE 2010)
Estimation via sampling out of highly selective join queries is well known to be problematic, most notably in online aggregation. Without goal-directed sampling strategies, samples falling outside of the selection constraints lower estimation efficiency at best, and cause inaccurate estimates at worst. This problem appears in general probabilistic database systems, where query processing is tightly coupled with sampling. By committing to a set of samples before evaluating the query, the engine
doi:10.1109/icde.2010.5447879
dblp:conf/icde/KennedyK10
fatcat:tc3qgxxw25auzirg5pu2h7rjqy