When can we trust progress estimators for SQL queries?

Surajit Chaudhuri, Raghav Kaushik, Ravishankar Ramamurthy
2005 Proceedings of the 2005 ACM SIGMOD international conference on Management of data - SIGMOD '05  
The problem of estimating progress for long-running queries has recently been introduced. We analyze the characteristics of the progress estimation problem, from the perspective of providing robust, worst-case guarantees. Our first result is that in the worst case, no progress estimation algorithm can yield anything even moderately better than the trivial guarantee that identifies the progress as lying between 0% and 100%. In such cases, we introduce an estimator that can optimally bound the
more » ... imally bound the error. By placing different types of restrictions on the data and query characteristics, we show that it is possible to design effective progress estimators with small error bounds. We show where previous solutions lie in this spectrum. We then demonstrate empirically that these "good" scenarios are common in practice and discuss possible ways of combining the estimators.
doi:10.1145/1066157.1066223 dblp:conf/sigmod/ChaudhuriKR05 fatcat:f76ugtkymrexdegyacgpqjrewa