A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
A cost model and index architecture for the similarity join
Proceedings 17th International Conference on Data Engineering
The similarity join is an important database primitive which has been successfully applied to speed up data mining algorithms. In the similarity join, two point sets of a multidimensional vector space are combined such that the result contains all point pairs where the distance does not exceed a parameter ε. Due to its high practical relevance, many similarity join algorithms have been devised. In this paper, we propose an analytical cost model for the similarity join operation based on
doi:10.1109/icde.2001.914854
dblp:conf/icde/BohmK01
fatcat:6woaiqmhy5fmnoytyygsszzumi