A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017.
The file type is application/pdf.
Efficient and Scalable Processing of String Similarity Join
2013
IEEE Transactions on Knowledge and Data Engineering
The string similarity join is a basic operation in many applications that need to find all similar string pairs in a collection, given a similarity function and a user-specified threshold. Recently, there has been considerable interest in designing new algorithms with the assistance of an inverted index to support efficient string similarity joins. These algorithms typically adopt a two-step filter-and-refine approach in identifying similar string pairs: (1) generating candidate pairs by traversing the
doi:10.1109/tkde.2012.195
fatcat:duymkqx6z5br5alidzx4vfju4i