The file type is application/pdf.
Record linkage
2006
Proceedings of the 2006 ACM SIGMOD international conference on Management of data - SIGMOD '06
Fellegi-Sunter: formalized the approach of Newcombe et al. [NKAJ59]. Given two sets of records (relations) A and B, perform an approximate join. For each pair (a, b), the comparison vector γ(a,b) contains comparison features, e.g., same last names, same SSN, etc. Γ, the range of γ(a,b), is the comparison space.
Fellegi-Sunter issues: Tuning: how to estimate m(γ) and u(γ)? Training data: active learning for M, U labels. Semi-supervised or unsupervised clustering: identify the M and U clusters. How to set µ and λ? How to define the comparison space Γ?
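The pieces named in the excerpt (comparison vector γ(a,b), the quantities m(γ) and u(γ), and a thresholded decision) can be sketched in Python. This is a minimal illustration and not the tutorial's own code. It assumes the usual reading of m and u as the probabilities of observing γ among matching (M) and non-matching (U) pairs, and it assumes the features are conditionally independent so that per-feature probabilities can be multiplied. The probability values, the two features, and the `upper` and `lower` thresholds are all hypothetical; in the Fellegi-Sunter framework the thresholds would follow from the chosen error levels µ and λ.

```python
import math

# Hypothetical per-feature probabilities (illustrative values, not from the source):
#   M_PROBS[f] = P(feature f agrees | pair is a match, M)
#   U_PROBS[f] = P(feature f agrees | pair is a non-match, U)
M_PROBS = {"last_name": 0.95, "ssn": 0.99}
U_PROBS = {"last_name": 0.05, "ssn": 0.001}


def comparison_vector(a, b):
    """Build gamma(a, b): one agree/disagree flag per feature.

    Two missing values compare as equal here, which is a simplification.
    """
    return {f: a.get(f) == b.get(f) for f in M_PROBS}


def match_weight(gamma):
    """Log-likelihood ratio log(m(gamma) / u(gamma)).

    Assumes conditional independence of the features, so the ratio
    factorizes into a sum of per-feature log ratios.
    """
    weight = 0.0
    for feature, agrees in gamma.items():
        m, u = M_PROBS[feature], U_PROBS[feature]
        if agrees:
            weight += math.log(m / u)
        else:
            weight += math.log((1 - m) / (1 - u))
    return weight


def classify(weight, upper=3.0, lower=-3.0):
    """Three-way decision; the threshold values are hypothetical."""
    if weight >= upper:
        return "match"
    if weight <= lower:
        return "non-match"
    return "possible match"
```

Calling `classify(match_weight(comparison_vector(a, b)))` on two record dictionaries returns one of the three labels. Pairs that fall between the thresholds are the ones typically left for manual review.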
doi:10.1145/1142473.1142599
dblp:conf/sigmod/KoudasSS06
fatcat:qmqy53kianfzrdylejufvlvdnq