A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Text joins in an RDBMS for web data integration
2003
Proceedings of the twelfth international conference on World Wide Web - WWW '03
The integration of data produced and collected across autonomous, heterogeneous web services is an increasingly important and challenging problem. Due to the lack of global identifiers, the same entity (e.g., a product) might have different textual representations across databases. Textual data is also often noisy because of transcription errors, incomplete information, and lack of standard formats. A fundamental task during data integration is matching of strings that refer to the same entity.
doi:10.1145/775152.775166
dblp:conf/www/GravanoIKS03
fatcat:rcpffe2mbvaedaogukgztzcmfe