A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Parallel meta-blocking: Realizing scalable entity resolution over large, heterogeneous data
2015
2015 IEEE International Conference on Big Data (Big Data)
Entity resolution constitutes a crucial task for many applications, but has an inherently quadratic complexity. Typically, it scales to large volumes of data through blocking: similar entities are clustered into blocks so that it suffices to perform comparisons only within each block. Meta-blocking further increases efficiency by cleaning the overlapping blocks from unnecessary comparisons. However, even Meta-blocking can be time-consuming: applying it to blocks with 7.4 million entities and
doi:10.1109/bigdata.2015.7363782
dblp:conf/bigdataconf/Efthymiou0PSP15
fatcat:mlqyuhwy7bd47myqa4hqm53siu