The Internet Archive has a preservation copy of this work in our general collections.
The file type is application/pdf.
Improving Entity Resolution with Global Constraints
[article]
2011
arXiv
pre-print
Some of the greatest advances in web search have come from leveraging socio-economic properties of online user behavior. Past advances include PageRank, anchor text, hubs-authorities, and TF-IDF. In this paper, we investigate another socio-economic property that, to our knowledge, has not yet been exploited: sites that create lists of entities, such as IMDB and Netflix, have an incentive to avoid gratuitous duplicates. We leverage this property to resolve entities across the different web
arXiv:1108.6016v1
fatcat:islnpt7vvfdq3gdqpakcqtbxym