Gianluca Demartini, Djellel Eddine Difallah, Philippe Cudré-Mauroux
2012 Proceedings of the 21st international conference on World Wide Web - WWW '12  
We tackle the problem of entity linking for large collections of online pages; Our system, ZenCrowd, identifies entities from natural language text using state of the art techniques and automatically connects them to the Linked Open Data cloud. We show how one can take advantage of human intelligence to improve the quality of the links by dynamically generating micro-tasks on an online crowdsourcing platform. We develop a probabilistic framework to make sensible decisions about candidate links
more » ... nd to identify unreliable human workers. We evaluate ZenCrowd in a real deployment and show how a combination of both probabilistic reasoning and crowdsourcing techniques can significantly improve the quality of the links, while limiting the amount of work performed by the crowd.
doi:10.1145/2187836.2187900 dblp:conf/www/DemartiniDC12 fatcat:ywleux4n2bdlzmavh4x5jrx5xy