Exploiting information redundancy to wring out structured data from the web
2010
Proceedings of the 19th international conference on World wide web - WWW '10
A large number of web sites publish pages containing structured information about recognizable concepts, but these data are only partially used by current applications. Although such information is spread across a myriad of sources, the scale of the web implies considerable redundancy. We present a domain-independent system that exploits this redundancy to automatically extract and integrate data from the Web. Our solution concentrates on sources that provide structured data about multiple
doi:10.1145/1772690.1772805
dblp:conf/www/BlancoBCMP10
fatcat:3acznfpf7fhgxojerdnedg66zu