A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2007; you can also visit the original URL.
The file type is application/pdf
.
Schema-guided wrapper maintenance for web-data extraction
2003
Proceedings of the fifth ACM international workshop on Web information and data management - WIDM '03
Extracting data from Web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interests. There are two main issues relevant to Web-data extraction, namely wrapper generation and wrapper maintenance. In this paper, we propose a novel schema-guided approach to the problem of automatic wrapper maintenance. It is based on the observation that despite various page changes, many important features of the pages are preserved, such as syntactic
doi:10.1145/956699.956701
dblp:conf/widm/MengHL03
fatcat:maqjddsdebholitgj5lsq6wdyq