On the Automatic Extraction of Data from the Hidden Web [chapter]

Stephen W. Liddle, Sai Ho Yau, David W. Embley
2002 Lecture Notes in Computer Science  
An increasing amount of Web data is accessible only by filling out HTML forms to query an underlying data source. While this is most welcome from a user perspective (queries are easy and precise) and from a data management perspective (static pages need not be maintained; databases can be accessed directly), automated agents have greater difficulty accessing data behind forms. In this paper we present a method for automatically filling in forms to retrieve the associated dynamically generated pages. Using our approach automated agents can begin to systematically access portions of the "hidden Web."
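As a rough illustration of what automatically filling in a form can involve, the sketch below reads the first form on a page, keeps each field's default value, and builds the request an agent could send. It is a minimal sketch, not the authors' method: submitting defaults is a simplification chosen here, and the names `FormFieldParser` and `default_submission`, as well as the example URL, are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode, urljoin


class FormFieldParser(HTMLParser):
    """Collect action, method, and default field values of the first form.

    Illustrative only; this class is not taken from the paper.
    """

    def __init__(self):
        super().__init__()
        self.action = ""
        self.method = "get"
        self.fields = {}
        self._in_form = False
        self._done = False
        self._select = None  # name of the <select> currently open

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "form" and not self._done:
            self._in_form = True
            self.action = a.get("action") or ""
            self.method = (a.get("method") or "get").lower()
        elif not self._in_form:
            return
        elif tag == "input" and a.get("name"):
            kind = (a.get("type") or "text").lower()
            if kind in ("checkbox", "radio"):
                # Only checked boxes and radio buttons are submitted.
                if "checked" in a:
                    self.fields[a["name"]] = a.get("value") or "on"
            elif kind not in ("submit", "reset", "button", "image", "file"):
                self.fields[a["name"]] = a.get("value") or ""
        elif tag == "select" and a.get("name"):
            self._select = a["name"]
        elif tag == "option" and self._select:
            # The first option is the default unless one is marked selected.
            # Assumption: options carry a value attribute (option text is ignored).
            if self._select not in self.fields or "selected" in a:
                self.fields[self._select] = a.get("value") or ""

    def handle_endtag(self, tag):
        if tag == "form" and self._in_form:
            self._in_form = False
            self._done = True
        elif tag == "select":
            self._select = None


def default_submission(page_url, html):
    """Return (method, target_url, encoded_fields) for the first form."""
    parser = FormFieldParser()
    parser.feed(html)
    target = urljoin(page_url, parser.action)
    return parser.method, target, urlencode(parser.fields)


if __name__ == "__main__":
    page = (
        '<form action="/search" method="get">'
        '<input type="text" name="q" value="">'
        '<select name="make"><option value="any">Any</option>'
        '<option value="ford" selected>Ford</option></select>'
        '<input type="submit" value="Go"></form>'
    )
    print(default_submission("http://example.com/cars/index.html", page))
```

For a GET form the encoded fields would be appended to the target URL as a query string; for a POST form they would be sent as the request body. Retrieving the resulting page and deciding which other field values to try are outside this sketch.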
doi:10.1007/3-540-46140-x_17 fatcat:jt4fww7g7ng6ljqd43zcplz54u