Web Harvesting [chapter]

Serguei Mankovskii, Maarten van Steen, Minos Garofalakis, Alan Fekete, Christian S. Jensen, Richard T. Snodgrass, Alex Wun, Vanja Josifovski, Andrei Broder, Dennis Fetterly, Marc Najork, Robert Baumgartner (+55 others)
2009 Encyclopedia of Database Systems  
SYNONYMS web data extraction, web information extraction, web mining DEFINITION Web harvesting describes the process of gathering and integrating data from various heterogeneous web sources. Necessary input is an appropriate knowledge representation of the domain of interest (e.g. an ontology), together with example instances of concepts or relationships (seed knowledge). Output is structured data (e.g. in the form of a relational database) that is gathered from the Web. The term harvesting
more » ... ies that, while passing over a large body of available information, the process gathers only such information that lies in the domain of interest and is, as such, relevant.
doi:10.1007/978-0-387-39940-9_1172 fatcat:g57kzd22ozc2jndgawknid65nm