A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2016; you can also visit the original URL.
The file type is application/pdf
.
An Overview of Web Data Clustering Practices
[chapter]
2004
Lecture Notes in Computer Science
Clustering is a challenging topic in the area of Web data management. Various forms of clustering are required in a wide range of applications, including finding mirrored Web pages, detecting copyright violations, and reporting search results in a structured way. Clustering can either be performed once offline, (independently to search queries), or online (on the results of search queries). Important efforts have focused on mining Web access logs and to cluster search engine results on the fly.
doi:10.1007/978-3-540-30192-9_59
fatcat:cil7pgmogfdcbehyihgsihqx4a