A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
A Scalable Constant-Memory Sampling Algorithm for Pattern Discovery in Large Databases
[chapter]
2002
Lecture Notes in Computer Science
Many data mining tasks can be seen as an instance of the problem of finding the most interesting (according to some utility function) patterns in a large database. In recent years, significant progress has been achieved in scaling algorithms for this task to very large databases through the use of sequential sampling techniques. However, except for sampling-based greedy algorithms which cannot give absolute quality guarantees, the scalability of existing approaches to this problem is only with
doi:10.1007/3-540-45681-3_33
fatcat:4mbvu2za3jec7l4wm4cqtypmma