The Internet Archive has a preservation copy of this work in our general collections.
The file type is application/pdf
.
A Simple Mechanism for Focused Web-harvesting
[article]
2008
arXiv
pre-print
The focused web-harvesting is deployed to realize an automated and comprehensive index databases as an alternative way for virtual topical data integration. The web-harvesting has been implemented and extended by not only specifying the targeted URLs, but also predefining human-edited harvesting parameters to improve the speed and accuracy. The harvesting parameter set comprises three main components. First, the depth-scale of being harvested final pages containing desired information counted
arXiv:0809.0723v1
fatcat:3vj3m36tyrb5ljtieiqohp2ldi