A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2006; you can also visit the original URL.
The file type is
Alternate track papers & posters of the 13th international conference on World Wide Web - WWW Alt. '04
We present a system that tries to automatically collect and monitor Japanese blog collections that include not only ones made with blog softwares but also ones written as normal web pages. Our approach is based on extraction of date expressions and analysis of HTML documents. Our system also extracts and mines useful information from the collected blog pages.doi:10.1145/1010432.1010520 fatcat:ifwk7ssy6jfmnostrazuex3744