Automatically collecting, monitoring, and mining japanese weblogs

Tomoyuki Nanno, Toshiaki Fujiki, Yasuhiro Suzuki, Manabu Okumura
2004 Alternate track papers & posters of the 13th international conference on World Wide Web - WWW Alt. '04  
We present a system that tries to automatically collect and monitor Japanese blog collections that include not only ones made with blog softwares but also ones written as normal web pages. Our approach is based on extraction of date expressions and analysis of HTML documents. Our system also extracts and mines useful information from the collected blog pages.
doi:10.1145/1010432.1010520 fatcat:ifwk7ssy6jfmnostrazuex3744