Wikipedia Mining for an Association Web Thesaurus Construction [chapter]

Kotaro Nakayama, Takahiro Hara, Shojiro Nishio
Web Information Systems Engineering – WISE 2007  
Wikipedia has become a huge phenomenon on the WWW. As a corpus for knowledge extraction, it has various impressive characteristics such as a huge number of articles, live updates, a dense link structure, brief link texts and URL identification for concepts. In this paper, we propose an efficient link mining method, pfibf (Path Frequency - Inversed Backward link Frequency), and an extension method, "forward / backward link weighting (FB weighting)", in order to construct a huge-scale association thesaurus. We proved the effectiveness of our proposed methods compared with other conventional methods such as co-occurrence analysis and TF-IDF.
doi:10.1007/978-3-540-76993-4_27 dblp:conf/wise/NakayamaHN07 fatcat:oaezjmo345e4nnuqyur2r6jaj4