Integrating cross-lingually relevant news articles and monolingual web documents in bilingual lexicon acquisition

Takehito Utsuro, Kohei Hino, Mitsuhiro Kida, Seiichi Nakagawa, Satoshi Sato
2004 Proceedings of the 20th international conference on Computational Linguistics - COLING '04   unpublished
In the framework of bilingual lexicon acquisition from cross-lingually relevant news articles on the Web, it is relatively harder to reliably estimate bilingual term correspondences for low frequency terms. Considering such a situation, this paper proposes to complementarily use much larger monolingual Web documents collected by search engines, as a resource for reliably re-estimating bilingual term correspondences. We experimentally show that, using a sufficient number of monolingual Web
more » ... nts, it is quite possible to have reliable estimate of bilingual term correspondences for those low frequency terms.
doi:10.3115/1220355.1220504 fatcat:fwjgbqyy65dfradezneonxo7te