A cross-lingual framework for monolingual biomedical information retrieval

Dolf Trieschnigg, Djoerd Hiemstra, Franciska de Jong, Wessel Kraaij
2010 Proceedings of the 19th ACM international conference on Information and knowledge management - CIKM '10  
An important challenge for biomedical information retrieval (IR) is dealing with the complex, inconsistent and ambiguous biomedical terminology. Frequently, a concept-based representation defined in terms of a domain-specific terminological resource is employed to deal with this challenge. In this paper, we approach the incorporation of a concept-based representation in monolingual biomedical IR from a crosslingual perspective. In the proposed framework, this is realized by translating and
more » ... ing between text and conceptbased representations. The approach allows for deployment of a rich set of techniques proposed and evaluated in traditional cross-lingual IR. We compare six translation models and measure their effectiveness in the biomedical domain. We demonstrate that the approach can result in significant improvements in retrieval effectiveness over word-based retrieval. Moreover, we demonstrate increased effectiveness of a CLIR framework for monolingual biomedical IR if basic translations models are combined.
doi:10.1145/1871437.1871463 dblp:conf/cikm/TrieschniggHJK10 fatcat:sfyjjbe32bbvtbjgqbsza6w67a