An empirical study on retrieval models for different document genres

Makoto Iwayama, Atsushi Fujii, Noriko Kando, Yuzo Marukawa
2003 Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '03  
Reflecting the rapid growth in the utilization of large test collections for information retrieval since the 1990s, extensive comparative experiments have been performed to explore the effectiveness of various retrieval models. However, most collections were intended for retrieving newspaper articles and technical abstracts. In this paper, we describe the process of producing a test collection for patent retrieval, the NTCIR-3 Patent Retrieval Collection, which includes two years of Japanese patent applications and 31 topics produced by professional patent searchers. We also report experimental results obtained by using this collection to reexamine the effectiveness of existing retrieval models in the context of patent retrieval. The relative superiority among existing retrieval models did not significantly differ depending on the document genre, that is, patents and newspaper articles. Issues related to patent retrieval are also discussed.
doi:10.1145/860480.860482 fatcat:275gtggfazd5lnjhqrqxk3fidm