A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit <a rel="external noopener" href="https://static.aminer.org/pdf/20170130/pdfs/sigir/yrnyqwpmzobcg1v7uvfe3g9el4adz6k0.pdf">the original URL</a>. The file type is <code>application/pdf</code>.
Improving Retrieval Performance for Verbose Queries via Axiomatic Analysis of Term Discrimination Heuristic
<span title="">2017</span>
<i title="ACM Press">
<a target="_blank" rel="noopener" href="https://fatcat.wiki/container/ibcfmixrofb3piydwg5wvir3t4" style="color: black;">Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '17</a>
</i>
Number of terms in a query is a query-speci c constant that is typically ignored in retrieval functions. However, previous studies have shown that the performance of retrieval models varies for di erent query lengths, and it usually degrades when query length increases. A possible reason for this issue can be the extraneous terms in longer queries that makes it a challenge for the retrieval models to distinguish between the key and complementary concepts of the query. As a signal to understand
<span class="external-identifiers">
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3077136.3080761">doi:10.1145/3077136.3080761</a>
<a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigir/AriannezhadMZS17.html">dblp:conf/sigir/AriannezhadMZS17</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/4iaffwtg7vconpe6jnjiqvgdz4">fatcat:4iaffwtg7vconpe6jnjiqvgdz4</a>
</span>
more »
... he importance of a term, inverse document frequency (IDF) can be used to discriminate query terms. In this paper, we propose a constraint to model the interaction between query length and IDF. Our theoretical analysis shows that current state-of-the-art retrieval models, such as BM25, do not satisfy the proposed constraint. We further analyze the BM25 model and suggest a modi cation to adapt BM25 so that it adheres to the new constraint. Our experiments on three TREC collections demonstrate that the proposed modi cation outperforms the baselines, especially for verbose queries.
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190218104037/https://static.aminer.org/pdf/20170130/pdfs/sigir/yrnyqwpmzobcg1v7uvfe3g9el4adz6k0.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/e1/63/e163cd9e3c04f2ef5744f90c861402b2dc2b4208.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3077136.3080761">
<button class="ui left aligned compact blue labeled icon button serp-button">
<i class="external alternate icon"></i>
acm.org
</button>
</a>