A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2018; you can also visit the original URL.
The file type is application/pdf
.
Multi-Dueling Bandits and Their Application to Online Ranker Evaluation
2016
Proceedings of the 25th ACM International on Conference on Information and Knowledge Management - CIKM '16
Online ranker evaluation focuses on the challenge of efficiently determining, from implicit user feedback, which ranker out of a finite set of rankers is the best. It can be modeled by dueling bandits, a mathematical model for online learning under limited feedback from pairwise comparisons. Comparisons of pairs of rankers is performed by interleaving their result sets and examining which documents users click on. The dueling bandits model addresses the key issue of which pair of rankers to
doi:10.1145/2983323.2983659
dblp:conf/cikm/BrostSCL16
fatcat:lopvamobzjdrbnwxvhoktwuguq