Deep Exploration for Recommendation Systems [article]

Zheqing Zhu, Benjamin Van Roy
2021 arXiv pre-print
We investigate the design of recommendation systems that can efficiently learn from sparse and delayed feedback. Deep Exploration can play an important role in such contexts, enabling a recommendation system to much more quickly assess a user's needs and personalize service. We design an algorithm based on Thompson Sampling that carries out Deep Exploration. We demonstrate through simulations that the algorithm can substantially amplify the rate of positive feedback relative to common recommendation system designs in a scalable fashion. These results demonstrate promise that we hope will inspire engineering of production recommendation systems that leverage Deep Exploration.
arXiv:2109.12509v1 fatcat:a244qn4k3zc5rhvxai6y2md2tu