Efficient Online Learning to Rank for Sequential Music Recommendation

Pedro Dalla Vecchia Chaves, Bruno L. Pereira, Rodrygo L. T. Santos
2022 Proceedings of the ACM Web Conference 2022  
Music streaming services heavily rely upon recommender systems to acquire, engage, and retain users. One notable component of these services are playlists, which can be dynamically generated in a sequential manner based on the user's feedback during a listening session. Online learning to rank approaches have recently been shown effective at leveraging such feedback to learn users' preferences in the space of song features. Nevertheless, these approaches can suffer from slow convergence as a
more » ... ult of their random exploration component and get stuck in local minima as a result of their session-agnostic exploitation component. To overcome these limitations, we propose a novel online learning to rank approach which efficiently explores the space of candidate recommendation models by restricting itself to the orthogonal complement of the subspace of previous underperforming exploration directions. Moreover, to help overcome local minima, we propose a session-aware exploitation component which adaptively leverages the current best model during model updates. Our thorough evaluation using simulated listening sessions from Last.fm demonstrates substantial improvements over state-of-the-art approaches regarding earlystage performance and overall long-term convergence.
doi:10.1145/3485447.3512116 fatcat:2tjnrrvzwfgzvflqdts35xhtmq