A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
This paper proposes NeurWIN, a neural Whittle index network that seeks to learn the Whittle indices for any restless bandits by leveraging mathematical properties of the Whittle indices. ... We show that a neural network that produces the Whittle index is also one that produces the optimal control for a set of Markov decision problems. ...

NeurWIN Algorithm: Neural Whittle Index Network

In this section, we present NeurWIN, a deep-RL algorithm that trains neural networks to predict the Whittle indices. ...

arXiv:2110.02128v2 fatcat:l4im36uuabhljdougq33a3ptju