A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Filters
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
[article]
2022
arXiv
pre-print
This paper proposes NeurWIN, a neural Whittle index network that seeks to learn the Whittle indices for any restless bandits by leveraging mathematical properties of the Whittle indices. ...
We show that a neural network that produces the Whittle index is also one that produces the optimal control for a set of Markov decision problems. ...
NeurWIN Algorithm: Neural Whittle Index Network In this section, we present NeurWIN, a deep-RL algorithm that trains neural networks to predict the Whittle indices. ...
arXiv:2110.02128v2
fatcat:l4im36uuabhljdougq33a3ptju