A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Successor Options: An Option Discovery Framework for Reinforcement Learning
[article]
2019
arXiv
pre-print
The options framework in reinforcement learning models the notion of a skill or a temporally extended sequence of actions. The discovery of a reusable set of skills has typically entailed building options, that navigate to bottleneck states. This work adopts a complementary approach, where we attempt to discover options that navigate to landmark states. These states are prototypical representatives of well-connected regions and can hence access the associated region with relative ease. In this
arXiv:1905.05731v1
fatcat:h7zwlqb6v5bprc4bfnijhf5lzy