A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning
[article]
2021
arXiv
pre-print
Operating in the real-world often requires agents to learn about a complex environment and apply this understanding to achieve a breadth of goals. This problem, known as goal-conditioned reinforcement learning (GCRL), becomes especially challenging for long-horizon goals. Current methods have tackled this problem by augmenting goal-conditioned policies with graph-based planning algorithms. However, they struggle to scale to large, high-dimensional state spaces and assume access to exploration
arXiv:2111.09858v1
fatcat:opnzhm5bl5h6dc7utpfcoo7dku