A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators
[article]
2022
arXiv
pre-print
Recent advances in reinforcement learning (RL) have led to a growing interest in applying RL to classical planning domains or applying classical planning methods to some complex RL domains. However, the long-horizon goal-based problems found in classical planning lead to sparse rewards for RL, making direct application inefficient. In this paper, we propose to leverage domain-independent heuristic functions commonly used in the classical planning literature to improve the sample efficiency of
arXiv:2109.14830v2
fatcat:s4bxrobylzgpbdjdm4i4rggnnm