A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
[article]
2022
arXiv
pre-print
Value iteration (VI) is a foundational dynamic programming method, important for learning and planning in optimal control and reinforcement learning. VI proceeds in batches, where the update to the value of each state must be completed before the next batch of updates can begin. Completing a single batch is prohibitively expensive if the state space is large, rendering VI impractical for many applications. Asynchronous VI helps to address the large state space problem by updating one state at a
arXiv:2207.01613v2
fatcat:2pe5tinuh5gdxl3jfroeuz4jie