A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Bounded Incremental Real-Time Dynamic Programming
2007
2007 Frontiers in the Convergence of Bioscience and Information Technologies
A real-time multi-step planning problem is characterized by alternating decision-making and execution processes, whole online decision-making time divided in slices between each execution, and the pressing need for policy that only relates to current step. We propose a new criterion to judge the optimality of a policy based on the upper and lower bound theory. This criterion guarantees that the agent can act earlier in a real-time decision process while an optimal policy with sufficient
doi:10.1109/fbit.2007.14
dblp:conf/fbit/FanC07
fatcat:5okhrvtu6rbzvc4r3iouvupzai