A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Clipping in Neurocontrol by Adaptive Dynamic Programming
2014
IEEE Transactions on Neural Networks and Learning Systems
In adaptive dynamic programming, neurocontrol and reinforcement learning, the objective is for an agent to learn to choose actions so as to minimise a total cost function. In this paper we show that when discretized time is used to model the motion of the agent, it can be very important to do "clipping" on the motion of the agent in the final time step of the trajectory. By clipping we mean that the final time step of the trajectory is to be truncated such that the agent stops exactly at the
doi:10.1109/tnnls.2014.2297991
pmid:25291742
fatcat:hy6xlwnoinhfpi7om6ly5u2bvy