Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations
[article]
Finale Doshi-Velez, George Konidaris
2013
arXiv
pre-print
We introduce the Hidden Parameter Markov Decision Process (HiP-MDP), a framework that parametrizes a family of related dynamical systems with a low-dimensional set of latent factors, and introduce a semiparametric ...
In the control setting, we show that a learned HiP-MDP rapidly identifies the dynamics of a new task instance, allowing an agent to flexibly adapt to task variations. ...
Thus, we could always learn to solve each HiP-MDP instance as its own distinct MDP. Second, the parameter vector θ b is fixed for the duration of the task, and thus the hidden state has no dynamics. ...
arXiv:1308.3513v1
fatcat:ddzfwarcnnfjradolbqmg2bhqy