A Predictive Model for Imitation Learning in Partially Observable Environments

Abdeslam Boularias
2008 Seventh International Conference on Machine Learning and Applications
Learning by imitation has been shown to be a powerful paradigm for automated learning in autonomous robots. This paper presents a general framework of learning by imitation for stochastic and partially observable systems. The model is a Predictive Policy Representation (PPR) whose goal is to represent the teacher's policies without any reference to states. The model is fully described in terms of actions and observations only. We show how this model can efficiently learn the personal behavior and preferences of an assistive robot user.
doi:10.1109/icmla.2008.142 dblp:conf/icmla/Boularias08 fatcat:kxra5ymhqje37ncnvsriv57jmy