Reinforcement learning of multiple tasks using parametric bias

Leszek Rybicki, Yuuya Sugita, Jun Tani
2009 2009 International Joint Conference on Neural Networks  
We propose a reinforcement learning system designed to learn multiple different continuous state-action-space tasks. The system has been tested on a family of space-searching task akin to Morris water maze, but with obstacles. While exploring a task, the agent builds its internal model of the environment and approximates a state value function. For learning multiple tasks, we use a parametric bias switching mechanism in which the value of the parametric bias layer identifies the task for the
more » ... nt. Each task has a specific parametric bias vector, and during training the vectors selforganize to reflect the structure of relationships between tasks in the task set. This mapping of the task set to parametric bias space can later be used to generate novel behaviors of the agent.
doi:10.1109/ijcnn.2009.5178868 dblp:conf/ijcnn/RybickiST09 fatcat:hh456v66tzbk3j2fgj2nx2t3vy