Maximizing Learning Progress: An Internal Reward System for Development [chapter]

Frédéric Kaplan, Pierre-Yves Oudeyer
2004 Lecture Notes in Computer Science  
This chapter presents a generic internal reward system that drives an agent to increase the complexity of its behavior. This reward system does not reinforce a predefined task. Its purpose is to drive the agent to progress in learning given its embodiment and the environment in which it is placed. The dynamics created by such a system are studied first in a simple environment and then in the context of active vision.
doi:10.1007/978-3-540-27833-7_19 fatcat:vvahfutbu5bpjhg33rqzsdmopa