The Internet Archive has a preservation copy of this work in our general collections.
The file type is application/pdf
.
Linear combination of one-step predictive information with an external reward in an episodic policy gradient setting: a critical analysis
2013
Frontiers in Psychology
One of the main challenges in the field of embodied artificial intelligence is the open-ended autonomous learning of complex behaviors. Our approach is to use task-independent, information-driven intrinsic motivation(s) to support task-dependent learning. The work presented here is a preliminary step in which we investigate the predictive information (the mutual information of the past and future of the sensor stream) as an intrinsic drive, ideally supporting any kind of task acquisition.
doi:10.3389/fpsyg.2013.00801
pmid:24204351
pmcid:PMC3816314
fatcat:oi5bhb7zqfhirkfnj5qvixsxlq