A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays
[chapter]
2018
Lecture Notes in Computer Science
During sleep and awake rest, the hippocampus replays sequences of place cells that have been activated during prior experiences. These have been interpreted as a memory consolidation process, but recent results suggest a possible interpretation in terms of reinforcement learning. The Dyna reinforcement learning algorithms use off-line replays to improve learning. Under limited replay budget, a prioritized sweeping approach, which requires a model of the transitions to the predecessors, can be
doi:10.1007/978-3-319-95972-6_4
fatcat:abjqy35lmzdgraxd6mxwul35eq