A multiagent variant of Dyna-Q

G. Weiss
Proceedings Fourth International Conference on MultiAgent Systems  
This paper describes a multiagent variant of Dyna-Q called M-Dyna-Q. Dyna-Q is an integrated single-agent framework for planning, reacting, and learning. Like Dyna-Q, M-Dyna-Q employs two key ideas: learning results can serve as a valuable input for both planning and reacting, and results of planning and reacting can serve as a valuable input to learning. M-Dyna-Q extends Dyna-Q in that planning, reacting, and learning are jointly realized by multiple agents.
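For orientation, the single-agent loop that M-Dyna-Q builds on can be sketched as below. This is a minimal tabular sketch of standard single-agent Dyna-Q, not the M-Dyna-Q algorithm of the paper, whose multiagent details are not given in this abstract. The names `dyna_q` and `chain_step`, the five-state chain environment, and all parameter values are assumptions made for illustration only.

```python
import random
from collections import defaultdict


def dyna_q(step, actions, start, episodes=50, planning_steps=10,
           alpha=0.1, gamma=0.95, epsilon=0.1, max_steps=1000, seed=0):
    """Tabular Dyna-Q sketch; `step(state, action)` returns (reward, next_state, done)."""
    rng = random.Random(seed)
    q = defaultdict(float)  # (state, action) -> estimated value
    model = {}              # (state, action) -> last observed (reward, next_state, done)

    def greedy(state):
        # Pick a highest-valued action, breaking ties at random.
        best = max(q[(state, a)] for a in actions)
        return rng.choice([a for a in actions if q[(state, a)] == best])

    def update(s, a, r, s2, done):
        # One-step Q-learning update, used for real and simulated experience alike.
        target = r if done else r + gamma * max(q[(s2, b)] for b in actions)
        q[(s, a)] += alpha * (target - q[(s, a)])

    for _ in range(episodes):
        state = start
        for _ in range(max_steps):
            # Reacting: act on the current value estimates (epsilon-greedy).
            if rng.random() < epsilon:
                action = rng.choice(actions)
            else:
                action = greedy(state)
            reward, next_state, done = step(state, action)

            # Learning: update values and the world model from real experience.
            update(state, action, reward, next_state, done)
            model[(state, action)] = (reward, next_state, done)

            # Planning: replay transitions sampled from the learned model.
            for _ in range(planning_steps):
                (s, a), (r, s2, d) = rng.choice(list(model.items()))
                update(s, a, r, s2, d)

            if done:
                break
            state = next_state
    return q


def chain_step(state, action):
    # Hypothetical toy environment: states 0..4 on a line, reward 1 on reaching state 4.
    next_state = min(max(state + action, 0), 4)
    done = next_state == 4
    return (1.0 if done else 0.0), next_state, done


q_values = dyna_q(chain_step, actions=[-1, 1], start=0)
```

The sketch shows the two ideas named in the abstract in their single-agent form: learned values and the learned model feed planning and action selection, and both real and simulated transitions feed the same learning update.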
doi:10.1109/icmas.2000.858525 dblp:conf/icmas/Weiss00 fatcat:gi7ese7vojepzh6qtan24burny