Graphical models for online solutions to interactive POMDPs

Prashant Doshi, Yifeng Zeng, Qiongyu Chen
2007 Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems - AAMAS '07  
We develop a new graphical representation for interactive partially observable Markov decision processes (I-POMDPs) that is significantly more transparent and semantically clear than the previous representation. These graphical models, called interactive dynamic influence diagrams (I-DIDs), seek to explicitly model the structure that is often present in real-world problems by decomposing the situation into chance and decision variables, and the dependencies between the variables. I-DIDs generalize DIDs, which may be viewed as graphical representations of POMDPs, to multiagent settings in the same way that I-POMDPs generalize POMDPs. I-DIDs may be used to compute the policy of an agent online as the agent acts and observes in a setting that is populated by other interacting agents. Using several examples, we show how I-DIDs may be applied and demonstrate their usefulness.
doi:10.1145/1329125.1329387 dblp:conf/atal/DoshiZC07 fatcat:havbf62gbfgpdkseu75fjbs5li