Decentralized Multiagent Actor-Critic Algorithm Based on Message Diffusion

Siyuan Ding, Shengxiang Li, Guangyi Liu, Ou Li, Ke Ke, Yijie Bai, Weiye Chen, Giuseppe Quero
2021 Journal of Sensors  
The exponential explosion of joint actions and massive data collection are two main challenges in multiagent reinforcement learning algorithms with centralized training. To overcome these problems, in this paper, we propose a model-free and fully decentralized actor-critic multiagent reinforcement learning algorithm based on message diffusion. To this end, the agents are assumed to be placed in a time-varying communication network. Each agent makes limited observations regarding the global
more » ... and joint actions; therefore, it needs to obtain and share information with others over the network. In the proposed algorithm, agents hold local estimations of the global state and joint actions and update them with local observations and the messages received from neighbors. Under the hypothesis of the global value decomposition, the gradient of the global objective function to an individual agent is derived. The convergence of the proposed algorithm with linear function approximation is guaranteed according to the stochastic approximation theory. In the experiments, the proposed algorithm was applied to a passive location task multiagent environment and achieved superior performance compared to state-of-the-art algorithms.
doi:10.1155/2021/8739206 fatcat:z2kvi3ym7ndjdgwqea5i4lpbai