Modelling Cournot Games as Multi-agent Multi-armed Bandits
2022
Proceedings of the ... International Florida Artificial Intelligence Research Society Conference
We investigate the use of a multi-agent multi-armed bandit (MA-MAB) setting for modeling repeated Cournot oligopoly games, in which firms act as agents and each arm represents a discrete production quantity. Each agent interacts with its own separate and independent bandit problem, making sequential choices among arms to maximize its own reward. Agents do not have any information about the environment; they can only see their own rewards after
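The formulation described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the linear inverse demand P = max(0, a - b*Q), the constant marginal cost c, the parameter values, the arm grid, and the use of a plain epsilon-greedy learner (the abstract does not say which bandit algorithms the authors evaluate). The names `cournot_profits`, `EpsilonGreedyAgent`, and `simulate` are hypothetical.

```python
import random


def cournot_profits(quantities, a=100.0, b=1.0, c=10.0):
    """Per-firm profit under an ASSUMED linear inverse demand P = max(0, a - b*Q)
    and an ASSUMED constant marginal cost c."""
    price = max(0.0, a - b * sum(quantities))
    return [(price - c) * q for q in quantities]


class EpsilonGreedyAgent:
    """Epsilon-greedy bandit learner (illustrative choice, not the paper's method).
    It observes only the reward of the arm it pulled."""

    def __init__(self, n_arms, epsilon=0.1, rng=None):
        self.epsilon = epsilon
        self.rng = rng or random.Random()
        self.counts = [0] * n_arms    # number of pulls per arm
        self.values = [0.0] * n_arms  # running mean reward per arm

    def select_arm(self):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.values))
        best = max(self.values)
        ties = [i for i, v in enumerate(self.values) if v == best]
        return self.rng.choice(ties)

    def update(self, arm, reward):
        # Incremental update of the sample mean for the pulled arm.
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]


def simulate(n_firms=2, arms=(10, 20, 30, 40, 50), rounds=5000, seed=0):
    """Repeated game: each firm runs its own independent bandit over the same
    discrete quantity grid and never sees the other firms' choices or rewards."""
    seeder = random.Random(seed)
    agents = [
        EpsilonGreedyAgent(len(arms), rng=random.Random(seeder.randrange(2**32)))
        for _ in range(n_firms)
    ]
    for _ in range(rounds):
        chosen = [agent.select_arm() for agent in agents]
        profits = cournot_profits([arms[i] for i in chosen])
        for agent, arm, profit in zip(agents, chosen, profits):
            agent.update(arm, profit)
    return agents


if __name__ == "__main__":
    arms = (10, 20, 30, 40, 50)
    for k, agent in enumerate(simulate(arms=arms)):
        most_played = arms[agent.counts.index(max(agent.counts))]
        print(f"firm {k}: most played quantity = {most_played}")
```

Under these assumed parameters, the textbook symmetric Cournot-Nash quantity for two firms is (a - c) / (3b) = 30, which gives a reference point for reading the output. The sketch makes no claim that the learners settle on that quantity.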
doi:10.32473/flairs.v35i.130697
fatcat:a2dkksnkfzf4dgr7lxvrwveq4y