A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Penalty-Regulated Dynamics and Robust Learning Procedures in Games
2015
Mathematics of Operations Research
Starting from a heuristic learning scheme for strategic n-person games, we derive a new class of continuous-time learning dynamics which consist of a replicator-like term adjusted by an entropic penalty that keeps players' strategies away from the boundary of the game's strategy space. These entropy-driven dynamics are equivalent to players taking an exponentially discounting aggregate of their on-going payoffs and then using a quantal response choice model to pick an action based on these
doi:10.1287/moor.2014.0687
fatcat:z5asnj7uzjcnreruv2vdjmwzha