On Playing Golf with Two Balls

Ioana Dumitriu, Prasad Tetali, Peter Winkler
2003 SIAM Journal on Discrete Mathematics  
We analyze and solve a game in which a player chooses which of several Markov chains to advance, with the object of minimizing the expected time (or cost) for one of the chains to reach a target state. The solution entails computing (in polynomial time) a function γ-a variety of "Gittins index"-on the states of the individual chains, the minimization of which produces an optimal strategy. It turns out that γ is a useful cousin of the expected hitting time of a Markov chain but is defined, for
more » ... ample, even for random walks on infinite graphs. We derive the basic properties of γ and consider its values in some natural situations.
doi:10.1137/s0895480102408341 fatcat:xd66zvj2j5h5popr3yhoxlv35u