A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Myopic Bounds for Optimal Policy of POMDPs: An Extension of Lovejoy's Structural Results
2015
Operations Research
This paper provides a relaxation of the sufficient conditions and an extension of the structural results for partially observed Markov decision processes (POMDPs) obtained by Lovejoy in 1987. Sufficient conditions are provided so that the optimal policy can be upper and lower bounded by judiciously chosen myopic policies. These myopic policy bounds are constructed to maximize the volume of belief states where they coincide with the optimal policy. Numerical examples illustrate these myopic
doi:10.1287/opre.2014.1332
fatcat:f7wtnbkemvfrtcibi6hb2hb2gi