Temporal Logic Control of POMDPs via Label-based Stochastic Simulation Relations

S. Haesaert, P. Nilsson, C.I. Vasile, R. Thakker, A. Agha-mohammadi, A.D. Ames, R.M. Murray
2018 IFAC-PapersOnLine  
The synthesis of controllers guaranteeing linear temporal logic specifications on partially observable Markov decision processes (POMDP) via their belief models causes computational issues due to the continuous spaces. In this work, we construct a finite-state abstraction on which a control policy is synthesized and refined back to the original belief model. We introduce a new notion of labelbased approximate stochastic simulation to quantify the deviation between belief models. We develop a
more » ... ust synthesis methodology that yields a lower bound on the satisfaction probability, by compensating for deviations a priori, and that utilizes a less conservative control refinement.
doi:10.1016/j.ifacol.2018.08.046 fatcat:lf364ahqxbbhzldzrrx7uatyu4