A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Sample Complexity Bounds for Stochastic Shortest Path with a Generative Model
2021
International Conference on Algorithmic Learning Theory
We consider the objective of computing an ε-optimal policy in a stochastic shortest path (SSP) setting, provided that we can access a generative sampling oracle. We propose two algorithms for this setting and derive PAC bounds on their sample complexity: one for the case of positive costs and the other for the case of non-negative costs under a restricted optimality criterion. While tight sample complexity bounds have been derived for the finite-horizon and discounted MDPs, the SSP problem is a
dblp:conf/alt/TarbouriechPVL21
fatcat:kgrxchwxkfezpo4bruucv4kjuu