A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system
2011
Interspeech 2011
unpublished
The optimization of dialogue policies using reinforcement learning (RL) is now an accepted part of the state of the art in spoken dialogue systems (SDS). Yet, it is still the case that the commonly used training algorithms for SDS require a large number of dialogues and hence most systems still rely on artificial data generated by a user simulator. Optimization is therefore performed off-line before releasing the system to real users. Gaussian Processes (GP) for RL have recently been applied to
doi:10.21437/interspeech.2011-434
fatcat:zdxenj44izbkfnghea5h7h5tiq