Deceptive Kernel Function on Observations of Discrete POMDP [article]

Zhili Zhang, Quanyan Zhu
<span title="2020-08-12">2020</span> <i>arXiv</i> <span class="release-stage">pre-print</span>
This paper studies deception applied to an agent in a partially observable Markov decision process (POMDP). We introduce a deceptive kernel function (the kernel) applied to the agent's observations in a discrete POMDP. Considering three characteristic algorithms the agent may use (value iteration, value function approximation, and POMCP), we analyze how its belief is misled by the falsified observations that the kernel outputs, and we anticipate the probable threat to the agent's reward and potentially to other aspects of its performance. We ... te our expectation and explore more detrimental effects of the deception by experimenting on two POMDP problems. The results show that the kernel applied to the agent's observations can affect its belief and substantially lower its resulting rewards; meanwhile, certain implementations of the kernel can induce other abnormal behaviors in the agent.
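The mechanism summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the kernel is a row-stochastic matrix `K[o][o_fake]` giving the probability that a true observation `o` is replaced by `o_fake`, and it uses a hypothetical two-state problem with made-up numbers. The names `belief_update`, `apply_kernel`, and `FLIP_KERNEL` are illustrative only.

```python
import random


def belief_update(belief, T, Z, obs):
    """Standard discrete Bayes filter for a fixed action.

    T[s][s2] = P(s2 | s), Z[s2][o] = P(o | s2).
    """
    n = len(belief)
    # Prediction step: push the belief through the transition model.
    predicted = [sum(belief[s] * T[s][s2] for s in range(n)) for s2 in range(n)]
    # Correction step: weight by the likelihood of the received observation.
    unnorm = [Z[s2][obs] * predicted[s2] for s2 in range(n)]
    total = sum(unnorm)
    if total == 0.0:
        raise ValueError("observation has zero probability under the model")
    return [u / total for u in unnorm]


def apply_kernel(K, true_obs, rng):
    """Sample the falsified observation from row K[true_obs] (assumed form)."""
    return rng.choices(range(len(K[true_obs])), weights=K[true_obs], k=1)[0]


# Hypothetical two-state, two-observation model (illustrative numbers).
T = [[1.0, 0.0], [0.0, 1.0]]          # the state does not change
Z = [[0.85, 0.15], [0.15, 0.85]]      # observation matches the state 85% of the time
FLIP_KERNEL = [[0.0, 1.0], [1.0, 0.0]]  # assumed kernel: always swap the observation

prior = [0.5, 0.5]
true_obs = 0  # the environment emits observation 0

# The agent is unaware of the kernel and updates with whatever it receives.
honest = belief_update(prior, T, Z, true_obs)
fake_obs = apply_kernel(FLIP_KERNEL, true_obs, random.Random(0))
deceived = belief_update(prior, T, Z, fake_obs)

print("belief without deception:", honest)
print("belief with deception:   ", deceived)
```

In this sketch the agent's update rule is unchanged; only its input is falsified, so its belief mass moves toward the wrong state. How that shift affects reward depends on the planner in use, which the sketch does not model.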
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2008.05585v1">arXiv:2008.05585v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/uyl3ryghuva5dfox3gfp2j7r6a">fatcat:uyl3ryghuva5dfox3gfp2j7r6a</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200819220106/https://arxiv.org/pdf/2008.05585v1.pdf" title="fulltext PDF download">Web Archive [PDF]</a>