Playing against Nature: causal discovery for decision making under uncertainty [article]

M. Gonzalez-Soto, L.E. Sucar, H.J. Escalante
2018-07-03, arXiv, pre-print
We consider decision problems under uncertainty in which the options available to a decision maker and the resulting outcome are related through a causal mechanism that is unknown to the decision maker. We ask how a decision maker can learn about this causal mechanism through sequential decision making while also using her current causal knowledge within each round to make better choices than she would have made without it. We propose a decision-making procedure in which an agent holds beliefs about her environment; these beliefs are used to make a choice and are updated using the observed outcome. As a proof of concept, we present an implementation of this causal decision-making model and apply it in a simple scenario. We show that the model achieves performance similar to classic Q-learning while also acquiring a causal model of the environment.
arXiv:1807.01268v1, fatcat:wiheqa5o7jcypdwif3dr4zc4tm