A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit <a rel="external noopener" href="https://arxiv.org/pdf/2006.09786v1.pdf">the original URL</a>. The file type is <code>application/pdf</code>.
Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections
[article]
<span title="2020-06-17">2020</span>
<i >
arXiv
</i>
<span class="release-stage" >pre-print</span>
This paper investigates how a Bayesian reinforcement learning method can be used to create a tactical decision-making agent for autonomous driving in an intersection scenario, where the agent can estimate the confidence of its recommended actions. An ensemble of neural networks, with additional randomized prior functions (RPF), are trained by using a bootstrapped experience replay memory. The coefficient of variation in the estimated Q-values of the ensemble members is used to approximate the
<span class="external-identifiers">
<a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2006.09786v1">arXiv:2006.09786v1</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/g5krdbp335gvja25gd4y7vaxga">fatcat:g5krdbp335gvja25gd4y7vaxga</a>
</span>
more »
... certainty, and a criterion that determines if the agent is sufficiently confident to make a particular decision is introduced. The performance of the ensemble RPF method is evaluated in an intersection scenario, and compared to a standard Deep Q-Network method. It is shown that the trained ensemble RPF agent can detect cases with high uncertainty, both in situations that are far from the training distribution, and in situations that seldom occur within the training distribution. In this study, the uncertainty information is used to choose safe actions in unknown situations, which removes all collisions from within the training distribution, and most collisions outside of the distribution.
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200623082205/https://arxiv.org/pdf/2006.09786v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
</button>
</a>
<a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2006.09786v1" title="arxiv.org access">
<button class="ui compact blue labeled icon button serp-button">
<i class="file alternate outline icon"></i>
arxiv.org
</button>
</a>