Learning State Features from Policies to Bias Exploration in Reinforcement Learning [report]

Bryan Singer, Manuela Veloso
1999 unpublished
doi:10.21236/ada363533 fatcat:f6aaolhkzfchblxn5qdn7x5lty