Model-Free reinforcement learning with continuous action in practice

T. Degris, P. M. Pilarski, R. S. Sutton
2012 2012 American Control Conference (ACC)  
doi:10.1109/acc.2012.6315022 fatcat:zcffq2qphvfpnnszvs5tu4ts5q