A Study of Adaptive QoS Routing scheme using Policy-gradient Reinforcement Learning
정책 기울기 값 강화학습을 이용한 적응적인 QoS 라우팅 기법 연구

Jeong-Soo Han
2011 Journal of the Korea Society of Computer and Information  
In this paper, we propose a policy-gradient routing scheme based on Reinforcement Learning (RL) that can be used for adaptive QoS routing. Policy-gradient RL routing learns the network environment quickly because it adapts its policy along the gradient of the estimated average reward. We show that faster learning of the network environment results in a higher routing success rate. To demonstrate this, we run simulations and compare the proposed scheme with three other schemes.
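The abstract does not give the update rule, so the following is only a minimal sketch of the general idea, not the scheme evaluated in the paper. It assumes a single router choosing among candidate next hops with a softmax policy, a reward of 1 when the chosen hop satisfies the QoS constraint and 0 otherwise, and a running average of the reward used as the baseline in the gradient step. The function names, the `success_prob` values, and the learning rates are all hypothetical.

```python
import math
import random


def softmax(prefs):
    """Turn per-hop preferences into selection probabilities."""
    m = max(prefs)  # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def train_next_hop_policy(success_prob, steps=5000, lr=0.1,
                          baseline_rate=0.01, seed=0):
    """Learn a softmax policy over next hops by policy-gradient ascent.

    success_prob[i] is a hypothetical probability that forwarding via
    hop i meets the QoS constraint (an assumption for this sketch).
    """
    rng = random.Random(seed)
    prefs = [0.0] * len(success_prob)  # policy parameters, one per hop
    avg_reward = 0.0                   # running estimate of average reward

    for _ in range(steps):
        probs = softmax(prefs)
        hop = rng.choices(range(len(prefs)), weights=probs)[0]
        reward = 1.0 if rng.random() < success_prob[hop] else 0.0

        # Reward relative to the average-reward baseline.
        advantage = reward - avg_reward

        # Gradient of log pi(hop) w.r.t. prefs[i] is 1[i == hop] - probs[i].
        for i in range(len(prefs)):
            grad = (1.0 if i == hop else 0.0) - probs[i]
            prefs[i] += lr * advantage * grad

        # Move the baseline toward the observed reward.
        avg_reward += baseline_rate * (reward - avg_reward)

    return softmax(prefs)


# Three candidate next hops with made-up QoS success probabilities.
policy = train_next_hop_policy([0.2, 0.9, 0.5])
print(policy)
```

In this sketch the baseline only reduces the variance of the update; the policy shifts probability toward hops whose reward exceeds the current average, which is one common reading of "average estimated reward gradient" in the policy-gradient literature.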
doi:10.9708/jksci.2011.16.2.093 fatcat:crksdf4lybdxjcu7ll2zfy7wee