A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2018; you can also visit the original URL.
The file type is application/pdf.
Reinforcement Learning Algorithms: Survey and Classification
2017
Indian Journal of Science and Technology
In Reinforcement Learning (RL), it is well known that there are two broad classes: model-based and model-free RL [3]. A model-based RL agent has knowledge of the environment in which it acts, and of itself as well. The state-transition model and the reward model are available a priori. That is, the agent knows the environment in which it is acting, including the state-transition probabilities P(s'|s, a). It also has the reward matrix
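To make the model-based setting concrete, the following is a minimal sketch of planning when P(s'|s, a) and the reward table are both known in advance. It uses value iteration, which is one standard choice and not necessarily the method the survey discusses. The two-state MDP, the names `P`, `R`, `q_value` and `value_iteration`, and all numbers are illustrative assumptions, not taken from the paper.

```python
# Hypothetical 2-state, 2-action MDP; every number here is an illustrative assumption.
# P[s][a] lists (next_state, probability) pairs, i.e. the known model P(s'|s, a).
P = {
    0: {0: [(0, 1.0)], 1: [(1, 0.8), (0, 0.2)]},
    1: {0: [(0, 1.0)], 1: [(1, 1.0)]},
}
# R[s][a] is the known immediate reward for taking action a in state s.
R = {
    0: {0: 0.0, 1: 1.0},
    1: {0: 0.0, 1: 2.0},
}


def q_value(V, s, a, gamma):
    """One-step lookahead: reward plus discounted expected value of the next state."""
    return R[s][a] + gamma * sum(p * V[s2] for s2, p in P[s][a])


def value_iteration(P, R, gamma=0.9, tol=1e-8):
    """Compute state values and a greedy policy using only the known model."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            best = max(q_value(V, s, a, gamma) for a in P[s])
            delta = max(delta, abs(best - V[s]))
            V[s] = best  # in-place update; the agent never samples the environment
        if delta < tol:
            break
    policy = {s: max(P[s], key=lambda a: q_value(V, s, a, gamma)) for s in P}
    return V, policy


V, policy = value_iteration(P, R)
print(V, policy)
```

Because the model is given, the agent can compute a policy purely by lookahead over `P` and `R`. A model-free method would instead have to estimate values from sampled transitions.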
doi:10.17485/ijst/2017/v10i1/109385
fatcat:n7fsqz5mxfbetlopeplp2h2pke