Associative reinforcement learning: A generate and test algorithm

Leslie Pack Kaelbling
1994 Machine Learning  
An agent that must learn to act in the world by trial and error faces the reinforcement learning problem, which is quite different from standard concept learning. Although good algorithms exist for this problem in the general case, they are often quite inefficient and do not exhibit generalization. One strategy is to find restricted classes of action policies that can be learned more efficiently. This paper pursues that strategy by developing an algorithm that performans an on-line search
more » ... h the space of action mappings, expressed as Boolean formulae. The algorithm is compared with existing methods in empirical trials and is shown to have very good performance.
doi:10.1007/bf00993348 fatcat:t7hrpohocngzndkl3sdreg5uca