USAK METHOD FOR THE REINFORCEMENT LEARNING

Mykhailo Novotarskyi, Valentin Kuzmich
2020 Information, Computing and Intelligent systems  
In the field of reinforcement learning, tabular methods have become widespread. There are many important scientific results, which significantly improve their performance in specific applications. However, the application of tabular methods is limited due to the large amount of resources required to store value functions in tabular form under high-dimensional state spaces. A natural solution to the memory problem is to use parameterized function approximations. However, conventional approaches
more » ... o function approximations, in most cases, have ceased to give the desired result of memory reduction in solving realworld problems. This fact became the basis for the application of new approaches, one of which is the use of Sparse Distributed Memory (SDM) based on Kanerva coding. A further development of this direction was the method of Similarity-Aware Kanerva (SAK). In this paper, a modification of the SAK method is proposed, the Uniform Similarity-Aware Kanerva (USAK) method, which is based on the uniform distribution of prototypes in the state space. This approach has reduced the use of RAM required to store prototypes. In addition, reducing the receptive distance of each of the prototypes made it possible to increase the learning speed by reducing the number of calculations in the linear approximator.
doi:10.20535/2708-4930.1.2020.216042 fatcat:citwm63udnaslfa7hr7jvq2sp4