A hybrid architecture for function approximation
2008
2008 6th IEEE International Conference on Industrial Informatics
This paper proposes a new approach to value function estimation that combines temporal-difference (TD) learning with an on-line variant of Random Forest (RF). We call this implementation Random-TD. First, RF is adapted to an on-line mode in order to deal with large state spaces and memory constraints, while the state-action mapping is based on the Bellman error, or on the TD error. We evaluate the potential of the proposed procedure in terms of a reduction in the Bellman error with extended
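For readers unfamiliar with the TD error mentioned in the abstract, the following is a minimal sketch of the standard one-step TD error and a tabular TD(0) update. It is not the paper's Random-TD method: the function names `td_error` and `td0_update`, the dictionary-based value table, and the default step size and discount factor are all assumptions chosen for illustration.

```python
def td_error(reward, value_s, value_next, gamma=0.9, terminal=False):
    """One-step TD error: delta = r + gamma * V(s') - V(s).

    For a terminal transition the bootstrap term gamma * V(s') is dropped.
    """
    target = reward if terminal else reward + gamma * value_next
    return target - value_s


def td0_update(values, state, reward, next_state,
               alpha=0.1, gamma=0.9, terminal=False):
    """Apply one tabular TD(0) update in place and return the TD error.

    `values` is a plain dict mapping states to value estimates
    (an illustrative stand-in for a function approximator).
    Unseen states default to a value of 0.0.
    """
    v_s = values.get(state, 0.0)
    v_next = values.get(next_state, 0.0)
    delta = td_error(reward, v_s, v_next, gamma, terminal)
    values[state] = v_s + alpha * delta
    return delta


if __name__ == "__main__":
    v = {"a": 0.0, "b": 1.0}
    d = td0_update(v, "a", reward=1.0, next_state="b", alpha=0.5, gamma=0.5)
    print(d, v["a"])
```

In an approach such as the one the abstract describes, the table would presumably be replaced by a learned regressor and the TD error would serve as the training signal; the details of that step are not given in this record.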
doi:10.1109/indin.2008.4618267
fatcat:mevmnn5ktngkzo3m6c3riyx6qu