A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2016; you can also visit the original URL.
The file type is application/pdf
.
Estimating Probability Distribution with Q-learning for Biped Gait Generation and Optimization
2006
2006 IEEE/RSJ International Conference on Intelligent Robots and Systems
A new biped gait generation and optimization method is proposed in the frame of Estimation of Distribution Algorithms (EDAs) with Q-learning method. By formulating the biped gait synthesis as a constrained multi-objective optimization problem, a dynamically stable and low energy cost biped gait is generated by EDAs with Q-learning (EDA Q), which estimate probability distributions derived from the objective function to be optimized to generate searching points in the highly-coupled and high
doi:10.1109/iros.2006.281728
dblp:conf/iros/HuZS06
fatcat:qng7gccawfhwpf5qe4j32dyhx4