Biological robot arm motion through reinforcement learning

J. Izawa, T. Kondo, K. Ito
Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292)  
The present paper discusses an optimal control method of biological robot arm which has redundancy of the mapping from the control input to the task goal. The control input space is divided into a couple of subspaces according to a priority order depending on the progress and stability of learning. In the proposed method, the search noise which is required for reinforcement learning is restricted within the first priority subspace. Then the constraint is relaxed with the progress of learning,
more » ... d the search space extends to the second priority subspace in accordance with the history of learning. The method was applied to the musculoskeletal system as an example of biological control systems. Dynamic manipulation is obtained through reinforcement learning with no previous knowledge of the arm's dynamics. The effectiveness of the proposed method is shown by computational simulation. Keywordslearning control, bio-mimetic robot, reinforcement,learning, neural network, over-actuated system 3398 0-7803-7272-7/02/$17.00
doi:10.1109/robot.2002.1014236 dblp:conf/icra/IzawaKI02 fatcat:gcexfhnwjfa5pi7nyibtej6z2e