Abstract
We present a method for transferring behaviour from humans to robots via apprenticeship learning. While previous methods have relied on an accurate model of the demonstrator's dynamics, in most practical settings such models fail to capture (i) complex, non-linear dynamics of the human musculoskeletal system, and (ii) inconsistencies between modelling assumptions and the configuration and placement of measurement apparatus. To avoid such issues, we propose a model-free approach to apprenticeship learning, in which off-policy, model-free reinforcement learning techniques are used to extract a model of the objective function optimised in human behaviour. As a key ingredient, we derive a novel formulation of Least Squares Policy Iteration (LSPI) and Least Squares Temporal Difference learning (LSTD) to enable their application in this setting. The robustness of our approach is demonstrated in experiments where human hitting behaviour is transferred to a non-biomorphic robotic device.
| Original language | English |
|---|---|
| Title of host publication | 11th IEEE-RAS International Conference on Humanoid Robots, Bled, Slovenia |
| Publisher | Institute of Electrical and Electronics Engineers |
| Pages | 239-246 |
| Number of pages | 7 |
| ISBN (Electronic) | 978-1-61284-867-9 |
| ISBN (Print) | 978-1-61284-866-2 |
| Publication status | Published - 2011 |
Title: Model-Free Apprenticeship Learning for Transfer of Human Impedance Behaviour
Projects
Sensorimotor structuring of perception and action for emerging cognition - LINKED TO R82639 & R82640
Vijayakumar, S. (Principal Investigator)
1/10/06 → 30/06/10
Project: Research