Projects per year
Abstract
Learning algorithms are enabling robots to solve
increasingly challenging real-world tasks. These approaches often
rely on demonstrations and reproduce the behavior shown.
Unexpected changes in the environment or in robot morphology
may require using different behaviors to achieve the same effect,
for instance to reach and grasp an object in changing clutter. An
emerging paradigm addressing this robustness issue is to learn a
diverse set of successful behaviors for a given task, from which a
robot can select the most suitable policy when faced with a new
environment. In this paper, we explore a novel realization of this
vision by learning a generative model over policies. Rather than
learning a single policy, or a small fixed repertoire, our generative
model for policies compactly encodes an unbounded number
of policies and allows novel controller variants to be sampled.
Leveraging our generative policy network, a robot can sample
novel behaviors until it finds one that works for a new scenario.
We demonstrate this idea with an application of robust ball-throwing in the presence of obstacles, as well as joint-damage-robust throwing. We show that this approach achieves a greater
diversity of behaviors than an existing evolutionary approach,
while maintaining good efficacy of sampled behaviors, allowing
a Baxter robot to hit targets more often when ball throwing in
the presence of varying obstacles or joint impediments.
Original language | English |
---|---|
Pages (from-to) | 1344-1355 |
Number of pages | 11 |
Journal | IEEE Transactions on Cognitive and Developmental Systems |
Volume | 14 |
Issue number | 4 |
Early online date | 10 Jul 2020 |
DOIs | |
Publication status | Published - 1 Dec 2022 |
Keywords / Materials (for Non-textual outputs)
- Machine learning
- robot learning
Fingerprint
Dive into the research topics of 'Behavioral Repertoire via Generative Adversarial Policy Networks'. Together they form a unique fingerprint.Projects
- 2 Finished
-
UK Robotics and Artificial Intelligence Hub for Offshore Energy Asset Integrity Management (ORCA)
Vijayakumar, S., Mistry, M., Ramamoorthy, R. & Williams, C.
1/10/17 → 31/03/22
Project: Research
-
DREAM - Deferred Restructuring of Experience in Autonomous Machines
1/09/16 → 31/12/18
Project: Research