Projects per year
Abstract
This paper presents a model for planning in a highly complex game, where certain action types are more common than others and cyclic behaviour can also easily arise. These issues are addressed by exploiting the inherent structure among the possible options to enhance the online learning algorithm: sampling during Monte Carlo Tree Search becomes a two step process, by first sampling from a distribution over the types of legal actions followed by sampling from individual actions of the chosen type. This policy drastically reduces the breadth of the rollout as well as its depth by avoiding redundant sampling behaviour. The result is a large increase in both the performance and efficiency of the model. Another contribution of this paper is assessing the benefits of a parallel implementation and afterstates in complex games. Evaluation is done via agent simulations in the board game Settlers of Catan. The resulting agent is the first
based on purely online learning strategies that can handle the full set of legal actions of the game. The evaluation shows that our model outperforms previous state-of-the-art agents while taking decisions in a time threshold tolerated by human opponents.
based on purely online learning strategies that can handle the full set of legal actions of the game. The evaluation shows that our model outperforms previous state-of-the-art agents while taking decisions in a time threshold tolerated by human opponents.
Original language | English |
---|---|
Title of host publication | IEEE Technically Sponsored Intelligent Systems Conference (IntelliSys 2017) |
Place of Publication | London. UK |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 729-737 |
Number of pages | 9 |
ISBN (Electronic) | 978-1-5090-6435-9 |
ISBN (Print) | 978-1-5090-6436-6 |
DOIs | |
Publication status | Published - 26 Mar 2018 |
Event | SAI Intelligent Systems Conference 2017 - London, United Kingdom Duration: 7 Sept 2017 → 8 Sept 2017 http://saiconference.com/Conferences/IntelliSys2017 |
Conference
Conference | SAI Intelligent Systems Conference 2017 |
---|---|
Abbreviated title | IntelliSys 2017 |
Country/Territory | United Kingdom |
City | London |
Period | 7/09/17 → 8/09/17 |
Internet address |
Fingerprint
Dive into the research topics of 'Exploiting Action Categories in Learning Complex Games'. Together they form a unique fingerprint.Projects
- 2 Finished
-
STAC - strategic conversation
Lascarides, A. (Principal Investigator)
1/06/11 → 31/05/17
Project: Research
-
JAMES Joint Action for multimodal embodies social systems
Lascarides, A. (Principal Investigator) & Petrick, R. (Co-investigator)
1/02/11 → 31/07/14
Project: Research
Profiles
-
Alex Lascarides
- School of Informatics - Personal Chair in Semantics
- Institute of Language, Cognition and Computation
Person: Academic: Research Active