Reasoning about Hypothetical Agent Behaviours and their Parameters

S.V. Albrecht, Peter Stone

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Agents can achieve effective interaction with previously unknown other agents by maintaining beliefs over a set of hypothetical behaviours, or types, that these agents may have. A current limitation in this method is that it does not recognise parameters within type specifications, because types are viewed as blackbox mappings from interaction histories to probability distributions over actions. In this work, we propose a general method which allows an agent to reason about both the relative likelihood of types and the values of any bounded continuous parameters within types. The method maintains individual parameter estimates for each type and selectively updates the estimates for some types after each observation. We propose different methods for the selection of types and the estimation of parameter values. The proposed methods are evaluated in detailed experiments, showing that updating the parameter estimates of a single type after each observation can be sufficient to achieve good performance.
Original languageEnglish
Title of host publicationProceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-17)
Place of PublicationSão Paulo, Brazil
PublisherInternational Foundation for Autonomous Agents and Multiagent Systems
Pages547-555
Number of pages9
Publication statusPublished - 1 May 2017
Event16th International Conference on Autonomous Agents and Multiagent Systems 2017 - Sao Paulo, Brazil
Duration: 8 May 201712 May 2017
http://www.ifaamas.org/

Conference

Conference16th International Conference on Autonomous Agents and Multiagent Systems 2017
Abbreviated titleAAMAS 2017
Country/TerritoryBrazil
CitySao Paulo
Period8/05/1712/05/17
Internet address

Fingerprint

Dive into the research topics of 'Reasoning about Hypothetical Agent Behaviours and their Parameters'. Together they form a unique fingerprint.

Cite this