Projects per year
Abstract / Description of output
Methods for learning and planning in sequential decision problems often assume the learner is aware of all possible states and actions in advance. This assumption is sometimes untenable. In this paper, we give a method to learn factored markov decision problems from both domain exploration and expert assistance, which guarantees convergence to near-optimal behaviour, even when the agent begins unaware of factors critical to success. Our experiments show our agent learns optimal behaviour on small and large problems, and that conserving information on discovering new possibilities results in faster convergence.
Original language | English |
---|---|
Title of host publication | Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, UAI 2019 |
Subtitle of host publication | Tel Aviv, Israel, July 22-25, 2019 |
Place of Publication | Tel Aviv, Israel |
Number of pages | 11 |
Publication status | Published - 22 Jul 2019 |
Event | 35th Conference on Uncertainty in Artificial Intelligence, UAI 2019 - Tel Aviv, Israel Duration: 22 Jul 2019 → 25 Jul 2019 http://auai.org/uai2019/ |
Conference
Conference | 35th Conference on Uncertainty in Artificial Intelligence, UAI 2019 |
---|---|
Abbreviated title | UAI 2019 |
Country/Territory | Israel |
City | Tel Aviv |
Period | 22/07/19 → 25/07/19 |
Internet address |
Fingerprint
Dive into the research topics of 'Learning Factored Markov Decision Processes with Unawareness'. Together they form a unique fingerprint.Projects
- 1 Finished