Projects per year
Abstract
We study countably infinite Markov decision processes with Büchi objectives, which ask to visit a given subset F of states infinitely often. A question left open by T.P. Hill in 1979 [10] is whether there always exist ε-optimal Markov strategies, i.e., strategies that base decisions only on the current state and the number of steps taken so far. We provide a negative answer to this question by constructing a non-trivial counterexample. On the other hand, we show that Markov strategies with only 1 bit of extra memory are sufficient.
Original language | English |
---|---|
Title of host publication | 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019) |
Editors | Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, Stefano Leonardi |
Publisher | Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, Germany |
Pages | 119:1–119:14 |
Number of pages | 27 |
Volume | 132 |
ISBN (Electronic) | 978-3-95977-109-2 |
DOIs | |
Publication status | E-pub ahead of print - 12 Jul 2019 |
Event | 46th International Colloquium on Automata, Languages and Programming - Patras, Greece Duration: 8 Jul 2019 → 12 Jul 2019 https://icalp2019.upatras.gr/index.php#welcome |
Publication series
Name | LIPICS |
---|---|
Publisher | Schloss Dagstuhl - Leibniz-Zentrum für Informatik |
Volume | 132 |
ISSN (Electronic) | 1868-8969 |
Conference
Conference | 46th International Colloquium on Automata, Languages and Programming |
---|---|
Abbreviated title | ICALP 2019 |
Country/Territory | Greece |
City | Patras |
Period | 8/07/19 → 12/07/19 |
Internet address |
Keywords
- math.PR
- cs.FL
- math.OC
- Markov decision process
- Random walks
- Markov chains
- Probability
- Statistics
Fingerprint
Dive into the research topics of 'Büchi Objectives in Countable MDPs'. Together they form a unique fingerprint.Projects
- 1 Finished