Arena-Independent Finite-Memory Determinacy in Stochastic Games, arXiv - CS - Formal Languages and Automata Theory



Arena-Independent Finite-Memory Determinacy in Stochastic Games
arXiv - CS - Formal Languages and Automata Theory. Pub Date: 2021-02-19, DOI: arxiv-2102.10104
Patricia Bouyer, Youssouf Oualhadj, Mickael Randour, Pierre Vandenhove

We study stochastic zero-sum games on graphs, which are prevalent tools to model decision-making in the presence of an antagonistic opponent in a random environment. In this setting, an important question is that of strategy complexity: what kinds of strategies are sufficient or required to play optimally (e.g., randomization or memory requirements)? Our contributions further the understanding of arena-independent finite-memory (AIFM) determinacy, i.e., the study of objectives for which memory is needed, but in a way that only depends on limited parameters of the game graphs. First, we show that objectives for which pure AIFM strategies suffice to play optimally also admit pure AIFM subgame perfect strategies. Second, we show that we can reduce the study of objectives for which pure AIFM strategies suffice in two-player stochastic games to the easier study of one-player stochastic games (i.e., Markov decision processes). Third, we characterize the sufficiency of AIFM strategies through two intuitive properties of objectives. This work extends a line of research started on deterministic games in [BLO+20] to stochastic ones.

[BLO+20] Patricia Bouyer, Stéphane Le Roux, Youssouf Oualhadj, Mickael Randour, and Pierre Vandenhove. Games Where You Can Play Optimally with Arena-Independent Finite Memory. CONCUR 2020.
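To make the notion of an arena-independent finite-memory strategy more concrete, here is a minimal Python sketch. It assumes one common formalization that the abstract itself does not spell out: a "memory skeleton" is a finite automaton that reads the colors seen along a play, and a strategy on a given arena is a table from (arena state, memory state) to an action. All names (`MemorySkeleton`, `FiniteMemoryStrategy`, the colors `a`/`b`, the arena states and actions) are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Hashable, Iterable, Tuple

Color = Hashable
MemState = Hashable
State = Hashable
Action = Hashable


@dataclass(frozen=True)
class MemorySkeleton:
    """Finite automaton over colors; it knows nothing about any arena."""
    initial: MemState
    update: Dict[Tuple[MemState, Color], MemState]

    def run(self, colors: Iterable[Color]) -> MemState:
        # Fold the update function over the colors seen so far.
        m = self.initial
        for c in colors:
            m = self.update[(m, c)]
        return m


@dataclass(frozen=True)
class FiniteMemoryStrategy:
    """Pure strategy on one arena, built on top of a fixed skeleton."""
    skeleton: MemorySkeleton
    next_action: Dict[Tuple[State, MemState], Action]

    def play(self, history_colors: Iterable[Color], current: State) -> Action:
        # The choice depends only on the current state and the memory state.
        m = self.skeleton.run(history_colors)
        return self.next_action[(current, m)]


# Hypothetical skeleton: remember whether color "a" has been seen.
seen_a = MemorySkeleton(
    initial="no",
    update={
        ("no", "a"): "yes", ("no", "b"): "no",
        ("yes", "a"): "yes", ("yes", "b"): "yes",
    },
)

# Two different (hypothetical) arenas reuse the same skeleton; only the
# next-action tables differ. That reuse is the "arena-independent" part.
strategy_arena_1 = FiniteMemoryStrategy(
    seen_a, {("s0", "no"): "go_a", ("s0", "yes"): "go_b"}
)
strategy_arena_2 = FiniteMemoryStrategy(
    seen_a,
    {
        ("t0", "no"): "left", ("t0", "yes"): "right",
        ("t1", "no"): "left", ("t1", "yes"): "left",
    },
)
```

Under these assumptions, an objective would be AIFM-determined if a single skeleton such as `seen_a` supports optimal strategies on every arena, with only the `next_action` table changing from one arena to the next.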


Updated: 2021-02-22
