当前位置: X-MOL 学术arXiv.cs.GT › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Asymptotically optimal strategies for online prediction with history-dependent experts
arXiv - CS - Computer Science and Game Theory Pub Date : 2020-08-31 , DOI: arxiv-2008.13703
Jeff Calder and Nadejda Drenska

We establish sharp asymptotically optimal strategies for the problem of online prediction with history dependent experts. The prediction problem is played (in part) over a discrete graph called the $d$ dimensional de Bruijn graph, where $d$ is the number of days of history used by the experts. Previous work [11] established $O(\varepsilon)$ optimal strategies for $n=2$ experts and $d\leq 4$ days of history, while [10] established $O(\varepsilon^{1/3})$ optimal strategies for all $n\geq 2$ and all $d\geq 1$, where the game is played for $N$ steps and $\varepsilon=N^{-1/2}$. In this paper, we show that the optimality conditions over the de Bruijn graph correspond to a graph Poisson equation, and we establish $O(\varepsilon)$ optimal strategies for all values of $n$ and $d$.

中文翻译:

与历史相关专家在线预测的渐近最优策略

我们为历史依赖专家的在线预测问题建立了尖锐的渐近最优策略。预测问题(部分地)在一个称为 $d$ 维 de Bruijn 图的离散图上进行,其中 $d$ 是专家使用的历史天数。先前的工作 [11] 为 $n=2$ 专家和 $d\leq 4$ 天的历史建立了 $O(\varepsilon)$ 最优策略,而 [10] 建立了 $O(\varepsilon^{1/3}) $ 对所有 $n\geq 2$ 和所有 $d\geq 1$ 的最优策略,其中游戏进行了 $N$ 步和 $\varepsilon=N^{-1/2}$。在本文中,我们证明了 de Bruijn 图上的最优性条件对应于图泊松方程,并且我们为 $n$ 和 $d$ 的所有值建立了 $O(\varepsilon)$ 最优策略。
更新日期:2020-09-01
down
wechat
bug