Asymptotically optimal strategies for online prediction with history-dependent experts,arXiv - CS - Computer Science and Game Theory

当前位置： X-MOL 学术 › arXiv.cs.GT › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Asymptotically optimal strategies for online prediction with history-dependent experts
arXiv - CS - Computer Science and Game Theory Pub Date : 2020-08-31 , DOI: arxiv-2008.13703
Jeff Calder and Nadejda Drenska

We establish sharp asymptotically optimal strategies for the problem of online prediction with history dependent experts. The prediction problem is played (in part) over a discrete graph called the $d$ dimensional de Bruijn graph, where $d$ is the number of days of history used by the experts. Previous work [11] established $O(\varepsilon)$ optimal strategies for $n=2$ experts and $d\leq 4$ days of history, while [10] established $O(\varepsilon^{1/3})$ optimal strategies for all $n\geq 2$ and all $d\geq 1$, where the game is played for $N$ steps and $\varepsilon=N^{-1/2}$. In this paper, we show that the optimality conditions over the de Bruijn graph correspond to a graph Poisson equation, and we establish $O(\varepsilon)$ optimal strategies for all values of $n$ and $d$.

中文翻译：

与历史相关专家在线预测的渐近最优策略

我们为历史依赖专家的在线预测问题建立了尖锐的渐近最优策略。预测问题（部分地）在一个称为 $d$ 维 de Bruijn 图的离散图上进行，其中 $d$ 是专家使用的历史天数。先前的工作 [11] 为 $n=2$ 专家和 $d\leq 4$ 天的历史建立了 $O(\varepsilon)$ 最优策略，而 [10] 建立了 $O(\varepsilon^{1/3}) $ 对所有 $n\geq 2$ 和所有 $d\geq 1$ 的最优策略，其中游戏进行了 $N$ 步和 $\varepsilon=N^{-1/2}$。在本文中，我们证明了 de Bruijn 图上的最优性条件对应于图泊松方程，并且我们为 $n$ 和 $d$ 的所有值建立了 $O(\varepsilon)$ 最优策略。

更新日期：2020-09-01

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>