Nonasymptotic sequential tests for overlapping hypotheses applied to near-optimal arm identification in bandit models,Sequential Analysis

当前位置： X-MOL 学术 › Sequ. Anal. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Nonasymptotic sequential tests for overlapping hypotheses applied to near-optimal arm identification in bandit models
Sequential Analysis ( IF 0.6 ) Pub Date : 2021-03-11 , DOI: 10.1080/07474946.2021.1847965
Aurélien Garivier ₁ , Emilie Kaufmann ₂

Affiliation

Abstract

In this article, we study sequential testing problems with overlapping hypotheses. We first focus on the simple problem of assessing if the mean μ of a Gaussian distribution is smaller or larger than a fixed $ϵ > 0;$ if $μ \in (- ϵ, ϵ),$ both answers are considered to be correct. Then, we consider probably approximately correct best arm identification in a bandit model: given K probability distributions on $R$ with means $μ_{1}, \dots, μ_{K},$ we derive the asymptotic complexity of identifying, with risk at most δ, an index $I \in {1, \dots, K}$ such that $μ_{I} \geq \max_{i} μ_{i} - ϵ .$ We provide nonasymptotic bounds on the error of a parallel general likelihood ratio test, which can also be used for more general testing problems. We further propose a lower bound on the number of observations needed to identify a correct hypothesis. Those lower bounds rely on information-theoretic arguments, and specifically on two versions of a change of measure lemma (a high-level form and a low-level form) whose relative merits are discussed.

中文翻译：

重叠假设的非渐近顺序检验应用于匪徒模型中的近乎最佳手臂识别

摘要

在本文中，我们研究具有重叠假设的顺序测试问题。我们首先关注评估高斯分布的均值μ小于或大于固定点的简单问题 $ϵ > 0;$ 如果 $μ \in （ - ϵ ， ϵ ），$ 这两个答案都被认为是正确的。然后，我们考虑在强盗模型中可能近似正确的最佳手臂识别：给定K个概率分布 $[R$ 用手段 $μ_{1个} ， \dots ， μ_{ķ} ，$ 我们得出了风险最大为δ的指标的识别的渐近复杂度 $一世 \in {1个， \dots ， ķ}$ 这样 $μ_{一世} \geq {最大限度}_{一世} μ_{一世} - ϵ 。$ 我们提供了关于并行一般似然比检验的误差的非渐近边界，它也可以用于更一般的检验问题。我们进一步提出了确定正确假设所需的观测值数量的下限。这些下限依赖于信息理论论证，特别是依赖于衡量度量引理变化的两个版本（高级形式和低级形式），它们的相对优点在此进行了讨论。

更新日期：2021-03-12

点击分享查看原文

点击收藏

阅读更多本刊最新论文