The Secretary Problem with Independent Sampling,arXiv - CS - Computer Science and Game Theory

当前位置： X-MOL 学术 › arXiv.cs.GT › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

The Secretary Problem with Independent Sampling
arXiv - CS - Computer Science and Game Theory Pub Date : 2020-11-16 , DOI: arxiv-2011.07869
Jos\'e Correa, Andr\'es Cristi, Laurent Feuilloley, Tim Oosterwijk and Alexandros Tsigonias-Dimitriadis

In the secretary problem we are faced with an online sequence of elements with values. Upon seeing an element we have to make an irrevocable take-it-or-leave-it decision. The goal is to maximize the probability of picking the element of maximum value. The most classic version of the problem is that in which the elements arrive in random order and their values are arbitrary. However, by varying the available information, new interesting problems arise. Also the case in which the arrival order is adversarial instead of random leads to interesting variants that have been considered in the literature. In this paper we study both the random order and adversarial order secretary problems with an additional twist. The values are arbitrary, but before starting the online sequence we independently sample each element with a fixed probability $p$. The sampled elements become our information or history set and the game is played over the remaining elements. We call these problems the random order secretary problem with $p$-sampling (ROS$p$ for short) and the adversarial order secretary problem with $p$-sampling (AOS$p$ for short). Our main result is to obtain best possible algorithms for both problems and all values of $p$. As $p$ grows to 1 the obtained guarantees converge to the optimal guarantees in the full information case. In the adversarial order setting, the best possible algorithm turns out to be a simple fixed threshold algorithm in which the optimal threshold is a function of $p$ only. In the random order setting we prove that the best possible algorithm is characterized by a fixed sequence of time thresholds, dictating at which point in time we should start accepting a value that is both a maximum of the online sequence and has a given ranking within the sampled elements.

中文翻译：

独立抽样的秘书问题

在秘书问题中，我们面临一个带有值的在线元素序列。在看到一个元素时，我们必须做出不可撤销的接受或放弃的决定。目标是最大化选择最大值元素的概率。该问题最经典的版本是元素以随机顺序到达并且它们的值是任意的。然而，通过改变可用信息，新的有趣问题出现了。此外，到达顺序是对抗性而不是随机的情况会导致文献中考虑过的有趣变体。在本文中，我们研究了随机顺序和对抗性顺序秘书问题，并进行了额外的研究。这些值是任意的，但在开始在线序列之前，我们以固定概率 $p$ 独立采样每个元素。采样的元素成为我们的信息或历史集，游戏在剩余的元素上进行。我们将这些问题称为带有 $p$-sampling（简称 ROS$p$）的随机订单秘书问题和带有 $p$-sampling（简称 AOS$p$）的对抗订单秘书问题。我们的主要结果是为两个问题和 $p$ 的所有值获得最佳算法。随着 $p$ 增长到 1，获得的保证收敛到全信息情况下的最佳保证。在对抗性顺序设置中，最好的算法被证明是一个简单的固定阈值算法，其中最佳阈值仅是 $p$ 的函数。在随机顺序设置中，我们证明了最好的算法的特点是固定的时间阈值序列，

更新日期：2020-11-17

点击分享查看原文

点击收藏

阅读更多本刊最新论文