Sharp Waiting-Time Bounds for Multiserver Jobs,arXiv - CS - Performance

当前位置： X-MOL 学术 › arXiv.cs.PF › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Sharp Waiting-Time Bounds for Multiserver Jobs
arXiv - CS - Performance Pub Date : 2021-09-11 , DOI: arxiv-2109.05343
Yige Hong, Weina Wang

Multiserver jobs, which are jobs that occupy multiple servers simultaneously during service, are prevalent in today's computing clusters. But little is known about the delay performance of systems with multiserver jobs. We consider queueing models for multiserver jobs in a scaling regime where the total number of servers in the system becomes large and meanwhile both the system load and the number of servers that a job needs scale with the total number of servers. Prior work has derived upper bounds on the queueing probability in this scaling regime. However, without proper lower bounds, the existing results cannot be used to differentiate between policies. In this paper, we study the delay performance by establishing sharp bounds on the mean waiting time of multiserver jobs, where the waiting time of a job is the time spent in queueing rather than in service. We first consider the commonly used First-Come-First-Serve (FCFS) policy and characterize the exact order of its mean waiting time. We then prove a lower bound on the mean waiting time of all policies, and demonstrate that there is an order gap between this lower bound and the mean waiting time under FCFS. We finally complement the lower bound with an achievability result: we show that under a priority policy that we call P-Priority, the mean waiting time achieves the order of the lower bound. This achievability result implies the tightness of the lower bound, the asymptotic optimality of P-Priority, and the strict suboptimality of FCFS.

中文翻译：

多服务器作业的明显等待时间限制

多服务器作业，即在服务期间同时占用多台服务器的作业，在当今的计算集群中很普遍。但是对于具有多服务器作业的系统的延迟性能知之甚少。我们考虑了多服务器作业的队列模型，其中系统中的服务器总数变大，同时系统负载和作业所需的服务器数量随服务器总数而扩展。先前的工作已经推导出了这种缩放机制中排队概率的上限。然而，如果没有合适的下限，现有的结果就不能用于区分策略。在本文中，我们通过建立多服务器作业的平均等待时间的尖锐界限来研究延迟性能，其中工作的等待时间是排队而不是服务所花费的时间。我们首先考虑常用的先到先服务 (FCFS) 策略并描述其平均等待时间的确切顺序。然后我们证明所有策略的平均等待时间的下限，并证明该下限与 FCFS 下的平均等待时间之间存在订单差距。我们最终用可实现性结果补充下限：我们表明，在我们称为 P-Priority 的优先级策略下，平均等待时间达到了下限的顺序。这个可实现性结果暗示了下界的严格性、P-Priority 的渐近最优性以及 FCFS 的严格次优性。我们首先考虑常用的先到先服务 (FCFS) 策略并描述其平均等待时间的确切顺序。然后我们证明所有策略的平均等待时间的下限，并证明该下限与 FCFS 下的平均等待时间之间存在订单差距。我们最终用可实现性结果补充下限：我们表明，在我们称为 P-Priority 的优先级策略下，平均等待时间达到了下限的顺序。这个可实现性结果暗示了下界的严格性、P-Priority 的渐近最优性以及 FCFS 的严格次优性。我们首先考虑常用的先到先服务 (FCFS) 策略并描述其平均等待时间的确切顺序。然后我们证明所有策略的平均等待时间的下限，并证明该下限与 FCFS 下的平均等待时间之间存在订单差距。我们最终用可实现性结果补充下限：我们表明，在我们称为 P-Priority 的优先级策略下，平均等待时间达到了下限的顺序。这个可实现性结果暗示了下界的严格性、P-Priority 的渐近最优性以及 FCFS 的严格次优性。我们最终用可实现性结果补充下限：我们表明，在我们称为 P-Priority 的优先级策略下，平均等待时间达到了下限的顺序。这个可实现性结果暗示了下界的严格性、P-Priority 的渐近最优性以及 FCFS 的严格次优性。我们最终用可实现性结果补充下限：我们表明，在我们称为 P-Priority 的优先级策略下，平均等待时间达到了下限的顺序。这个可实现性结果暗示了下界的严格性、P-Priority 的渐近最优性以及 FCFS 的严格次优性。

更新日期：2021-09-14

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>