Urgency-aware Optimal Routing in Repeated Games through Artificial Currencies,arXiv - CS - Systems and Control

当前位置： X-MOL 学术 › arXiv.cs.SY › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Urgency-aware Optimal Routing in Repeated Games through Artificial Currencies
arXiv - CS - Systems and Control Pub Date : 2020-11-23 , DOI: arxiv-2011.11595
Mauro SalazarMaurice, Dario PaccagnanMaurice, Andrea AgazziMaurice, W. P. M. H.Maurice, Heemels

When people choose routes minimizing their individual delay, the aggregate congestion can be much higher compared to that experienced by a centrally-imposed routing. Yet centralized routing is incompatible with the presence of self-interested agents. How can we reconcile the two? In this paper we address this question within a repeated game framework and propose a fair incentive mechanism based on artificial currencies that routes selfish agents in a system-optimal fashion, while accounting for their temporal preferences. We instantiate the framework in a parallel-network whereby agents commute repeatedly (e.g., daily) from a common start node to the end node. Thereafter, we focus on the specific two-arcs case whereby, based on an artificial currency, the agents are charged when traveling on the first, fast arc, whilst they are rewarded when traveling on the second, slower arc. We assume the agents to be rational and model their choices through a game where each agent aims at minimizing a combination of today's discomfort, weighted by their urgency, and the average discomfort encountered for the rest of the period (e.g., a week). We show that, if prices of artificial currencies are judiciously chosen, the routing pattern converges to a system-optimal solution, while accommodating the agents' urgency. We complement our study through numerical simulations. Our results show that it is possible to achieve a system-optimal solution whilst reducing the agents' perceived discomfort by 14-20% when compared to a centralized optimal but urgency-unaware policy.

中文翻译：

通过人工货币在重复游戏中的应急意识最优路由

当人们选择最小化各自延迟的路由时，总拥塞可能比集中施加的路由要高得多。然而，集中式路由与自利代理的存在不兼容。我们如何调和两者？在本文中，我们在重复的博弈框架内解决了这个问题，并提出了一种基于人工货币的公平激励机制，该机制以系统最优的方式路由自私行为者，同时考虑了他们的时间偏好。我们在并行网络中实例化框架，代理通过该网络从公共起始节点到结束节点重复（例如每天）通勤。之后，我们将重点放在特定的两弧度情况下，在这种情况下，基于人工货币，代理商在第一条快速弧上行驶时需要向代理商收取费用，当他们沿着第二个较慢的弧线行驶时会获得奖励。我们假设代理人是理性的，并通过一个游戏来模拟他们的选择，其中每个代理人的目标是最大程度地减少今天的不适感（由他们的紧迫性加权）和在剩余时间（例如一周）内遇到的平均不适感。我们表明，如果明智地选择了人工货币的价格，则路由模式将收敛到系统最优解决方案，同时满足代理商的紧急性。我们通过数值模拟对研究进行补充。我们的结果表明，与集中的优化但不紧急的策略相比，可以实现系统最佳的解决方案，同时将座席的不适感降低14-20％。我们假设代理人是理性的，并通过一个游戏来模拟他们的选择，其中每个代理人的目标是最大程度地减少今天的不适感（由他们的紧迫性加权）和在剩余时间（例如一周）内遇到的平均不适感。我们表明，如果明智地选择了人工货币的价格，则路由模式将收敛到系统最优解决方案，同时满足代理商的紧急性。我们通过数值模拟对研究进行补充。我们的结果表明，与集中的优化但不紧急的策略相比，可以实现系统最佳的解决方案，同时将座席的不适感降低14-20％。我们假设代理人是理性的，并通过一个游戏来模拟他们的选择，其中每个代理人的目标是最大程度地减少今天的不适感（由他们的紧迫性加权）和在剩余时间（例如一周）内遇到的平均不适感。我们表明，如果明智地选择了人工货币的价格，则路由模式将收敛到系统最优解决方案，同时满足代理商的紧急性。我们通过数值模拟对研究进行补充。我们的结果表明，与集中的优化但不紧急的策略相比，可以实现系统最佳的解决方案，同时将座席的不适感降低14-20％。其余时间段（例如一周）的平均不适感。我们表明，如果明智地选择了人工货币的价格，则路由模式将收敛到系统最优解决方案，同时满足代理商的紧急性。我们通过数值模拟对研究进行补充。我们的结果表明，与集中的优化但不紧急的策略相比，可以实现系统最佳的解决方案，同时将座席的不适感降低14-20％。在此期间的剩余时间内（例如一周）平均感到不适。我们表明，如果明智地选择了人工货币的价格，则路由模式将收敛到系统最优解决方案，同时满足代理商的紧急性。我们通过数值模拟对研究进行补充。我们的结果表明，与集中的优化但不紧急的策略相比，可以实现系统最佳的解决方案，同时将座席的不适感降低14-20％。

更新日期：2020-11-25

点击分享查看原文

点击收藏

阅读更多本刊最新论文