The value of randomized strategies in distributionally robust risk averse network interdiction games,arXiv - CS - Computer Science and Game Theory

当前位置： X-MOL 学术 › arXiv.cs.GT › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

The value of randomized strategies in distributionally robust risk averse network interdiction games
arXiv - CS - Computer Science and Game Theory Pub Date : 2020-03-17 , DOI: arxiv-2003.07915
Utsav Sadana, Erick Delage

Conditional Value at Risk (CVaR) is widely used to account for the preferences of a risk-averse agent in the extreme loss scenarios. To study the effectiveness of randomization in interdiction games with an interdictor that is both risk and ambiguity averse, we introduce a distributionally robust network interdiction game where the interdictor randomizes over the feasible interdiction plans in order to minimize the worst-case CVaR of the flow with respect to both the unknown distribution of the capacity of the arcs and his mixed strategy over interdicted arcs. The flow player, on the contrary, maximizes the total flow in the network. By using the budgeted uncertainty set, we control the degree of conservatism in the model and reformulate the interdictor's non-linear problem as a bi-convex optimization problem. For solving this problem to any given optimality level, we devise a spatial branch and bound algorithm that uses the McCormick inequalities and reduced reformulation linearization technique (RRLT) to obtain convex relaxation of the problem. We also develop a column generation algorithm to identify the optimal support of the convex relaxation which is then used in the coordinate descent algorithm to determine the upper bounds. The efficiency and convergence of the spatial branch and bound algorithm is established in the numerical experiments. Further, our numerical experiments show that randomized strategies can have significantly better in-sample and out-of-sample performance than optimal deterministic ones.

中文翻译：

随机策略在分布稳健的风险规避网络阻断博弈中的价值

条件风险价值 (CVaR) 被广泛用于解释风险规避代理在极端损失情况下的偏好。为了研究具有既厌恶风险又厌恶歧义的拦截器的拦截游戏中随机化的有效性，我们引入了一个分布鲁棒的网络拦截游戏，其中拦截器随机化可行的拦截计划，以最小化流量的最坏情况 CVaR关于弧线容量的未知分布和他对被拦截弧线的混合策略。相反，流量玩家最大化网络中的总流量。通过使用预算不确定性集，我们控制模型中的保守程度，并将拦截器的非线性问题重新表述为双凸优化问题。为了将这个问题解决到任何给定的最优水平，我们设计了一种空间分支定界算法，该算法使用 McCormick 不等式和简化的重构线性化技术 (RRLT) 来获得问题的凸松弛。我们还开发了一种列生成算法来确定凸松弛的最佳支持，然后在坐标下降算法中使用它来确定上限。在数值实验中建立了空间分支定界算法的效率和收敛性。此外，我们的数值实验表明，与最佳确定性策略相比，随机策略可以具有明显更好的样本内和样本外性能。我们设计了一种空间分支定界算法，该算法使用 McCormick 不等式和简化重构线性化技术 (RRLT) 来获得问题的凸松弛。我们还开发了一种列生成算法来确定凸松弛的最佳支持，然后在坐标下降算法中使用它来确定上限。在数值实验中建立了空间分支定界算法的效率和收敛性。此外，我们的数值实验表明，与最佳确定性策略相比，随机策略可以具有明显更好的样本内和样本外性能。我们设计了一种空间分支定界算法，该算法使用 McCormick 不等式和简化重构线性化技术 (RRLT) 来获得问题的凸松弛。我们还开发了一种列生成算法来确定凸松弛的最佳支持，然后在坐标下降算法中使用它来确定上限。在数值实验中建立了空间分支定界算法的效率和收敛性。此外，我们的数值实验表明，与最佳确定性策略相比，随机策略可以具有明显更好的样本内和样本外性能。我们还开发了一种列生成算法来确定凸松弛的最佳支持，然后在坐标下降算法中使用它来确定上限。在数值实验中建立了空间分支定界算法的效率和收敛性。此外，我们的数值实验表明，与最佳确定性策略相比，随机策略可以具有明显更好的样本内和样本外性能。我们还开发了一种列生成算法来确定凸松弛的最佳支持，然后在坐标下降算法中使用它来确定上限。在数值实验中建立了空间分支定界算法的效率和收敛性。此外，我们的数值实验表明，与最佳确定性策略相比，随机策略可以具有明显更好的样本内和样本外性能。

更新日期：2020-03-19

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>