Optimal control of conditioned processes with feedback controls,Journal de Mathématiques Pures et Appliquées

当前位置： X-MOL 学术 › J. Math. Pures Appl. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Optimal control of conditioned processes with feedback controls
Journal de Mathématiques Pures et Appliquées ( IF 2.3 ) Pub Date : 2020-07-24 , DOI: 10.1016/j.matpur.2020.07.014
Yves Achdou , Mathieu Laurière , Pierre-Louis Lions

We consider a class of closed loop stochastic optimal control problems in finite time horizon, in which the cost is an expectation conditional on the event that the process has not exited a given bounded domain. An important difficulty is that the probability of the event that conditionates the strategy decays as time grows. The optimality conditions consist of a system of partial differential equations, including a Hamilton-Jacobi-Bellman equation (backward w.r.t. time) and a (forward w.r.t. time) Fokker-Planck equation for the law of the conditioned process. The two equations are supplemented with Dirichlet conditions. Next, we discuss the asymptotic behavior as the time horizon tends to +∞. This leads to a new kind of optimal control problem driven by an eigenvalue problem related to a continuity equation with Dirichlet conditions on the boundary. We prove existence for the latter. We also propose numerical methods and supplement the various theoretical aspects with simulations.

中文翻译：

带有反馈控制的条件过程的最佳控制

我们考虑了有限时间范围内的一类闭环随机最优控制问题，其中成本是对过程没有离开给定有界域的事件的期望。一个重要的困难是，随着时间的增长，决定策略的事件的概率会下降。最佳条件由偏微分方程组组成，其中包括Hamilton-Jacobi-Bellman方程（后向时间）和（前向时间）Fokker-Planck方程，用于条件过程定律。这两个方程用Dirichlet条件进行了补充。接下来，我们讨论随着时间范围趋于+∞的渐近行为。这导致了一种新的最优控制问题，该问题由与边界上具有Dirichlet条件的连续性方程有关的特征值问题驱动。我们证明后者的存在。我们还提出了数值方法，并通过仿真补充了各种理论方面。

更新日期：2020-07-24

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>