当前位置: X-MOL 学术Stoch. Process. their Appl. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Partial derivative with respect to the measure and its application to general controlled mean-field systems
Stochastic Processes and their Applications ( IF 1.4 ) Pub Date : 2021-01-22 , DOI: 10.1016/j.spa.2021.01.003
Rainer Buckdahn , Yajie Chen , Juan Li

Let (E,E) be an arbitrary measurable space. The paper first focuses on studying the partial derivative of a function f:P2,0(Rd×E)R defined on the space of probability measures μ over (Rd×E,B(Rd)E) whose first marginal μ1μ(×E) has a finite second order moment. This partial derivative is taken with respect to q(dx,z), where μ has the disintegration μ(dxdz)=q(dx,z)μ2(dz) with respect to its second marginal μ2()=μ(Rd×). Simplifying the language, we will speak of the derivative with respect to the law μ conditioned to its second marginal. Our results extend those of the derivative of a function g:P2(Rd)R over the space of probability measures with finite second order moment by P.L. Lions (see Lions (2013)) but cover also as a particular case recent approaches considering E=Rk and supposing the differentiability of f over P2(Rd×Rk), in order to use the derivative μf to define the partial derivative (μf)1. The second part of the paper focuses on investigating a stochastic maximum principle, where the controlled state process is driven by a general mean-field stochastic differential equation with partial information. The control set is just supposed to be a measurable space, and the coefficients of the controlled system, i.e., those of the dynamics as well as of the cost functional, depend on the controlled state process X, the control v, a partial information on X, as well as on the joint law of (X,v). Through considering a new second-order variational equation and the corresponding second-order adjoint equation, and a totally new method to prove the estimate for the solution of the first-order variational equation, the optimal principle is proved through spike variation of an optimal control and with the help of the tailor-made form of second-order expansion. We emphasize that in our assumptions we do not need any regularity of the coefficients neither in the control variable nor with respect to the law of the control process.



中文翻译:

关于度量的偏导数及其在一般受控平均场系统中的应用

ËË是一个任意可测量的空间。本文首先着重研究函数的偏导数FP20[Rd×Ë[R 在概率测度空间上定义 μ 过度 [Rd×Ë[RdË 他的第一边际 μ1个μ×Ë具有有限的二阶矩 该偏导数是相对于qdXž,在哪里 μ 有瓦解 μdXdž=qdXžμ2dž 关于其第二边缘 μ2=μ[Rd×。简化语言,我们将谈谈关于法律的派生词μ限制在第二边缘。我们的结果扩展了函数导数的结果GP2[Rd[R PL Lions(参见Lions(2013))在具有有限二阶矩的概率测度空间上进行了研究,但作为一个特殊案例也涵盖了最近的方法 Ë=[Rķ 并假设 F 过度 P2[Rd×[Rķ,以便使用导数 μF 定义偏导数 μF1个。本文的第二部分着重研究随机最大原理,其中控制状态过程由具有部分信息的一般平均场随机微分方程驱动。控制集只是一个可测量的空间,受控系统的系数(即动力学系数和成本函数的系数)取决于受控状态过程X,控制 v,有关的部分信息 X以及关于 Xv。通过考虑一个新的二阶变分方程和相应的二阶伴随方程,以及一种全新的方法来证明一阶变分方程解的估计,通过最优控制的尖峰变化来证明最优原理。并借助量身定制的二阶扩展形式。我们强调,在我们的假设中,我们既不需要控制变量中的系数的任何规律性,也不需要控制过程的规律方面的系数。

更新日期:2021-02-03
down
wechat
bug