A Discounted Approach in Communicating Average Markov Decision Chains Under Risk-Aversion,Journal of Optimization Theory and Applications

当前位置： X-MOL 学术 › J. Optim. Theory Appl. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

A Discounted Approach in Communicating Average Markov Decision Chains Under Risk-Aversion
Journal of Optimization Theory and Applications ( IF 1.6 ) Pub Date : 2020-10-07 , DOI: 10.1007/s10957-020-01758-y
Julio Saucedo-Zul , Rolando Cavazos-Cadena , Hugo Cruz-Suárez

This work concerns with discrete-time Markov decision processes on a denumerable state space. Assuming that the decision maker is risk-averse with constant risk-sensitivity coefficient, the performance of a control policy is measured by an average criterion associated with a non-negative and bounded cost function. Under conditions ensuring that the optimal average cost is constant, but not necessarily determined via the average cost optimality equation, it is shown that a discounted criterion can be used to approximate the optimal average index.

中文翻译：

风险规避下交流平均马尔可夫决策链的折扣方法

这项工作涉及可枚举状态空间上的离散时间马尔可夫决策过程。假设决策者是具有恒定风险敏感系数的风险规避者，则控制策略的性能由与非负和有界成本函数相关联的平均标准来衡量。在保证最优平均成本恒定但不一定通过平均成本最优方程确定的条件下，表明可以使用贴现标准来近似最优平均指数。

更新日期：2020-10-07

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11