当前位置:
X-MOL 学术
›
arXiv.cs.SY
›
论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Maximum Likelihood Constraint Inference from Stochastic Demonstrations
arXiv - CS - Systems and Control Pub Date : 2021-02-24 , DOI: arxiv-2102.12554 David L. McPherson, Kaylene C. Stocking, S. Shankar Sastry
arXiv - CS - Systems and Control Pub Date : 2021-02-24 , DOI: arxiv-2102.12554 David L. McPherson, Kaylene C. Stocking, S. Shankar Sastry
When an expert operates a perilous dynamic system, ideal constraint
information is tacitly contained in their demonstrated trajectories and
controls. The likelihood of these demonstrations can be computed, given the
system dynamics and task objective, and the maximum likelihood constraints can
be identified. Prior constraint inference work has focused mainly on
deterministic models. Stochastic models, however, can capture the uncertainty
and risk tolerance that are often present in real systems of interest. This paper extends maximum likelihood constraint inference to stochastic
applications by using maximum causal entropy likelihoods. Furthermore, we
propose an efficient algorithm that computes constraint likelihood and risk
tolerance in a unified Bellman backup, allowing us to generalize to stochastic
systems without increasing computational complexity.
中文翻译:
随机论证的最大似然约束推论
当专家操作危险的动态系统时,理想的约束信息将默认包含在他们演示的轨迹和控制中。给定系统动力学和任务目标,可以计算出这些演示的可能性,并且可以确定最大可能性约束。先验约束推理工作主要集中在确定性模型上。但是,随机模型可以捕获实际感兴趣的系统中经常存在的不确定性和风险承受能力。本文通过使用最大因果熵似然将最大似然约束推论扩展到随机应用中。此外,我们提出了一种有效的算法,可以在统一的Bellman备份中计算约束可能性和风险承受能力,
更新日期:2021-02-26
中文翻译:
随机论证的最大似然约束推论
当专家操作危险的动态系统时,理想的约束信息将默认包含在他们演示的轨迹和控制中。给定系统动力学和任务目标,可以计算出这些演示的可能性,并且可以确定最大可能性约束。先验约束推理工作主要集中在确定性模型上。但是,随机模型可以捕获实际感兴趣的系统中经常存在的不确定性和风险承受能力。本文通过使用最大因果熵似然将最大似然约束推论扩展到随机应用中。此外,我们提出了一种有效的算法,可以在统一的Bellman备份中计算约束可能性和风险承受能力,