当前位置: X-MOL 学术arXiv.cs.SY › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Maximum Likelihood Constraint Inference from Stochastic Demonstrations
arXiv - CS - Systems and Control Pub Date : 2021-02-24 , DOI: arxiv-2102.12554
David L. McPherson, Kaylene C. Stocking, S. Shankar Sastry

When an expert operates a perilous dynamic system, ideal constraint information is tacitly contained in their demonstrated trajectories and controls. The likelihood of these demonstrations can be computed, given the system dynamics and task objective, and the maximum likelihood constraints can be identified. Prior constraint inference work has focused mainly on deterministic models. Stochastic models, however, can capture the uncertainty and risk tolerance that are often present in real systems of interest. This paper extends maximum likelihood constraint inference to stochastic applications by using maximum causal entropy likelihoods. Furthermore, we propose an efficient algorithm that computes constraint likelihood and risk tolerance in a unified Bellman backup, allowing us to generalize to stochastic systems without increasing computational complexity.

中文翻译:

随机论证的最大似然约束推论

当专家操作危险的动态系统时,理想的约束信息将默认包含在他们演示的轨迹和控制中。给定系统动力学和任务目标,可以计算出这些演示的可能性,并且可以确定最大可能性约束。先验约束推理工作主要集中在确定性模型上。但是,随机模型可以捕获实际感兴趣的系统中经常存在的不确定性和风险承受能力。本文通过使用最大因果熵似然将最大似然约束推论扩展到随机应用中。此外,我们提出了一种有效的算法,可以在统一的Bellman备份中计算约束可能性和风险承受能力,
更新日期:2021-02-26
down
wechat
bug