Explore the Context: Optimal Data Collection for Context-Conditional Dynamics Models,arXiv - CS - Robotics

当前位置： X-MOL 学术 › arXiv.cs.RO › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Explore the Context: Optimal Data Collection for Context-Conditional Dynamics Models
arXiv - CS - Robotics Pub Date : 2021-02-22 , DOI: arxiv-2102.11394
Jan Achterhold, Joerg Stueckler

In this paper, we learn dynamics models for parametrized families of dynamical systems with varying properties. The dynamics models are formulated as stochastic processes conditioned on a latent context variable which is inferred from observed transitions of the respective system. The probabilistic formulation allows us to compute an action sequence which, for a limited number of environment interactions, optimally explores the given system within the parametrized family. This is achieved by steering the system through transitions being most informative for the context variable. We demonstrate the effectiveness of our method for exploration on a non-linear toy-problem and two well-known reinforcement learning environments.

中文翻译：

探索上下文：上下文条件动力学模型的最佳数据收集

在本文中，我们学习具有变化特性的动力学系统的参数化族的动力学模型。动力学模型被公式化为以潜在上下文变量为条件的随机过程，该变量是从观察到的各个系统的转换推断出来的。概率公式使我们能够计算一个动作序列，该动作序列在有限的环境交互作用下，可以最佳地探索参数化族内的给定系统。这可以通过引导系统通过对上下文变量最有帮助的转换来实现。我们证明了我们的方法对于探索非线性玩具问题和两个众所周知的强化学习环境的有效性。

更新日期：2021-02-24

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>