Autonomous Learning of Features for Control: Experiments with Embodied and Situated Agents,arXiv - CS - Artificial Intelligence

当前位置： X-MOL 学术 › arXiv.cs.AI › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Autonomous Learning of Features for Control: Experiments with Embodied and Situated Agents
arXiv - CS - Artificial Intelligence Pub Date : 2020-09-15 , DOI: arxiv-2009.07132
Nicola Milano, Stefano Nolfi

As discussed in previous studies, the efficacy of evolutionary or reinforcement learning algorithms for continuous control optimization can be enhanced by including a neural module dedicated to feature extraction trained through self-supervised methods. In this paper we report additional experiments supporting this hypothesis and we demonstrate how the advantage provided by feature extraction is not limited to problems that benefit from dimensionality reduction or that involve agents operating on the basis of allocentric perception. We introduce a method that permits to continue the training of the feature-extraction module during the training of the policy network and that increases the efficacy of feature extraction. Finally, we compare alternative feature-extracting methods and we show that sequence-to-sequence learning yields better results than the methods considered in previous studies.

中文翻译：

控制特征的自主学习：具体化和定位代理的实验

正如之前的研究中所讨论的那样，通过包含一个专用于通过自监督方法训练的特征提取的神经模块，可以增强进化或强化学习算法对连续控制优化的功效。在本文中，我们报告了支持这一假设的其他实验，并展示了特征提取提供的优势如何不仅限于受益于降维或涉及基于异中心感知操作的代理的问题。我们引入了一种方法，允许在策略网络训练期间继续训练特征提取模块，并提高特征提取的效率。最后，

更新日期：2020-09-16

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>