Online state space generation by a growing self-organizing map and differential learning for reinforcement learning,Applied Soft Computing

当前位置： X-MOL 学术 › Appl. Soft Comput. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Online state space generation by a growing self-organizing map and differential learning for reinforcement learning
Applied Soft Computing ( IF 7.2 ) Pub Date : 2020-09-16 , DOI: 10.1016/j.asoc.2020.106723
Akira Notsu , Koji Yasuda , Seiki Ubukata , Katsuhiro Honda

In this research, we develop a method integrating a growing self-organizing map and differential learning system for online reinforcement learning which adaptively builds the state structure. In the conventional method, models and information on the environment are required beforehand, whereas the proposed method automatically estimates the state transitions from differentials of input signals and from these builds the state space without reference to prior information on the environment. Also, since it is an online learning method, the proposed method requires less computation and no batch memory. Through numerical experiments, we show that the proposed method has the same performance as the conventional method with the information given and that the learning time is shortened by abstraction of the state space.

中文翻译：

通过不断增长的自组织图和差分学习进行在线状态空间生成，以进行强化学习

在这项研究中，我们开发了一种集成增长的自组织图和差分学习系统的方法，用于在线强化学习，自适应地建立状态结构。在传统方法中，事先需要有关环境的模型和信息，而所提出的方法会自动根据输入信号的差异估计状态转换，并根据这些信号构建状态空间，而无需参考有关环境的现有信息。而且，由于它是一种在线学习方法，因此所提出的方法需要较少的计算并且不需要批处理内存。通过数值实验，我们证明了该方法在给定信息的情况下具有与传统方法相同的性能，并且通过抽象状态空间缩短了学习时间。

更新日期：2020-09-16

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11