Uncertainty Maximization in Partially Observable Domains: A Cognitive Perspective,arXiv - CS - Artificial Intelligence

当前位置： X-MOL 学术 › arXiv.cs.AI › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Uncertainty Maximization in Partially Observable Domains: A Cognitive Perspective
arXiv - CS - Artificial Intelligence Pub Date : 2021-02-22 , DOI: arxiv-2102.11232
Mirza Ramicic, Andrea Bonarini

Faced with an ever-increasing complexity of their domains of application, artificial learning agents are now able to scale up in their ability to process an overwhelming amount of information coming from their interaction with an environment. However, this process of scaling does come with a cost of encoding and processing an increasing amount of redundant information that is not necessarily beneficial to the learning process itself. This work exploits the properties of the learning systems defined over partially observable domains by selectively focusing on the specific type of information that is more likely to express the causal interaction among the transitioning states of the environment. Adaptive masking of the observation space based on the \textit{temporal difference displacement} criterion enabled a significant improvement in convergence of temporal difference algorithms defined over a partially observable Markov process.

中文翻译：

部分可观察域中的不确定性最大化：认知角度

面对其应用领域的日益复杂性，人工学习代理现在能够扩展其处理与环境交互作用产生的大量信息的能力。然而，这种缩放过程确实伴随着编码和处理越来越多的冗余信息的成本，这不一定对学习过程本身有利。这项工作通过选择性地关注特定类型的信息，从而更容易地表达环境的过渡状态之间的因果关系，从而利用了在部分可观察的域中定义的学习系统的属性。

更新日期：2021-02-23

点击分享查看原文

点击收藏

阅读更多本刊最新论文