当前位置: X-MOL 学术arXiv.cs.IT › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Dynamic Resource Configuration for Low-Power IoT Networks: A Multi-Objective Reinforcement Learning Method
arXiv - CS - Information Theory Pub Date : 2021-06-05 , DOI: arxiv-2106.02826
Yang Huang, Caiyong Hao, Yijie Mao, Fuhui Zhou

Considering grant-free transmissions in low-power IoT networks with unknown time-frequency distribution of interference, we address the problem of Dynamic Resource Configuration (DRC), which amounts to a Markov decision process. Unfortunately, off-the-shelf methods based on single-objective reinforcement learning cannot guarantee energy-efficient transmission, especially when all frequency-domain channels in a time interval are interfered. Therefore, we propose a novel DRC scheme where configuration policies are optimized with a Multi-Objective Reinforcement Learning (MORL) framework. Numerical results show that the average decision error rate achieved by the MORL-based DRC can be even less than 12% of that yielded by the conventional R-learning-based approach.

中文翻译:

低功耗物联网网络的动态资源配置:一种多目标强化学习方法

考虑到干扰时频分布未知的低功率物联网网络中的无授权传输,我们解决了动态资源配置 (DRC) 的问题,这相当于马尔可夫决策过程。不幸的是,基于单目标强化学习的现成方法无法保证高能效传输,尤其是当一个时间间隔内的所有频域信道都受到干扰时。因此,我们提出了一种新颖的 DRC 方案,其中使用多目标强化学习(MORL)框架优化配置策略。数值结果表明,基于 MORL 的 DRC 实现的平均决策错误率甚至可以低于传统的基于 R 学习的方法产生的决策错误率的 12%。
更新日期:2021-06-08
down
wechat
bug