当前位置: X-MOL 学术IEEE Trans. Parallel Distrib. Syst. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Multi-agent Imitation Learning for Pervasive Edge Computing: A Decentralized Computation Offloading Algorithm
IEEE Transactions on Parallel and Distributed Systems ( IF 5.3 ) Pub Date : 2021-02-01 , DOI: 10.1109/tpds.2020.3023936
Xiaojie Wang , Zhaolong Ning , Song Guo

Pervasive edge computing refers to one kind of edge computing that merely relies on edge devices with sensing, storage and communication abilities to realize peer-to-peer offloading without centralized management. Due to lack of unified coordination, users always pursue profits by maximizing their own utilities. However, on one hand, users may not make appropriate scheduling decisions based on their local observations. On the other hand, how to guarantee the fairness among different edge devices in the fully decentralized environment is rather challenging. To solve the above issues, we propose a decentrailized computation offloading algorithm with the purpose of minimizing average task completion time in the pervasive edge computing networks. We first derive a Nash equilibrium among devices by stochastic game theories based on the full observations of system states. After that, we design a traffic offloading algorithm based on partial observations by integrating general adversarial imitation learning. Multiple experts can provide demonstrations, so that devices can mimic the behaviors of corresponding experts by minimizing the gaps between the distributions of their observation-action pairs. At last, theoretical and performance results show that our solution has a significant advantage compared with other representative algorithms.

中文翻译:

普适边缘计算的多智能体模仿学习:一种分散计算卸载算法

普适边缘计算是指一种边缘计算,仅依靠具有感知、存储和通信能力的边缘设备实现点对点卸载,无需集中管理。由于缺乏统一协调,用户总是通过最大化自己的效用来追求利润。然而,一方面,用户可能无法根据他们的本地观察做出适当的调度决策。另一方面,在完全去中心化的环境中,如何保证不同边缘设备之间的公平性是相当具有挑战性的。为了解决上述问题,我们提出了一种去中心化的计算卸载算法,旨在最小化普适边缘计算网络中的平均任务完成时间。我们首先基于对系统状态的全面观察,通过随机博弈论推导出设备之间的纳什均衡。之后,我们通过集成通用对抗性模仿学习设计了一种基于部分观察的流量卸载算法。多个专家可以提供演示,以便设备可以通过最小化其观察-动作对分布之间的差距来模仿相应专家的行为。最后,理论和性能结果表明,与其他代表性算法相比,我们的解决方案具有显着优势。这样设备就可以通过最小化其观察-动作对分布之间的差距来模仿相应专家的行为。最后,理论和性能结果表明,与其他代表性算法相比,我们的解决方案具有显着优势。这样设备就可以通过最小化其观察-动作对分布之间的差距来模仿相应专家的行为。最后,理论和性能结果表明,与其他代表性算法相比,我们的解决方案具有显着优势。
更新日期:2021-02-01
down
wechat
bug