首页> 外文会议>IEEE Global Communications Conference >Joint Offloading and Resource Allocation in Mobile Edge Computing Systems: An Actor-Critic Approach
【24h】

Joint Offloading and Resource Allocation in Mobile Edge Computing Systems: An Actor-Critic Approach

机译:移动边缘计算系统中的联合卸载和资源分配:一种Actor-Critic方法

获取原文
获取外文期刊封面目录资料

摘要

Offloading computationally intensive tasks from user equipments (UEs) to mobile edge computing (MEC) servers is a promising technique to boost up the computational capacity of UEs. However, MEC will incur extra energy consumption and time delays, which motivates the deployment of energy harvesting (EH) small cell networks with MEC in mobile networks. Due to the complexity of such networks, it is challenging to effectively allocate resources for UEs. In this paper, we investigate the offloading decision, wireless and computational resources allocation problem in energy harvesting (EH) small cell networks with MEC. Different from existing literatures, our research focuses on improving mobile operators' revenue by maximizing the amount of the offloaded tasks while decreasing the energy expenditure and time-delays. Besides, queues are created at the MEC server side to store the un-executed tasks in a time slot, which is used as a punishment in our utility function to avoid serious delay. Considering the varying lengths of queues, the states of EH-batteries of small base stations (SBSs) and down-link channels, the above problem is modeled as a Markov decision process (MDP). Since the states and actions in the MDP are infinite, an online and on-policy actor-critic with eligibility traces algorithm is proposed to resolve the problem. Simulation results show the proposed algorithm has superior performances compared with the policy-gradient algorithm and Q-learning.
机译:将计算密集型任务从用户设备(UE)卸载到移动边缘计算(MEC)服务器是一种有前途的技术,可以提高UE的计算能力。但是,MEC将招致额外的能耗和时间延迟,这会促使在移动网络中使用MEC部署能量收集(EH)小小区网络。由于这种网络的复杂性,有效地为UE分配资源是挑战。在本文中,我们研究了使用MEC的能量收集(EH)小型蜂窝网络中的卸载决策,无线和计算资源分配问题。与现有文献不同,我们的研究重点是通过最大程度地减少卸载任务,同时减少能源消耗和时间延迟来提高移动运营商的收入。此外,在MEC服务器端创建队列以将未执行的任务存储在一个时隙中,这是对我们的实用程序功能的一种惩罚,以避免严重的延迟。考虑到队列长度的变化,小型基站(SBS)的EH电池状态和下行链路信道,上述问题被建模为马尔可夫决策过程(MDP)。由于MDP中的状态和动作是无限的,因此提出了一种具有资格跟踪的在线和基于策略的行为者评论家算法来解决该问题。仿真结果表明,与策略梯度算法和Q学习相比,该算法具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号