首页> 外文会议>IEEE International Conference on Communications Workshops >Packet Drop Probability-Optimal Cross-layer Scheduling: Dealing with Curse of Sparsity using Prioritized Experience Replay
【24h】

Packet Drop Probability-Optimal Cross-layer Scheduling: Dealing with Curse of Sparsity using Prioritized Experience Replay

机译:数据包丢弃概率 - 最佳跨层调度:处理稀疏性的诅咒,使用优先级经验重放

获取原文

摘要

In this work, we develop a reinforcement learning (RL) based model-free approach to obtain a policy for joint packet scheduling and rate adaptation, such that the packet drop probability (PDP) is minimized. The developed learning scheme yields an online cross-layer scheduling policy which takes into account the randomness in packet arrivals and wireless channels, as well as the state of packet buffers. Inherent difference in the time-scales of packet arrival process and the wireless channel variations leads to sparsity in the observed reward signal. Since an RL agent learns by using the feedback obtained in terms of rewards for its actions, the sample complexity of RL approach increases exponentially due to resulting sparsity. Therefore, a basic RL based approach, e.g., double deep Q-network (DDQN) based RL, results in a policy with negligible performance gain over the state-of-the-art schemes, such as shortest processing time (SPT) based scheduling. In order to alleviate the sparse reward problem, we leverage prioritized experience replay (PER) and develop a DDQN-based learning scheme with PER. We observe through simulations that the policy learned using DDQN-PER approach results in a 3-5% lower PDP, compared to both the basic DDQN based RL and SPT scheme.
机译:在这项工作中,我们开发了一种基于的增强学习(RL)的无模型方法,以获得联合分组调度和速率自适应的策略,使得分组丢弃概率(PDP)最小化。开发的学习方案产生了在线跨层调度策略,该策略考虑了数据包到达和无线信道中的随机性,以及分组缓冲区的状态。数据包到达过程的时间尺度的固有差异,无线信道变化导致观察到的奖励信号中的稀疏性。由于RL代理通过使用在其作用的奖励方面获得的反馈来学习,因此由于产生的稀疏性,RL方法的样本复杂性增加。因此,基于基于RL的基本方法,例如,基于双深Q网络(DDQN)的RL,导致策略在最先进的方案上具有可忽略的性能增益,例如基于最短的处理时间(SPT)的调度。为了减轻稀疏奖励问题,我们利用优先考虑的经验重播(每个),并使用每次开发基于DDQN的学习计划。我们通过模拟观察,与基于基于DDQN的RL和SPT方案相比,使用DDQN-PER-PERIC的策略获得了3-5%的PDP。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号