首页> 外文会议>IEEE International Conference on Communications Workshops >Packet Drop Probability-Optimal Cross-layer Scheduling: Dealing with Curse of Sparsity using Prioritized Experience Replay

【24h】

Packet Drop Probability-Optimal Cross-layer Scheduling: Dealing with Curse of Sparsity using Prioritized Experience Replay

机译：数据包丢弃概率 - 最佳跨层调度：处理稀疏性的诅咒，使用优先级经验重放

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work, we develop a reinforcement learning (RL) based model-free approach to obtain a policy for joint packet scheduling and rate adaptation, such that the packet drop probability (PDP) is minimized. The developed learning scheme yields an online cross-layer scheduling policy which takes into account the randomness in packet arrivals and wireless channels, as well as the state of packet buffers. Inherent difference in the time-scales of packet arrival process and the wireless channel variations leads to sparsity in the observed reward signal. Since an RL agent learns by using the feedback obtained in terms of rewards for its actions, the sample complexity of RL approach increases exponentially due to resulting sparsity. Therefore, a basic RL based approach, e.g., double deep Q-network (DDQN) based RL, results in a policy with negligible performance gain over the state-of-the-art schemes, such as shortest processing time (SPT) based scheduling. In order to alleviate the sparse reward problem, we leverage prioritized experience replay (PER) and develop a DDQN-based learning scheme with PER. We observe through simulations that the policy learned using DDQN-PER approach results in a 3-5% lower PDP, compared to both the basic DDQN based RL and SPT scheme.

机译：在这项工作中，我们开发了一种基于的增强学习（RL）的无模型方法，以获得联合分组调度和速率自适应的策略，使得分组丢弃概率（PDP）最小化。开发的学习方案产生了在线跨层调度策略，该策略考虑了数据包到达和无线信道中的随机性，以及分组缓冲区的状态。数据包到达过程的时间尺度的固有差异，无线信道变化导致观察到的奖励信号中的稀疏性。由于RL代理通过使用在其作用的奖励方面获得的反馈来学习，因此由于产生的稀疏性，RL方法的样本复杂性增加。因此，基于基于RL的基本方法，例如，基于双深Q网络（DDQN）的RL，导致策略在最先进的方案上具有可忽略的性能增益，例如基于最短的处理时间（SPT）的调度。为了减轻稀疏奖励问题，我们利用优先考虑的经验重播（每个），并使用每次开发基于DDQN的学习计划。我们通过模拟观察，与基于基于DDQN的RL和SPT方案相比，使用DDQN-PER-PERIC的策略获得了3-5％的PDP。

著录项

来源
《IEEE International Conference on Communications Workshops 》|2021年|1-6|共6页
会议地点
作者
Mohit K. Sharma; Tan Peng Hui; Ernest Kurniawan; Sun Sumei;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Wireless communication; Adaptation models; Scheduling algorithms; Conferences; Reinforcement learning; Performance gain; Scheduling;

机译：无线通信;适配模型;调度算法;会议;加强学习;性能收益;调度;

相似文献

外文文献
中文文献
专利

1. Cross-Layer prioritized H.264 video packetization and error protection over noisy channels [J] . Kambhatla Kashyap K. R., Paluri Seethal, Matyjas John D., Multimedia Tools and Applications . 2016 ,第6期

机译：跨通道优先处理H.264视频分包和带噪通道的错误保护
2. A divided and prioritized experience replay approach for streaming regression [J] . Mikkel Leite Arn?, John-Morten Godhavn, Ole Morten Aamo MethodsX . 2021 ,第a期

机译：用于流媒体回归的分割和优先考虑体验重播方法
3. Bias-reduced hindsight experience replay with virtual goal prioritization [J] . Manela B., Biess A. Neurocomputing . 2021 ,第Sepa3期

机译：偏见减少的后敏感体验重放虚拟目标优先级
4. Ddper: Decentralized Distributed Prioritized Experience Replay [C] . Sidun Liu, Peng Qiao, Yong Dou, IEEE International Conference on Multimedia and Expo . 2021

机译：DDPER：分散分布式优先考虑体验重放
5. Cross-Layer Prioritized Video Transmission: Adaptive Packetization, FEC Protection and Scheduling Methods. [D] . Kambhatla, Kashyap Kodanda Ram. 2014

机译：跨层优先视频传输：自适应分组，FEC保护和调度方法。
6. An Efficient Cross-Layer Approach for Malicious Packet Dropping Detection in MANETs [O] . Leovigildo Sánchez-casado, Gabriel Maciá-fernández, Pedro García-teodoro 2014

机译：一种用于maNET中恶意丢包检测的高效跨层方法

Packet Drop Probability-Optimal Cross-layer Scheduling: Dealing with Curse of Sparsity using Prioritized Experience Replay

摘要

著录项

相似文献

相关主题

期刊订阅