首页> 外文会议>International Conference on Intelligent Computing and Signal Processing >Deep Multi-User Reinforcement Learning for Centralized Dynamic Multichannel Access
【24h】

Deep Multi-User Reinforcement Learning for Centralized Dynamic Multichannel Access

机译:用于集中动态多通道访问的深度多用户加固学习

获取原文

摘要

We consider the problem of dynamic multichannel access for transmission maximization in multi-user wireless communication networks. At the beginning of each time slot, the centralized node evaluates all K channels and allocate a channel to transmit a packet for each user. After each time slot, the centralized node receives feedback signals for each user indicating whether the packet was successfully delivered or not. The objective is to find a multi-user strategy that maximizes global channel utilization with a low collision in a centralized manner without any prior knowledge. Obtaining an optimal solution for centralized dynamic multichannel access problem is a difficult problem due to the large-state and large-action space. To tackle this problem, we develop a centralized dynamic multichannel access framework based on double deep recurrent reinforcement learning. Specifically, at each time slot, the centralized node map current state to channel access actions and select multiple channels where the sum of corresponding value function obtains its maximum value based on neural network. The proposed method decreases collision through centralized allocation policy to achieve an optimal multi-user channel allocation.
机译:我们考虑了多用户无线通信网络中传输最大化的动态多声道访问的问题。在每个时隙的开始处,集中节点评估所有K信道并分配用于为每个用户发送分组的信道。在每个时隙之后,集中式节点接收指示分组是否成功传送的每个用户的反馈信号。目标是找到一种多用户策略,可以以集中式方式在没有任何先验知识的情况下以低碰撞最大化全局信道利用率。由于大状态和大动作空间,获得集中动态多声道接入问题的最佳解决方案是难题。为了解决这个问题,我们开发了一种基于双层经常性强化学习的集中动态多信道访问框架。具体地,在每个时隙,集中式节点映射当前状态到信道访问动作,并选择多个信道,其中相应的值函数的和基于神经网络获得其最大值。该方法通过集中分配策略降低碰撞,以实现最佳的多用户频道分配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号