首页> 外文会议>IEEE International Symposium on Dynamic Spectrum Access Networks >Deep Q-Learning with Multiband Sensing for Dynamic Spectrum Access
【24h】

Deep Q-Learning with Multiband Sensing for Dynamic Spectrum Access

机译:具有多频段感应功能的深度Q学习,可实现动态频谱访问

获取原文

摘要

We study a dynamic spectrum access situation where, in each time slot, a single cognitive agent decides to either stay idle or access one of the N frequency channels based on its sensing of the whole spectrum. The channels are occupied or vacant according to N independent nonidentical 2-state Markov chains. We prove that the optimal access policy can easily be found if the state transition probabilities of all channels are known. When the agent has no knowledge about the channel model, we propose to use the deep Q-learning method to learn a state-action value function that determines an access policy from the observed states of all channels. In this method, the optimal Q-function is approximated with a neural network of all dense layers that is trained via experience replay. We demonstrate through experiments that the learning-based policies consistently achieve performances that are close to the optimal ones.
机译:我们研究了一种动态频谱访问情况,其中在每个时隙中,单个认知代理基于对整个频谱的感知而决定保持空闲状态或访问N个频道之一。根据N个独立的不相同的2状态Markov链,信道被占用或空闲。我们证明,如果知道所有通道的状态转移概率,就可以轻松找到最佳访问策略。当代理不了解通道模型时,我们建议使用深度Q学习方法来学习状态作用值函数,该函数根据观察到的所有通道的状态来确定访问策略。在这种方法中,最佳Q函数通过所有密集层的神经网络进行近似,该神经网络通过经验重播进行训练。我们通过实验证明,基于学习的策略始终可以实现接近最佳策略的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号