IEEE Transactions on Wireless Communications

The Application of Deep Reinforcement Learning to Distributed Spectrum Access in Dynamic Heterogeneous Environments With Partial Observations


Abstract

This paper(1) investigates deep reinforcement learning (DRL) based on a Recurrent Neural Network (RNN) for Dynamic Spectrum Access (DSA) under partial observations, referred to as a Deep Recurrent Q-Network (DRQN). Specifically, we consider a scenario with multiple independent channels and multiple heterogeneous Primary Users (PUs). Two key challenges in our problem formulation are that the DRQN node is assumed to have no prior knowledge of the other nodes' behavior patterns and that it must predict the future channel state from previous observations. The goal of the DRQN is to learn a channel access strategy with a low collision rate but a high channel utilization rate. With proper definitions of the state, action, and reward, our extensive simulation results show that a DRQN-based approach can handle a variety of communication environments, including dynamic environments. Further, our results show that the DRQN node is also able to cope with multi-rate and multi-agent scenarios. Importantly, we show the following benefits of using recurrent neural networks in DSA: (i) the ability to learn the optimal strategy in different environments under partial observations; (ii) robustness to imperfect observations; (iii) the ability to utilize multiple channels; and (iv) robustness in the presence of multiple agents. (1) A portion of this work was presented at MILCOM 2018 in [1].
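The abstract describes a DRQN that aggregates a history of partial channel observations through a recurrent network and outputs Q-values over channel-access actions. The paper itself does not provide code, so the sketch below is only illustrative: the layer sizes, the observation encoding (one float per channel), the action set (transmit on one channel or stay silent), and the epsilon-greedy selection are assumptions, not the authors' implementation.

    import torch
    import torch.nn as nn


    class DRQN(nn.Module):
        """Recurrent Q-network: aggregates past partial channel observations
        through a GRU and outputs Q-values over channel-access actions."""

        def __init__(self, num_channels: int, hidden_size: int = 64):
            super().__init__()
            # Observation: one float per channel (e.g. last sensed busy/idle);
            # this encoding is an assumption, not taken from the paper.
            self.obs_dim = num_channels
            # Actions: transmit on one of the channels, or stay silent.
            self.num_actions = num_channels + 1
            self.rnn = nn.GRU(self.obs_dim, hidden_size, batch_first=True)
            self.q_head = nn.Linear(hidden_size, self.num_actions)

        def forward(self, obs_seq, hidden=None):
            # obs_seq: (batch, time, obs_dim). The recurrence lets the node
            # infer channel dynamics it cannot observe directly at each step.
            out, hidden = self.rnn(obs_seq, hidden)
            return self.q_head(out), hidden  # Q-values: (batch, time, num_actions)


    def select_action(net, obs, hidden, epsilon=0.05):
        """Epsilon-greedy choice for a single time step (illustrative value)."""
        with torch.no_grad():
            q, hidden = net(obs.view(1, 1, -1), hidden)
        if torch.rand(1).item() < epsilon:
            action = int(torch.randint(net.num_actions, (1,)).item())
        else:
            action = int(q[0, -1].argmax().item())
        return action, hidden

Training such a network would follow standard DQN-style temporal-difference updates over observation sequences, with a reward that is, for example, positive on a successful transmission and negative on a collision, in line with the abstract's stated goal of a low collision rate and high channel utilization; those reward values are likewise assumptions here.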
