Journal: Telecommunication Systems

Adaptive transmission scheduling over fading channels for energy-efficient cognitive radio networks by reinforcement learning


Abstract

In this paper, we address a cross-layer problem: maximizing the long-term average utility of an energy-efficient cognitive radio network carrying packetized data traffic, subject to a constraint on the collision rate with licensed users. Utility is determined by the number of packets transmitted successfully per unit of consumed power and by buffer occupancy. We formulate the problem with a dynamic programming method, namely a constrained Markov decision process (CMDP). A reinforcement learning (RL) approach is employed to find a near-optimal policy in an unknown environment. The policy learned by RL guides the transmitter in accessing available channels and selecting a proper transmission rate at the beginning of each frame, in pursuit of its long-term optimization goal. Several implementation issues of the RL approach are discussed. First, state-space compaction is used to cope with the so-called curse of dimensionality caused by the large state space of the formulated CMDP. Second, action-set reduction is presented to reduce the number of actions available in certain system states. Finally, the CMDP is converted into a corresponding unconstrained Markov decision process (UMDP) via the Lagrangian multiplier approach, and a golden-section search method is proposed to find the proper multiplier. To evaluate the performance of the policy learned by RL, we present two naive policies and compare them against it in simulations.
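The Lagrangian relaxation described in the abstract folds the collision-rate constraint into the reward, after which a model-free RL method can learn a policy. As a minimal sketch (not the paper's actual algorithm), the idea can be illustrated with tabular Q-learning on a hypothetical two-state channel model; all names, the toy environment, and the parameter values here are illustrative assumptions:

```python
import random

random.seed(0)

def q_learning_lagrangian(n_states, n_actions, step, lam,
                          episodes=2000, horizon=50,
                          alpha=0.1, gamma=0.9, eps=0.1):
    """Tabular Q-learning for a CMDP relaxed via a Lagrange multiplier:
    the per-step reward is utility - lam * collision_indicator."""
    Q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(episodes):
        s = random.randrange(n_states)
        for _ in range(horizon):
            # epsilon-greedy action selection
            if random.random() < eps:
                a = random.randrange(n_actions)
            else:
                a = max(range(n_actions), key=lambda x: Q[s][x])
            s2, utility, collision = step(s, a)
            r = utility - lam * collision  # Lagrangian-relaxed reward
            Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
            s = s2
    return Q

# Toy environment (an assumption for illustration only):
# state 0 = channel idle, state 1 = channel occupied by a licensed user.
# Action 1 = transmit (utility 1 when idle, collision when occupied);
# action 0 = stay silent. The next channel state is drawn uniformly.
def step(s, a):
    s2 = random.randrange(2)
    utility = 1.0 if (a == 1 and s == 0) else 0.0
    collision = 1.0 if (a == 1 and s == 1) else 0.0
    return s2, utility, collision

Q = q_learning_lagrangian(n_states=2, n_actions=2, step=step, lam=2.0)
policy = [max(range(2), key=lambda a: Q[s][a]) for s in range(2)]
# A large multiplier penalizes collisions, so the learned policy
# transmits only when the channel is idle.
```

With the multiplier fixed at 2.0, transmitting in the occupied state earns a negative relaxed reward, so the greedy policy transmits only in the idle state; sweeping the multiplier (as the paper does via golden-section search) trades throughput against the collision constraint.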
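The golden-section search mentioned in the abstract is a standard derivative-free method for locating the extremum of a unimodal function by shrinking a bracketing interval by the golden ratio at each step. A generic sketch (the objective here is a placeholder, not the paper's Lagrangian dual):

```python
import math

INV_PHI = (math.sqrt(5) - 1) / 2  # 1/phi ≈ 0.618

def golden_section_search(f, lo, hi, tol=1e-6):
    """Minimize a unimodal function f over [lo, hi] to within tol."""
    a, b = lo, hi
    c = b - INV_PHI * (b - a)
    d = a + INV_PHI * (b - a)
    while abs(b - a) > tol:
        if f(c) < f(d):
            b = d   # minimum lies in [a, d]
        else:
            a = c   # minimum lies in [c, b]
        c = b - INV_PHI * (b - a)
        d = a + INV_PHI * (b - a)
    return (a + b) / 2

# Placeholder objective: a convex function with its minimum at 2.0,
# standing in for the dual function evaluated at a candidate multiplier.
lam_star = golden_section_search(lambda lam: (lam - 2.0) ** 2, 0.0, 10.0)
```

In the paper's setting, each evaluation of `f` would correspond to solving (or learning) the relaxed UMDP for a candidate multiplier, making a low-evaluation-count line search like this attractive.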
