Sensing, Probing, and Transmitting Policy for Energy Harvesting Cognitive Radio With Two-Stage After-State Reinforcement Learning

Wu Keyu; Jiang Hai; Tellambura Chintha


Abstract

This paper considers joint optimization of spectrum sensing, channel probing, and transmission power control for a single-channel secondary transmitter that operates with harvested energy from ambient sources. At each time slot, to maximize the expected secondary throughput, the transmitter needs to decide whether or not to perform the operations of spectrum sensing, channel probing, and transmission, according to energy status and channel fading status. First, we model this stochastic optimization problem as a two-stage continuous-state Markov decision process, with a sensing-and-probing stage and a transmit-power-control stage. We simplify this problem by a more useful after-state value function formulation. We then propose a reinforcement learning algorithm to learn the after-state value function from data samples when the statistical distributions of harvested energy and channel fading are unknown. Numerical results demonstrate learning characteristics and performance of the proposed algorithm.
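The after-state idea mentioned in the abstract can be illustrated with a deliberately simplified sketch. This is not the authors' algorithm: it uses a single decision stage, a small discrete battery, and a tabular value function, whereas the paper describes a two-stage, continuous-state formulation. Every name and number below (`learn_after_state_values`, the battery capacity, the harvest and channel-gain values, the log-rate reward) is a hypothetical assumption chosen only for illustration. The point it shows is that a value function defined on the battery level left after spending energy can be learned from samples alone, and that greedy action selection then needs no knowledge of the harvest or fading distributions.

```python
import math
import random


def learn_after_state_values(capacity=5, gamma=0.9, alpha=0.05,
                             epsilon=0.1, steps=20000, seed=0):
    """Tabular TD learning of an after-state value function (toy model).

    values[p] estimates the expected discounted future throughput when
    p units of energy remain in the battery after the current transmission.
    """
    rng = random.Random(seed)
    values = [0.0] * (capacity + 1)

    def sample_env():
        # Hypothetical harvest and channel-gain distributions.
        # The learner only sees samples, never these lists.
        return rng.choice([0, 1, 2]), rng.choice([0.2, 1.0, 3.0])

    def reward(gain, energy):
        # Assumed throughput model: log2(1 + gain * energy).
        return math.log2(1.0 + gain * energy)

    def greedy(battery, gain):
        # Best spend = immediate reward + discounted after-state value.
        return max(range(battery + 1),
                   key=lambda e: reward(gain, e) + gamma * values[battery - e])

    after = 0  # start with an empty battery
    for _ in range(steps):
        # Environment moves from the after-state to the next full state.
        harvest, gain = sample_env()
        battery = min(capacity, after + harvest)

        # TD target: value of acting greedily in the sampled next state.
        best = greedy(battery, gain)
        target = reward(gain, best) + gamma * values[battery - best]
        values[after] += alpha * (target - values[after])

        # Epsilon-greedy behaviour so every after-state keeps being visited.
        energy = rng.randrange(battery + 1) if rng.random() < epsilon else best
        after = battery - energy
    return values


if __name__ == "__main__":
    for level, value in enumerate(learn_after_state_values()):
        print(f"energy left = {level}: estimated value = {value:.3f}")
```

In this sketch the table is indexed only by the remaining energy, so the sampled harvest and channel gain enter the update through the target alone. A continuous-state setting such as the one the abstract describes would need function approximation in place of the table.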


Bibliographic details

Source
IEEE Transactions on Vehicular Technology | 2019, Issue 2 | pp. 1616-1630 | 15 pages
Authors
Wu Keyu; Jiang Hai; Tellambura Chintha
Author affiliations

Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 1H9, Canada;

Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 1H9, Canada;

Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 1H9, Canada;

Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
Keywords
Cognitive radio; energy harvesting; power control; reinforcement learning; spectrum sensing;


Similar literature


1. Reinforcement learning based sensing policy optimization for energy efficient cognitive radio networks [J]. Jan Oksanen, Jarmo Lundén, Visa Koivunen. Neurocomputing, 2012.
2. Multi-Slot Spectrum Sensing Schedule and Transmitted Energy Allocation in Harvested Energy Powered Cognitive Radio Networks Under Secrecy Constraints [J]. Tran Nhut Khai Hoan, Insoo Koo. IEEE Sensors Journal, 2017, Issue 7.
3. Sensing and Transmit Energy Optimization for an Energy Harvesting Cognitive Radio [J]. Sultan Ahmed. IEEE Wireless Communications Letters, 2012, Issue 5.
4. Sensing, probing, and transmitting strategy for energy harvesting cognitive radio [C]. Keyu Wu, Hai Jiang, Chintha Tellambura. IEEE International Conference on Communications, 2017.
5. Energy efficiency in cognitive radio network: Study of cooperative sensing using different channel sensing methods [D]. Cui, Chenxuan. 2015.
6. Multichannel-Sensing Scheduling and Transmission-Energy Optimizing in Cognitive Radio Networks with Energy Harvesting [O]. Tran-Nhut-Khai Hoan, Vu-Van Hiep, In-Soo Koo. 2016.
7. Reinforcement learning based sensing policy optimization for energy efficient cognitive radio networks [O]. Jan Oksanen, Jarmo Lundén, Visa Koivunen. 2012.
8. Multi-Objective Reinforcement Learning for Cognitive Radio-Based Satellite Communications [R]. Ferreira, P. V. R., Paffenroth, R., Wyglinski, A. M. 2016.
