首页> 外文期刊>IEEE Transactions on Vehicular Technology >Sensing, Probing, and Transmitting Policy for Energy Harvesting Cognitive Radio With Two-Stage After-State Reinforcement Learning
【24h】

Sensing, Probing, and Transmitting Policy for Energy Harvesting Cognitive Radio With Two-Stage After-State Reinforcement Learning

机译:具有两阶段后态强化学习的能量收集认知无线电的传感,探测和传输策略

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper considers joint optimization of spectrum sensing, channel probing, and transmission power control for a single-channel secondary transmitter that operates with harvested energy from ambient sources. At each time slot, to maximize the expected secondary throughput, the transmitter needs to decide whether or not to perform the operations of spectrum sensing, channel probing, and transmission, according to energy status and channel fading status. First, we model this stochastic optimization problem as a two-stage continuous-state Markov decision process, with a sensing-and-probing stage and a transmit-power-control stage. We simplify this problem by a more useful after-state value function formulation. We then propose a reinforcement learning algorithm to learn the after-state value function from data samples when the statistical distributions of harvested energy and channel fading are unknown. Numerical results demonstrate learning characteristics and performance of the proposed algorithm.
机译:本文考虑了单通道辅助发射机的频谱感测,信道探测和发射功率控制的联合优化,该发射机工作于从周围环境获取的能量。在每个时隙,为了使预期的第二吞吐量最大化,发射机需要根据能量状态和信道衰落状态来决定是否执行频谱感测,信道探测和传输操作。首先,我们将该随机优化问题建模为一个两阶段的连续状态马尔可夫决策过程,包括一个传感和探测阶段以及一个发射功率控制阶段。我们通过更有用的事后价值函数公式简化了这个问题。然后,我们提出了一种增强学习算法,以在收集的能量和信道衰落的统计分布未知时从数据样本中学习事后状态函数。数值结果证明了该算法的学习特性和性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号