首页> 外文会议>International Conference on Cognitive Radio Oriented Wireless Networks and Communications >Reinforcement learning based spectrum-aware routing in multi-hop cognitive radio networks
【24h】

Reinforcement learning based spectrum-aware routing in multi-hop cognitive radio networks

机译:基于跳跃学习的频谱意识路由在多跳认知无线电网络中

获取原文

摘要

Routing in multi-hop cognitive radio networks (CRN) should be spectrum-aware. In this paper, two adaptive reinforcement learning based spectrum-aware routing protocols are introduced. Q-learning and dual reinforcement learning are applied respectively for them. The cognitive nodes store a table of Q values that estimate the numbers of available channels on the routes and update them while routing. So they can adaptively learn good routes which have more available channels from just local information. Compared to the previous spectrum aware routing protocols in multi-hop cognitive radio networks, they are simpler and easier to implement, more cost-effective, and can avoid drawbacks in on-demand protocols but still keep adaptive and dynamic routing. Both of our protocols perform better than the spectrum-aware shortest path protocol when network load is not too low. In the meantime, spectrum-aware DRQ-routing learns the optimal routing policy 1.5 times as fast as the spectrum-aware Q-routing at low and medium network load. It also learns a routing policy which is more than seven times as good as that of spectrum-aware Q-routing at high network load.
机译:在多跳认知无线电网络(CRN)中的路由应该是频谱感知。在本文中,引入了两个基于自适应增强的频谱感知路由协议。 Q-Learning和双重强化学习分别用于它们。认知节点存储Q值的表,该表估计路由上的可用通道的数量并在路由时更新。因此,他们可以自适应地学习良好的路线,该路线与本地信息有更多可用频道。与先前的频谱感知路由协议相比,在多跳认知无线电网络中,它们更简单,更容易实现,更具成本效益,并且可以避免在按需协议中缺点,但仍然保持自适应和动态路由。当网络负载不是太低时,我们的两个协议都比频谱感知的最短路径协议更好。与此同时,频谱感知DRQ路由将最佳路由策略学习1.5次,以快速的频谱感知Q-Routing处于低和中网络负载。它还学习了一个路由策略,其在高网络负载下的频谱感知Q-Routing的路由策略超过七倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号