首页> 外文期刊>Wireless Networks >RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic loT networks
【24h】

RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic loT networks

机译:Rlproproph:一种基于动态规划的机会批次网络最优路由的加固学习方法

获取原文
获取原文并翻译 | 示例
       

摘要

Routing in Opportunistic Internet of Things networks (OppIoTs) is a challenging task because of intermittent connectivity between devices and the lack of a fixed path between the source and destination of messages. Recently, machine learning (ML) and reinforcement learning (RL) have been used with great success to automate processes in a number of different problem domains. In this paper, we seek to fully automate the OppIoT routing process by using the Policy Iteration algorithm to maximize the possibility of message delivery. Moreover, we model the OppIoT environment as a Markov decision process (MDP) replete with states, actions, rewards, and transition probabilities. The proposed routing protocol, RLProph, is able to optimize the routing process via the optimal policy obtained by solving the MDP using Policy Iteration. Through extensive simulations, we show that RLProph outperforms a number of ML-based and context-aware routing protocols on a multitude of performance criteria.
机译:在机会主义互联网上的路由网络(对比)是一个具有挑战性的任务,因为设备之间的间歇性连接和消息之间的源和目的地之间的固定路径。最近,机器学习(ML)和强化学习(RL)已被使用巨大的成功,以自动化许多不同问题域中的过程。在本文中,我们试图通过使用策略迭代算法来充分自动化对立路由过程,以最大限度地提高消息传递的可能性。此外,我们将对立环境的模拟作为马尔可夫决策过程(MDP)与各国,行动,奖励和过渡概率重新提供。所提出的路由协议RLPRPOLOP,能够通过通过使用策略迭代解决MDP而获得的最佳策略来优化路由过程。通过广泛的仿真,我们表明RLProproph优于许多常数性能标准的ML基于ML的基于和上下文感知的路由协议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号