IFAC PapersOnLine

Weight Adjustments in a Routing Algorithm for Wireless Sensor and Actuator Networks Using Q-Learning

Abstract

Wireless Sensor and Actuator Networks such as ISA SP100.11a and WirelessHART include a special device known as the network manager, whose tasks include admission control of devices, definition of routes, and allocation of communication resources. The routing algorithms used in these protocols need to build routes that preserve path redundancy and network reliability while balancing energy consumption, resource use, and latency. Some of these routing algorithms have weights that allow route preferences to be adjusted. Because wireless networks are dynamic, tuning the routing algorithms can be challenging, and Reinforcement Learning models can help select and adapt the weights and optimize routes according to application requirements and current operating conditions. In this work, a global routing agent based on Q-Learning is proposed to adjust the weights of a state-of-the-art routing algorithm, with the aim of balancing the overall latency and the lifetime of the network. States are modeled as a set of weights, and actions change the current state. The reward is positive when a state-action pair increases the expected network lifetime and decreases the average network latency. Experiments were conducted using a WirelessHART simulator, and the results showed that the approach can balance latency and network lifetime when compared with other state-of-the-art routing algorithms that use fixed weights.
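
The state, action, and reward formulation described in the abstract can be illustrated with a minimal tabular Q-Learning sketch. This is not the authors' implementation. The number of weights, their discretization, the reward magnitudes (+1/-1), the hyperparameters, and the `toy_metrics` function (a hypothetical stand-in for a WirelessHART simulator, in which both metrics simply improve toward an arbitrary target) are all assumptions made for illustration.

```python
import random
from collections import defaultdict

WEIGHT_LEVELS = (0, 1, 2, 3, 4)  # assumed discretization of each weight
NUM_WEIGHTS = 2                  # assumed number of tunable routing weights

# Actions: (weight index, delta); (None, 0) keeps the current weights.
ACTIONS = [(None, 0)] + [(i, d) for i in range(NUM_WEIGHTS) for d in (-1, 1)]


def apply_action(state, action):
    """Return the weight tuple reached by applying `action` to `state`."""
    index, delta = action
    if index is None:
        return state
    levels = list(state)
    # Clamp so the weight stays inside the assumed discrete range.
    levels[index] = min(max(levels[index] + delta, WEIGHT_LEVELS[0]),
                        WEIGHT_LEVELS[-1])
    return tuple(levels)


def reward(prev_metrics, new_metrics):
    """+1 if lifetime rose and latency fell, else -1 (magnitudes assumed)."""
    prev_lifetime, prev_latency = prev_metrics
    new_lifetime, new_latency = new_metrics
    improved = new_lifetime > prev_lifetime and new_latency < prev_latency
    return 1.0 if improved else -1.0


def toy_metrics(state):
    """Hypothetical simulator stand-in: returns (lifetime, latency)."""
    w0, w1 = state
    lifetime = 100.0 - 10.0 * abs(w0 - 3) - 5.0 * abs(w1 - 1)
    latency = 20.0 + 4.0 * abs(w0 - 3) + 6.0 * abs(w1 - 1)
    return lifetime, latency


def q_learning_step(q, state, metrics, rng, alpha=0.1, gamma=0.9, epsilon=0.1):
    """One epsilon-greedy action followed by the standard Q-Learning update."""
    if rng.random() < epsilon:
        action = rng.choice(ACTIONS)
    else:
        action = max(ACTIONS, key=lambda a: q[(state, a)])
    next_state = apply_action(state, action)
    next_metrics = toy_metrics(next_state)
    r = reward(metrics, next_metrics)
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    q[(state, action)] += alpha * (r + gamma * best_next - q[(state, action)])
    return next_state, next_metrics


def train(steps=2000, seed=0):
    """Run the agent from an assumed initial weight setting of (2, 2)."""
    rng = random.Random(seed)
    q = defaultdict(float)  # Q-table keyed by (state, action), default 0.0
    state = (2, 2)
    metrics = toy_metrics(state)
    for _ in range(steps):
        state, metrics = q_learning_step(q, state, metrics, rng)
    return q, state
```

Calling `train()` returns the learned Q-table and the final weight setting. In a real deployment the metrics would come from the network manager or a simulator rather than from a closed-form function, and the toy model here does not capture the trade-off between latency and lifetime that the work addresses.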