首页> 外文会议>IFAC Conference on Embedded Systems, Computational Intelligence and Telematics in Control >Weight Adjustments in a Routing Algorithm for Wireless Sensor and Actuator Networks Using Q-Learning
【24h】

Weight Adjustments in a Routing Algorithm for Wireless Sensor and Actuator Networks Using Q-Learning

机译:使用Q学习的无线传感器和执行器网络路由算法的重量调整

获取原文

摘要

Wireless Sensor and Actuator Networks like ISA SP100.11a and WirelessHART have a special device known as network manager, which has tasks such as admission control of devices, definition of routes and allocation of communication resources. The routing algorithms used in those protocols need to build routes that keep path redundancy, network reliability and balance energy consumption, resource use and latency. Some of the routing algorithms used for those protocols have weights that allow adjusting some route preferences. The dynamicity of wireless networks can be challenging for adjusting the routing algorithms, and Reinforcement Learning models can be useful to select and adapt weights and optimize routes according to application requirements and current operating conditions. In this work, a global routing agent with Q-Learning is proposed for weight adjustments of a state-of-the-art routing algorithm, aiming the balance of overall latency and lifetime of the network. States are modeled as a set of weights and actions change the current state. The rewards are positive when a state-action pair increases the expected network lifetime and decreases the average network latency. Experiments were conducted using a WirelessHART simulator, and the results showed that the approach can balance the latency and network lifetime when compared with other state-of-the-art routing algorithms with fixed weights.
机译:无线传感器和执行器网络,如ISA SP100.11a和WirelessHART具有被称为网络管理器,其具有任务,例如设备的准入控制,路由的定义和通信资源的分配的特殊设备。在这些协议中所使用的路由算法需要的是保持路径冗余,网络可靠性和平衡的能源消耗,资源利用和延迟构建路线。一些用于这些协议的路由算法具有权重是在调整一些路线偏好。无线网络的动态性是具有挑战性,用于调节的路由算法,并强化学习模型可以是有用的选择,并根据应用要求和当前操作条件适应权重和优化的路由。在这项工作中,用Q学习的全局路由代理提出了一个国家的最先进的路由算法进行体重调整,目标总延时和网络生命周期的平衡。美国建模为一组权重和行动改变现状。回报是正当的状态 - 动作对增加了预期网络的寿命,并降低了平均网络延迟。实验使用的WirelessHART模拟器进行,该结果表明,当与其它国家的最先进路由算法与固定权相比该方法能平衡等待时间和网络的生命周期。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号