首页> 外文会议>International Symposium on Quality Electronic Design >SAC: A Novel Multi-hop Routing Policy in Hybrid Distributed IoT System based on Multi-agent Reinforcement Learning
【24h】

SAC: A Novel Multi-hop Routing Policy in Hybrid Distributed IoT System based on Multi-agent Reinforcement Learning

机译:SAC:基于多功能钢筋学习的混合分布式IOT系统的新型多跳路由策略

获取原文

摘要

Energy harvesting (EH) IoT devices have attracted vast attention in both academia and industry as they can work sustainably by harvesting energy from the ambient environment. However, due to the weak and transient nature of harvesting power, EH technology is unable to support power-intensive IoT devices such as IoT edge servers. Therefore, the hybrid IoT system where the EH IoT devices and non-EH IoT devices co-exist is forthcoming. This paper explored the routing problem in such a hybrid distributed IoT system. We first proposed a comprehensive multi-hop routing mechanism of this hybrid system. After that, we proposed a distributed multi-agent deep reinforcement learning algorithm, known as spatial asynchronous advantage actor-critic (SAC), to optimize the system routing policy and energy allocation while maximizing the total amount of transmitted data and the overall data delivery to the sink node. The experiments indicate that SAC can averagely complete at least $sim 1.5 imes$ transmission rate and $sim 12.9imes$ Sink packet delivery rate compared with the baselines.
机译:能源收获(EH)物联网设备在学术界和工业中引起了广大的关注,因为它们可以通过从周围环境中收获能量来可持续地工作。但是,由于收获功率的弱和瞬态性质,EH技术无法支持电源密集型的IOT设备,如IOT边缘服务器。因此,即将出现EH IOT设备和非EH IOT设备的混合IOT系统。本文探讨了这种混合分布式IOT系统中的路由问题。我们首先提出了这种混合系统的全面的多跳路由机制。之后,我们提出了一种分布式多代理深度加强学习算法,称为空间异步优势演员 - 评论家(SAC),以优化系统路由策略和能量分配,同时最大化传输数据的总量和整体数据传送水槽节点。实验表明,与基线相比,SAC可以平均至少完成至少$ SIM 1.5 倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号