首页> 外文会议>IEEE/CIC International Conference on Communications in China >Distributed Reinforcement Learning for Quality-of-Service Routing in Wireless Device-to-device Networks
【24h】

Distributed Reinforcement Learning for Quality-of-Service Routing in Wireless Device-to-device Networks

机译:无线设备到设备网络中的服务质量路由的分布式强化学习

获取原文

摘要

In this paper, we aim to determine the multi-hop route between a device-to-device (D2D) source-destination pair which meets the quality-of-service (QoS) of services. We model this QoS routing problem in D2D as a Markov decision process (MDP) and proposes a distributed multi-agent reinforcement learning routing algorithm. We consider the QoS requirements in terms of bandwidth, delay, and packet loss rate, and allocate the routing path according to link information averaged over time in dynamic network environments. By decomposing the Q-function into multiple local Q-functions, each agent can compute its own optimal strategy based on local observations, which greatly reduces the costs of learning and searching in large-scale multi-state systems. The simulation results show that the proposed algorithm can significantly reduce the average end-to-end delay, the average packet loss rate and service rejection rate compared with both the minimum hop algorithm and the traditional routing algorithm which only considers static parameters.
机译:在本文中,我们旨在确定满足服务质量(QoS)的设备到设备(D2D)源-目的地对之间的多跳路由。我们将此模型在D2D中作为Markov决策过程(MDP)进行建模,并提出了一种分布式多主体强化学习路由算法。我们考虑带宽,延迟和丢包率方面对QoS的要求,并根据动态网络环境中随时间平均的链路信息分配路由路径。通过将Q函数分解为多个局部Q函数,每个代理都可以根据局部观测值计算自己的最佳策略,从而大大降低了在大型多状态系统中学习和搜索的成本。仿真结果表明,与最小跳算法和传统的仅考虑静态参数的路由算法相比,该算法可以显着降低平均端到端时延,平均丢包率和业务拒绝率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号