Distributed Reinforcement Learning for Quality-of-Service Routing in Wireless Device-to-device Networks

机译：无线设备到设备网络中的服务质量路由的分布式强化学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we aim to determine the multi-hop route between a device-to-device (D2D) source-destination pair which meets the quality-of-service (QoS) of services. We model this QoS routing problem in D2D as a Markov decision process (MDP) and proposes a distributed multi-agent reinforcement learning routing algorithm. We consider the QoS requirements in terms of bandwidth, delay, and packet loss rate, and allocate the routing path according to link information averaged over time in dynamic network environments. By decomposing the Q-function into multiple local Q-functions, each agent can compute its own optimal strategy based on local observations, which greatly reduces the costs of learning and searching in large-scale multi-state systems. The simulation results show that the proposed algorithm can significantly reduce the average end-to-end delay, the average packet loss rate and service rejection rate compared with both the minimum hop algorithm and the traditional routing algorithm which only considers static parameters.

机译：在本文中，我们旨在确定满足服务质量（QoS）的设备到设备（D2D）源－目的地对之间的多跳路由。我们将此模型在D2D中作为Markov决策过程（MDP）进行建模，并提出了一种分布式多主体强化学习路由算法。我们考虑带宽，延迟和丢包率方面对QoS的要求，并根据动态网络环境中随时间平均的链路信息分配路由路径。通过将Q函数分解为多个局部Q函数，每个代理都可以根据局部观测值计算自己的最佳策略，从而大大降低了在大型多状态系统中学习和搜索的成本。仿真结果表明，与最小跳算法和传统的仅考虑静态参数的路由算法相比，该算法可以显着降低平均端到端时延，平均丢包率和业务拒绝率。

著录项

来源
《IEEE/CIC International Conference on Communications in China》|2018年|282-286|共5页
会议地点
作者
Dongyu Liu; Zexu Li; Zeyu Hu; Yong Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Routing; Device-to-device communication; Quality of service; Heuristic algorithms; Loss measurement; Bandwidth; Delays;

机译：路由;设备间通信;服务质量;启发式算法;损耗测量;带宽;延迟;

相似文献

外文文献
中文文献
专利

1. 无线自组织网络中带宽约束的分布式按需组播路由协议 [J] . 余燕平, 倪玲玲, 郑元琰东南大学学报（英文版） . 2015,第001期
2. An energy-efficient distributed adaptive cooperative routing based on reinforcement learning in wireless multimedia sensor networks [J] . Wang Denghui, Liu Jian, Yao Dezhong Computer networks . 2020,第Sepa4期

机译：基于无线多媒体传感器网络中的增强学习的节能分布式自适应配合路由
3. Application of reinforcement learning to routing in distributed wireless networks: a review [J] . Al-Rawi Hasan A. A., Ng Ming Ann, Yau Kok-Lim Alvin Artificial Intelligence Review: An International Science and Engineering Journal . 2015,第3期

机译：强化学习在分布式无线网络路由中的应用：综述
4. Bayesian Reinforcement Learning-Based Coalition Formation for Distributed Resource Sharing by Device-to-Device Users in Heterogeneous Cellular Networks [J] . Alia Asheralieva IEEE transactions on wireless communications . 2017,第8期

机译：基于贝叶斯强化学习的联盟网络，用于异构蜂窝网络中设备到设备用户的分布式资源共享
5. Distributed Reinforcement Learning for Quality-of-Service Routing in Wireless Device-to-device Networks [C] . Dongyu Liu, Zexu Li, Zeyu Hu, IEEE/CIC International Conference on Communications in China . 2018

机译：用于无线设备到设备网络中的服务质量路由的分布式增强学习
6. Towards a framework for efficient resource allocation in wireless networks: Quality-of-service and distributed design. [D] . Li, Bin. 2014

机译：建立无线网络中有效资源分配的框架：服务质量和分布式设计。
7. A Trusted Routing Scheme Using Blockchain and Reinforcement Learning for Wireless Sensor Networks [O] . Jidian Yang, Shiwen He, Yang Xu, 2019

机译：无线传感器网络使用区块链和强化学习的可信路由方案
8. Reinforcement learning based routing for energy sensitive wireless mesh IoT networks [O] . Y. Liu, K.‐F. Tong, K.‐K. Wong 2019

机译：基于加强学习的能敏无线网状网络路由
9. Distributed Reinforcement Learning Scheme for Network Routing. [R] . Littman, M., Boyan, J. 1993

机译：分布式网络路由强化学习方案。

Distributed Reinforcement Learning for Quality-of-Service Routing in Wireless Device-to-device Networks

摘要

著录项

相似文献

相关主题

期刊订阅