Resource Allocation and Power Control Policy for Device-to-Device Communication Using Multi-Agent Reinforcement Learning

Yifei Wei; Yinxiang Qu; Min Zhao; Lianping Zhang; F.Richard Yu

首页> 中文期刊>计算机、材料和连续体(英文) >Resource Allocation and Power Control Policy for Device-to-Device Communication Using Multi-Agent Reinforcement Learning

Resource Allocation and Power Control Policy for Device-to-Device Communication Using Multi-Agent Reinforcement Learning

开具论文收录证明 >>

期刊封面封底目录下载 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Device-to-Device(D2D)communication is a promising technology that can reduce the burden on cellular networks while increasing network capacity.In this paper,we focus on the channel resource allocation and power control to improve the system resource utilization and network throughput.Firstly,we treat each D2D pair as an independent agent.Each agent makes decisions based on the local channel states information observed by itself.The multi-agent Reinforcement Learning(RL)algorithm is proposed for our multi-user system.We assume that the D2D pair do not possess any information on the availability and quality of the resource block to be selected,so the problem is modeled as a stochastic non-cooperative game.Hence,each agent becomes a player and they make decisions together to achieve global optimization.Thereby,the multi-agent Q-learning algorithm based on game theory is established.Secondly,in order to accelerate the convergence rate of multi-agent Q-learning,we consider a power allocation strategy based on Fuzzy C-means(FCM)algorithm.The strategy firstly groups the D2D users by FCM,and treats each group as an agent,and then performs multi-agent Q-learning algorithm to determine the power for each group of D2D users.The simulation results show that the Q-learning algorithm based on multi-agent can improve the throughput of the system.In particular,FCM can greatly speed up the convergence of the multi-agent Q-learning algorithm while improving system throughput.

著录项

来源
《计算机、材料和连续体(英文)》|2020年第6期|1515-1532|共18页
作者
Yifei Wei; Yinxiang Qu; Min Zhao; Lianping Zhang; F.Richard Yu;
展开▼
作者单位

Beijing Key Laboratory of Work Safety Intelligent Monitoring,Beijing University of Posts and Telecommunications,Beijing,100876,China;

Alibaba Cloud Computing,Hangzhou,311121,China;

Department of Systems and Computer Engineering,Carleton University,Ottawa,K1S 5B6,Canada;

展开▼
原文格式 PDF
正文语种 chi
中图分类 TN9;
关键词
D2D communication; resource allocation; power control; multi-agent; Q-learning; fuzzy C-means;
入库时间 2022-08-21 05:18:42

相似文献

中文文献
外文文献

1. A Robust Resource Allocation Scheme for Device-to-Device Communications Based on Q-Learning [J] . Azka Amin ,Xihua Liu ,Imran Khan . 计算机、材料和连续体(英文) . 2020,第11期
2. Multi-Agent Reinforcement Learning for Resource Allocation in IoT Networks with Edge Computing [J] . Xiaolan Liu ,Jiadong Yu ,Zhiyong Feng . 中国通信 . 2020,第009期
3. Joint Optimization of Channel Allocation, Link Assignment and Power Control for Device-to-Device Communication Underlaying Cellular Network [J] . TANG Rui ,ZHAO Jihong ,QU Hua . 中国通信 . 2015,第012期
4. Multi-Objective Deep Reinforcement Learning Based Time-Frequency Resource Allocation for Multi-Beam Satellite Communications [J] . Yuanzhi He ,Biao Sheng ,Hao Yin . 中国通信:英文版 . 2022,第1期
5. A Joint Power and Bandwidth Allocation Method Based on Deep Reinforcement Learning for V2V Communications in 5G [J] . Xin Hu ,Sujie Xu ,Libing Wang . 中国通信 . 2021,第007期
6. A Radio Resource Allocation Strategy in Future Wireless Communication Systems [A] . ZIRARUSHYA Pierre Celestin . 2008

Resource Allocation and Power Control Policy for Device-to-Device Communication Using Multi-Agent Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅