Communications, IET
Weighted cooperative reinforcement learning-based energy-efficient autonomous resource selection strategy for underlay D2D communication



Abstract

Underlay Device-to-Device (D2D) communication is a key technology for delivering high data rates, ultra-low latency, and high spectral and energy efficiency in 5G cellular networks. To achieve its full potential, however, optimal channel allocation and effective co-channel interference management must be accomplished. To address this challenge, we propose a multi-agent reinforcement learning-based autonomous channel selection scheme for D2D communication. The proposed scheme, Weighted Cooperative Q-Learning based Resource Selection (WCopQL-RS), allows a D2D pair to learn to autonomously select a channel from the available resources. The learning process of each D2D transmitter involves cooperation with neighboring D2D agents through the exchange of their latest Q-values. An additional parameter, called the cooperation range, determines the neighboring pairs whose Q-values can be used for learning the optimal policy. Because only this limited prior information is exchanged, the dimensions of each learning agent's Q-value matrix do not grow linearly when the number of D2D pairs within the cell is large. Although WCopQL-RS involves additional information exchange among agents compared to independent learning, it provides improved system throughput and convergence speed. Simulation results show that WCopQL-RS outperforms other existing schemes in terms of average D2D user throughput, energy consumption, and fairness.
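
To make the cooperative-learning idea concrete, the following is a minimal Python sketch of one possible weighted cooperative Q-update for autonomous channel selection with a distance-based cooperation range. The class name D2DAgent, the inverse-distance neighbor weights, the coop_weight blending factor, and the toy contention-based reward are illustrative assumptions; the abstract does not give the paper's exact update equations or SINR-based reward model.

```python
# Sketch of weighted cooperative Q-learning for autonomous channel selection.
# The weighting scheme and reward model below are assumptions for illustration,
# not the paper's exact WCopQL-RS formulation.
import math
import random


class D2DAgent:
    """One D2D transmitter learning which channel to use."""

    def __init__(self, agent_id, position, num_channels,
                 alpha=0.5, gamma=0.9, epsilon=0.1):
        self.agent_id = agent_id
        self.position = position          # (x, y) location of the transmitter
        self.num_channels = num_channels
        self.alpha = alpha                # learning rate
        self.gamma = gamma                # discount factor
        self.epsilon = epsilon            # exploration probability
        # Bandit-style Q-table: one value per candidate channel.
        self.q = [0.0] * num_channels

    def select_channel(self):
        """Epsilon-greedy channel selection over the local Q-values."""
        if random.random() < self.epsilon:
            return random.randrange(self.num_channels)
        return max(range(self.num_channels), key=lambda c: self.q[c])

    def update(self, channel, reward):
        """Standard Q-learning update for the chosen channel."""
        td_target = reward + self.gamma * max(self.q)
        self.q[channel] += self.alpha * (td_target - self.q[channel])

    def cooperate(self, agents, coop_range, coop_weight=0.3):
        """Blend in Q-values from neighbors inside the cooperation range;
        closer neighbors get larger weights (an assumed weighting scheme)."""
        in_range = [a for a in agents
                    if a is not self and self._distance(a) <= coop_range]
        if not in_range:
            return
        # Inverse-distance weights, normalized to sum to one.
        raw = [1.0 / (1.0 + self._distance(a)) for a in in_range]
        total = sum(raw)
        weights = [w / total for w in raw]
        for c in range(self.num_channels):
            neighbor_avg = sum(w * a.q[c] for w, a in zip(weights, in_range))
            self.q[c] = (1 - coop_weight) * self.q[c] + coop_weight * neighbor_avg

    def _distance(self, other):
        dx = self.position[0] - other.position[0]
        dy = self.position[1] - other.position[1]
        return math.hypot(dx, dy)


if __name__ == "__main__":
    random.seed(0)
    agents = [D2DAgent(i, (random.uniform(0, 100), random.uniform(0, 100)),
                       num_channels=4) for i in range(6)]
    for episode in range(200):
        choices = {a.agent_id: a.select_channel() for a in agents}
        for a in agents:
            ch = choices[a.agent_id]
            # Toy reward: full reward on an uncontended channel, reduced when
            # other pairs reuse it (a crude co-channel interference proxy).
            sharers = sum(1 for c in choices.values() if c == ch) - 1
            a.update(ch, reward=1.0 / (1.0 + sharers))
        for a in agents:
            a.cooperate(agents, coop_range=30.0)
```

In this sketch the exchanged information is limited to the neighbors' Q-value vectors, so each agent's table keeps a fixed size of num_channels entries regardless of how many D2D pairs are in the cell, which mirrors the scalability argument made in the abstract.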
