International Journal of Communication Systems

An efficient actor-critic reinforcement learning for device-to-device communication underlaying sectored cellular network


Abstract

In this paper, a novel reinforcement learning (RL) approach with cell sectoring is proposed to solve the channel and power allocation problem for a device-to-device (D2D)-enabled cellular network when prior traffic information is not known to the base station (BS). Further, this paper derives an optimal policy for resource and power allocation between users with the aim of maximizing the sum-rate of the overall system. Since the behavior of the wireless channel and the traffic requests of users are stochastic in nature, the dynamic property of the environment allows us to employ an actor-critic RL technique that learns the best policy through continuous interaction with its surroundings. The proposed work comprises four phases: cell splitting, clustering, a queuing model, and simultaneous channel and power allocation using actor-critic RL. The implementation of cell splitting with the novel clustering technique increases network coverage, reduces co-channel cell interference, and minimizes the transmission power of nodes, whereas the queuing model addresses the waiting time of users in priority-based data transmission. With the help of a continuous state-action space, the policy-gradient-based actor-critic RL algorithm improves the overall system sum-rate as well as the D2D throughput. The actor adopts a parameterized stochastic policy to output continuous actions, while the critic estimates the value of the policy and criticizes the actor's actions; this reduces the high variance of the policy gradient. Through numerical simulations, the benefit of our resource sharing scheme over existing traditional schemes is verified.
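The actor-critic idea summarized above can be sketched in a few lines. The following is a minimal single-state toy with a continuous action, showing how a learned critic baseline reduces the variance of the policy-gradient update; the quadratic reward is a hypothetical stand-in for the paper's sum-rate objective, and all names and constants are illustrative, not the paper's system model.

```python
import numpy as np

# Minimal one-state actor-critic sketch with a continuous action.
# The reward below is a toy proxy, NOT the paper's sum-rate model.

rng = np.random.default_rng(0)

def reward(power):
    """Hypothetical 'sum-rate' proxy, maximized at a transmit power of 2.0."""
    return -(power - 2.0) ** 2

theta = 0.0                      # actor: mean of a Gaussian policy over power
sigma = 0.5                      # fixed exploration standard deviation
value = 0.0                      # critic: baseline estimate of expected reward
alpha_actor, alpha_critic = 0.05, 0.1

for _ in range(2000):
    action = theta + sigma * rng.standard_normal()  # sample a continuous action
    r = reward(action)
    advantage = r - value                # critic's baseline replaces raw return
    value += alpha_critic * advantage    # critic update (running baseline)
    # Actor update: score function of the Gaussian policy w.r.t. its mean.
    grad_log_pi = (action - theta) / sigma**2
    theta += alpha_actor * advantage * grad_log_pi

print(f"learned mean power: {theta:.2f}")
```

Subtracting the critic's `value` from the sampled reward before the actor update is exactly the variance-reduction mechanism the abstract attributes to the critic; with the raw reward alone, the gradient estimate would fluctuate far more per step.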
