Dynamic Bandwidth Allocation Scheme for Wireless Networks with Energy Harvesting Using Actor-Critic Deep Reinforcement Learning



Abstract

In this paper, we propose an efficient bandwidth allocation scheme for heterogeneous wireless networks with a single macro-cell base station (MBS) and several small-cell base stations (SBSs) that are powered by solar energy harvesters. We design an actor-critic deep reinforcement learning (RL) agent at the MBS (i.e., the main controller) to maximize the user satisfaction ratio and the energy efficiency of the network. The RL agent learns the stochastic arrivals of traffic requests and harvested energy through direct interaction with the network environment, and can thus obtain the optimal bandwidth allocation policy to enhance network sustainability and performance. To this end, we first formulate the bandwidth allocation problem as a Markov decision process and then employ the actor-critic RL algorithm to find the optimal bandwidth allocation policy. The actor and the critic of the RL agent use deep neural networks to approximate the policy function and the value function, respectively. More specifically, the actor generates an action based on the output of the policy network, while the critic helps the actor evaluate the policy by using the value network. Simulation results illustrate the performance of the proposed scheme.
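The actor-critic loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no state, action, or reward definitions, so the toy environment below (one SBS with a small battery, a random traffic demand, a reward of served demand minus a bandwidth-waste penalty) and every name and constant in it are assumptions made for illustration. For brevity it also swaps the paper's deep neural networks for tabular parameters: `theta` plays the role of the policy network and `v` the role of the value network.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical problem sizes (not from the paper).
N_BATTERY, N_DEMAND, N_ACTIONS = 4, 3, 3
N_STATES = N_BATTERY * N_DEMAND
GAMMA, LR_ACTOR, LR_CRITIC = 0.9, 0.05, 0.1


def encode(battery, demand):
    """Map a (battery level, traffic demand) pair to a state index."""
    return battery * N_DEMAND + demand


def step(battery, demand, action):
    """Toy dynamics: each served bandwidth unit drains one energy unit."""
    served = min(action, demand, battery)
    reward = served - 0.1 * action      # satisfaction minus waste penalty
    harvested = rng.integers(0, 2)      # 0 or 1 unit of harvested energy
    next_battery = min(battery - served + harvested, N_BATTERY - 1)
    next_demand = rng.integers(0, N_DEMAND)
    return next_battery, next_demand, reward


def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()


theta = np.zeros((N_STATES, N_ACTIONS))  # actor: policy logits per state
v = np.zeros(N_STATES)                   # critic: state-value estimates


def train(steps=5000):
    """One-step actor-critic: the critic's TD error drives both updates."""
    battery, demand = N_BATTERY - 1, 0
    for _ in range(steps):
        s = encode(battery, demand)
        probs = softmax(theta[s])
        a = rng.choice(N_ACTIONS, p=probs)       # actor samples an action
        battery, demand, r = step(battery, demand, a)
        s_next = encode(battery, demand)
        td_error = r + GAMMA * v[s_next] - v[s]  # critic evaluates the action
        v[s] += LR_CRITIC * td_error
        grad_log = -probs                        # gradient of log pi(a|s)
        grad_log[a] += 1.0                       # with respect to the logits
        theta[s] += LR_ACTOR * td_error * grad_log
    return theta, v
```

Calling `train()` and then `softmax(theta[encode(battery, demand)])` gives the learned allocation probabilities for a given state. Replacing the two tables with neural networks, as the abstract describes, would keep the same TD-error-driven update structure.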


Bibliographic record

Source
International Conference on Artificial Intelligence in Information and Communication | 2019 | 584 p. | 5 pages
Authors
Quang Vinh Do; Insoo Koo
Original format: PDF
Chinese Library Classification: Computing technology, computer technology
Keywords
Base stations; Channel allocation; Bandwidth; Reinforcement learning; Wireless networks; Energy harvesting; Stochastic processes


Similar literature


1. An efficient bandwidth allocation scheme for hierarchical cellular networks with energy harvesting: an actor-critic approach [J]. Quang Vinh Do, Van Hiep Vu, Koo Insoo. International Journal of Electronics. 2019, Issue 10a12
2. Actor-critic deep learning for efficient user association and bandwidth allocation in dense mobile networks with green base stations [J]. Quang Vinh Do, Koo Insoo. Wireless Networks. 2019, Issue 8
3. Deep Reinforcement Learning-based resource allocation strategy for Energy Harvesting-Powered Cognitive Machine-to-Machine Networks [J]. Xu Yi-Han, Tian Yong-Bo, Searyoh Prosper Komla. Computer Communications. 2020, Issue Jula
4. Dynamic Bandwidth Allocation Scheme for Wireless Networks with Energy Harvesting Using Actor-Critic Deep Reinforcement Learning [C]. Quang Vinh Do, Insoo Koo. The 1st International Conference on Artificial Intelligence in Information and Communication. 2019
5. Bandwidth allocation schemes in cellular and wireless local area networks [D]. Sun, Li-Hsiang. 2002
6. Reinforcement Learning (RL)-Based Energy Efficient Resource Allocation for Energy Harvesting-Powered Wireless Body Area Network [O]. Yi-Han Xu, Jing-Wei Xie, Yang-Gang Zhang. 2020
7. An Actor-Critic Reinforcement Learning Approach to Minimum Age of Information Scheduling in Energy Harvesting Networks [O]. Shiyang Leng, Aylin Yener. 2021
