首页> 外文会议>International Conference on Artificial Intelligence in Information and Communication >Dynamic Bandwidth Allocation Scheme for Wireless Networks with Energy Harvesting Using Actor-Critic Deep Reinforcement Learning
【24h】

Dynamic Bandwidth Allocation Scheme for Wireless Networks with Energy Harvesting Using Actor-Critic Deep Reinforcement Learning

机译:使用演员批评深度加强学习的能量收集无线网络动态带宽分配方案

获取原文

摘要

In this paper, we propose an efficient bandwidth allocation scheme in heterogeneous wireless networks with a single macro-cell base station (MBS) and several small-cell base stations (SBSs) that are powered by solar energy harvesters. This paper aims to design an actor-critic deep reinforcement learning (RL) agent at the MBS (i.e. the main controller) with the purpose of maximizing user satisfaction ratio and energy efficiency in the network. The RL agent learns the stochastic arrivals of traffic requests and harvested energy through direct interaction with the network environment and thus can obtain the optimal bandwidth allocation policy in order to enhance network sustainability and performance. For this purpose, we first formulate the bandwidth allocation problem as the framework of a Markov decision process, and then, employ the actor-critic RL algorithm to find the optimal policy for bandwidth allocation. The actor and the critic of the RL agent use deep neural network to approximate the policy function and the value function, respectively. More specifically, the actor generates action based on the output of the policy network while the critic helps the actor evaluate the policy by using the value network. Simulation results are shown to illustrate the performance of the proposed scheme.
机译:在本文中,我们提出在异构无线网络与单个宏小区基站(MBS)和由太阳能供电的收割机几个小小区基站(SBSS)的有效带宽分配方案。本文旨在设计一种演员评论家深强化学习(RL)在MBS与网络中的最大化用户满意率和能量效率的目的剂(即主控制器)。该RL代理获悉流量请求的随机到达和通过与网络环境的直接交互的收集的能量,从而能够获得,以提高网络的可持续性和性能优化带宽分配策略。为此,我们首先制定了带宽分配问题作为一个马尔可夫决策过程的框架,然后,聘请演员评论家RL算法来寻找带宽分配的最优策略。演员和RL剂使用深层神经网络的评论家逼近政策功能和价值功能,分别。更具体地讲,演员基于政策网络的输出动作,而评论家帮助演员评估政策使用的价值网络。模拟结果显示了所提方案的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号