...
首页> 外文期刊>IEICE Transactions on Communications >Policy Gradient SMDP for Resource Allocation and Routing in Integrated Services Networks
【24h】

Policy Gradient SMDP for Resource Allocation and Routing in Integrated Services Networks

机译:用于集成服务网络中资源分配和路由的策略梯度SMDP

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we solve the call admission control (CAC) and routing problem in an integrated network that handles several classes of calls of different values and with different resource requirements. The problem of maximizing the average reward (or cost) of admitted calls per unit time is naturally formulated as a semi-Markov Decision Process (SMDP) problem, but is too complex to allow for an exact solution. Thus in this paper, a policy gradient algorithm, together with a decomposition approach, is proposed to find the dynamic (state-dependent) optimal CAC and routing policy among a parameterized policy space. To implement that gradient algorithm, we approximate the gradient of the average reward. Then, we present a simulation-based algorithm to estimate the approximate gradient of the average reward (called GSMDP algorithm), using only a single sample path of the underlying Markov chain for the SMDP of CAC and routing problem. The algorithm enhances performance in terms of convergence speed, rejectionrn probability, robustness to the changing arrival statistics and an overall received average revenue. The experimental simulations will compare our method's performance with other existing methods and show the robustness of our method.
机译:在本文中,我们解决了一个集成网络中的呼叫准入控制(CAC)和路由问题,该网络可以处理具有不同值和不同资源需求的几类呼叫。自然地将最大化每单位时间允许的呼叫的平均报酬(或成本)的问题描述为半马尔可夫决策过程(SMDP)问题,但是它太复杂而无法提供精确的解决方案。因此,在本文中,提出了一种策略梯度算法以及一种分解方法,以在参数化策略空间中找到动态(取决于状态)的最佳CAC和路由策略。为了实现该梯度算法,我们近似平均奖励的梯度。然后,我们提出了一种基于仿真的算法来估计平均奖励的近似梯度(称为GSMDP算法),仅针对CAC的SMDP和路由问题使用底层马尔可夫链的单个样本路径。该算法在收敛速度,拒绝概率,对不断变化的到达统计数据的鲁棒性以及总体接收到的平均收入方面提高了性能。实验仿真将我们的方法的性能与其他现有方法进行比较,并显示了我们方法的鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号