首页> 外文会议>Information Theory and Applications Workshop >Max Weight Learning Algorithms with Application to Scheduling in Unknown Environments
【24h】

Max Weight Learning Algorithms with Application to Scheduling in Unknown Environments

机译:最大重量学习算法,应用于未知环境中的调度

获取原文

摘要

We consider a discrete time stochastic queueing system where a controller makes a 2-stage decision every slot. The decision at the first stage reveals a hidden source of randomness with a control-dependent (but unknown) probability distribution. The decision at the second stage incurs a penalty vector that depends on this revealed randomness. The goal is to stabilize all queues and minimize a convex function of the time average penalty vector subject to an additional set of time average penalty constraints. This setting fits a wide class of stochastic optimization problems. This includes problems of opportunistic scheduling in wireless networks, where a 2-stage decision about channel measurement and packet transmission must be made every slot without knowledge of the underlying transmission success probabilities. We develop a simple max-weight algorithm that learns efficient behavior by averaging functionals of previous outcomes. The algorithm yields performance that can be pushed arbitrarily close to optimal, with a tradeoff in convergence time and delay.
机译:我们考虑一个离散时间随机排队系统,其中控制器使每个插槽的2阶段决定。在第一阶段的决定揭示了隐藏的随机性来源,与控制相关的(但未知)概率分布。第二阶段的决定会引发一份惩罚载体,这取决于这一揭示的随机性。目标是稳定所有队列并最大限度地减少时间平均惩罚向量的凸起函数,这对其进行了一系列时间平均惩罚约束。此设置适合广泛的随机优化问题。这包括在无线网络中的机会主义调度问题,其中必须在每个时隙的情况下进行关于信道测量和分组传输的2阶段决定,而不知道底层传输成功概率。我们开发了一个简单的最大重量算法,通过平均先前结果的函数来学习高效行为。该算法产生的性能可以随意推动到最佳状态,在收敛时间和延迟中的权衡。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号