首页> 外文会议>2011 49th Annual Allerton Conference on Communication, Control, and Computing >Online fractional programming for Markov decision systems
【24h】

Online fractional programming for Markov decision systems

机译:马尔可夫决策系统的在线分数规划

获取原文

摘要

We consider a system with K states which operates over frames with different lengths. Every frame, the controller observes a new random event and then chooses a control action based on this observation. The current state, random event, and control action together affect: (i) the frame size, (ii) a vector of penalties incurred over the frame, and (iii) the transition probabilities to the next state visited at the end of the frame. The goal is to minimize the time average of one penalty subject to time average constraints on the others. This problem has applications to task scheduling in computer systems and wireless networks, where each task can take a different amount of time and may change the state of the network. An example is energy-optimal scheduling in a system with several energy-saving transmission modes, where transitions to a different mode incur energy and/or delay penalties. We pose the problem as a stochastic linear fractional program and present an online Lyapunov drift method for solving it. For large classes of problems, the solution can be implemented without any knowledge of the random event probabilities.
机译:我们考虑一个具有K个状态的系统,该系统在不同长度的帧上运行。在每一帧中,控制器都会观察到一个新的随机事件,然后根据该观察结果选择一个控制动作。当前状态,随机事件和控制动作共同影响:(i)帧大小,(ii)在帧上产生的惩罚向量,以及(iii)到帧末尾访问的下一个状态的转移概率。目的是使一种惩罚的时间平均最小化,但要受另一种惩罚的时间平均约束。此问题已应用于计算机系统和无线网络中的任务调度,其中每个任务可能花费不同的时间,并且可能会更改网络的状态。一个示例是在具有几种节能传输模式的系统中进行能量优化调度,其中转换到不同的模式会产生能量和/或延迟惩罚。我们将该问题提出为随机线性分数程序,并提出了在线Lyapunov漂移方法进行求解。对于大类问题,可以在不了解随机事件概率的情况下实施解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号