...
首页> 外文期刊>Theory of probability and its applications >GITTINS INDEX FOR SIMPLE FAMILY OF MARKOV BANDIT PROCESSES WITH SWITCHING COST AND NO DISCOUNTING
【24h】

GITTINS INDEX FOR SIMPLE FAMILY OF MARKOV BANDIT PROCESSES WITH SWITCHING COST AND NO DISCOUNTING

机译:Gittins Index用于简单的Markov Birtit流程,具有切换成本,没有折扣

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We consider the multiarmed bandit problem (the problem of Markov bandits) with switching penalties and no discounting in case when state spaces of all bandits are finite. An optimal strategy should have the largest average reward per unit time on an infinite time horizon. For this problem it is shown that an optimal strategy can be specified by a Gittins index under the natural assumption that the switching penalties are nonnegative.
机译:我们考虑复制的强盗问题(Markov Birits的问题),在所有匪徒的状态空间是有限的情况下,如果所有匪徒的状态空间都没有折扣。 最佳策略应在无限时间范围内每单位时间具有最大的平均奖励。 对于这个问题,结果表明,在自然假设下,可以通过Gittins指数指定最佳策略,即切换惩罚是非负面的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号