
Minimax and Bayes strategies for the discounted infinite horizon one-armed-bandit: explicit formulae and structural properties



Abstract

Summary. The discounted infinite horizon One-Armed-Bandit is considered. Neglecting overshoot, it is shown that the least favourable prior distribution has support on only two points. It is further shown that the minimax strategy can be described by a straight-line boundary: observations are taken as long as the test statistic stays above this boundary, and the procedure is stopped when the boundary is crossed. The minimax strategy for the normal case is given explicitly, together with the corresponding minimax value. It is shown that a large class of distributions has the same asymptotic strategy and the same asymptotic minimax value as the normal one for a ?1
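
The stopping rule described in the summary (sample while the test statistic stays above a straight-line boundary, stop once it is crossed) can be illustrated with a small simulation. This is only a sketch under stated assumptions: the test statistic is taken to be the running sum of observations, the unknown arm is normal with unit variance, and the known arm pays zero. The function name and the `intercept`, `slope`, and `discount` parameters are hypothetical; the explicit minimax boundary from the paper is not reproduced here.

```python
import random


def run_linear_boundary_rule(theta, intercept, slope, discount,
                             max_steps=10_000, seed=0):
    """Simulate one run of a straight-line stopping rule.

    Assumptions (not taken from the paper): observations are N(theta, 1),
    the test statistic is the running sum, and the boundary after n
    observations is intercept + slope * n.
    Returns (discounted_reward, number_of_observations).
    """
    rng = random.Random(seed)
    s = 0.0        # running sum of observations (the test statistic)
    total = 0.0    # discounted reward collected from the unknown arm
    weight = 1.0   # discount ** n, the weight of the next observation
    n = 0
    # Keep sampling only while the statistic is strictly above the line.
    while n < max_steps and s > intercept + slope * n:
        x = rng.gauss(theta, 1.0)
        s += x
        total += weight * x
        weight *= discount
        n += 1
    return total, n


if __name__ == "__main__":
    reward, steps = run_linear_boundary_rule(
        theta=0.2, intercept=-2.0, slope=0.0, discount=0.95)
    print(f"stopped after {steps} observations, discounted reward {reward:.3f}")
```

Averaging the returned reward over many seeds and several values of `theta` gives a rough feel for how a given line trades early stopping against lost reward. The `max_steps` cap only keeps the simulation finite.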

Bibliographic record

  • Source
    Statistics | 1986, No. 2 | pp. 249-260
  • Author

    Peter Reimnitz

  • Indexed in: Science Citation Index (SCI)
  • Original format: PDF
  • Language: English
  • Keywords

    One-Armed-Bandit