首页> 外文会议>Algorithmic learning theory >Toward a Classification of Finite Partial-Monitoring Games
【24h】

Toward a Classification of Finite Partial-Monitoring Games

机译:走向有限局部监控游戏的分类

获取原文
获取原文并翻译 | 示例

摘要

In a finite partial-monitoring game against Nature, the Learner repeatedly chooses one of finitely many actions, the Nature responds with one of finitely many outcomes, the Learner suffers a loss and receives feedback signal, both of which are fixed functions of the action and the outcome. The goal of the Learner is to minimize its total cumulative loss. We make progress towards classification of these games based on their minimax expected regret. Namely, we classify almost all games with two outcomes: We show that their minimax expected regret is either zero, ?(T~(1/2)), ?(T~(2/3)), or G(T) and we give a simple and efficiently computable classification of these four classes of games. Our hope is that the result can serve as a stepping stone toward classifying all finite partial-monitoring games.
机译:在针对自然的有限局部监控游戏中,学习者反复选择有限的多个动作之一,自然以有限的多个结果之一做出响应,学习者遭受损失并收到反馈信号,这两者都是该行为的固定功能,结果。学习者的目标是最大程度地减少其累计损失。我们根据这些游戏的预期最大后悔程度在分类这些游戏方面取得了进展。也就是说,我们对几乎所有具有以下结果的游戏进行了分类:我们证明了它们的极小极大期望后悔是零,?(T〜(1/2)),?(T〜(2/3))或G(T)以及我们对这四类游戏进行了简单有效的分类。我们希望结果可以作为对所有有限的局部监控游戏进行分类的垫脚石。

著录项

  • 来源
    《Algorithmic learning theory》|2010年|p.224-238|共15页
  • 会议地点 Canberra(AU);Canberra(AU)
  • 作者单位

    Department of Computing Science, University of Alberta, Canada;

    Department of Computing Science, University of Alberta, Canada;

    Department of Computing Science, University of Alberta, Canada;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 人工智能理论;
  • 关键词

  • 入库时间 2022-08-26 13:58:04

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号