首页> 外文会议>AAAI Conference on Artificial Intelligence >An Empirical Study on the Practical Impact of Prior Beliefs over Policy Types
【24h】

An Empirical Study on the Practical Impact of Prior Beliefs over Policy Types

机译:对政策类型的现有信念实际影响的实证研究

获取原文

摘要

Many multiagent applications require an agent to learn quickly how to interact with previously unknown other agents. To address this problem, researchers have studied learning algorithms which compute posterior beliefs over a hypothesised set of policies, based on the observed actions of the other agents. The posterior belief is complemented by the prior belief, which specifies the subjective likelihood of policies before any actions are observed. In this paper, we present the first comprehensive empirical study on the practical impact of prior beliefs over policies in repeated interactions. We show that prior beliefs can have a significant impact on the long-term performance of such methods, and that the magnitude of the impact depends on the depth of the planning horizon. Moreover, our results demonstrate that automatic methods can be used to compute prior beliefs with consistent performance effects. This indicates that prior beliefs could be eliminated as a manual parameter and instead be computed automatically.
机译:许多多应用应用程序要求代理商快速学习如何与先前未知的其他代理进行交互。为了解决这个问题,研究人员已经研究了学习算法,这些算法根据观察到的其他代理的观察到的行动,在假设的一系列政策中计算了后视信念。后方信仰由先前的信仰补充,这指出了在观察到任何行动之前政策的主观可能性。在本文中,我们提出了第一个全面的实证研究对反复互动中的先前信仰对政策的实际影响。我们表明,先前的信仰可能对这些方法的长期性能产生重大影响,并且影响的幅度取决于规划地平线的深度。此外,我们的结果表明,自动方法可用于计算以一致的性能效果来计算现有信念。这表明可以消除现有信念作为手动参数,而是自动计算。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号