首页> 外文会议>International joint conference on artificial intelligence;IJCAI-09 >Speeding Up Exact Solutions of Interactive Dynamic Influence Diagrams Using Action Equivalence
【24h】

Speeding Up Exact Solutions of Interactive Dynamic Influence Diagrams Using Action Equivalence

机译:使用动作等效性加快交互式动态影响图的精确解

获取原文

摘要

Interactive dynamic influence diagrams (I-DIDs) are graphical models for sequential decision making in partially observable settings shared by other agents. Algorithms for solving I-DIDs face the challenge of an exponentially growing space of candidate models ascribed to other agents, over time. Previous approach for exactly solving I-DIDs groups together models having similar solutions into behaviorally equivalent classes and updates these classes. We present a new method that, in addition to aggregating behaviorally equivalent models, further groups models that prescribe identical actions at a single time step. We show how to update these augmented classes and prove that our method is exact. The new approach enables us to bound the aggregated model space by the cardinality of other agents' actions. We evaluate its performance and provide empirical results in support.
机译:交互式动态影响图(I-DID)是用于在其他代理共享的部分可观察设置中进行顺序决策的图形模型。随着时间的流逝,用于解决I-DID的算法面临着归因于其他代理的候选模型呈指数增长的空间的挑战。之前用于精确解决I-DID的方法将具有类似解决方案的模型组合在一起,成为行为等效的类,并更新这些类。我们提出了一种新方法,除了汇总行为等效的模型外,还可以对在单个时间步规定相同动作的模型进行进一步分组。我们展示了如何更新这些扩充的类并证明我们的方法是正确的。这种新方法使我们能够通过其他代理动作的基数来约束聚合模型空间。我们评估其性能并提供实证结果以提供支持。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号