首页> 外文学位 >Exploiting locality of interaction in networked distributed POMDPs: An empirical evaluation
【24h】

Exploiting locality of interaction in networked distributed POMDPs: An empirical evaluation

机译:利用网络分布式POMDP中交互的局部性:一项实证评估

获取原文
获取原文并翻译 | 示例

摘要

In many real-world multiagent applications such as distributed sensor nets, a network of agents is formed based on each agent's limited interactions with a small number of neighbors. While distributed POMDPs capture the real-world uncertainty in multiagent domains, they fail to exploit such locality of interaction. Distributed constraint optimization (DCOP) captures the locality of interaction but fails to capture planning under uncertainty. In previous works, Nair et al. presented a model synthesized from distributed POMDPs and DCOPs, called Networked Distributed POMDPs (ND-POMDPs). Also, they presented LID-JESP, a locally interacting distributed joint equilibrium-based search for policies: a distributed policy generation algorithm based on DBA (distributed breakout algorithm). In this thesis, I introduce two of its variants. The first variant, SLID-JESP is based on DSA (distributed stochastic algorithm) that allows neighboring agents to change their policies in the same cycle. I also introduce a technique called hyper-link-based decomposition that allows us to exploit locality of interaction further, resulting in faster run times for both LID-JESP and its stochastic variant without any loss in solution quality. Finally, I present a thorough experimental analysis of LID-JESP and its variants against benchmark algorithms that do not exploit locality of interaction. Furthermore, towards improving quality of the solutions, I provide a heuristic to select a better initial policy.
机译:在许多实际的多智能体应用程序(例如分布式传感器网络)中,基于每个智能体与少量邻居的有限交互,形成了智能体网络。尽管分布式POMDP捕获了多代理域中的实际不确定性,但它们无法利用这种交互性。分布式约束优化(DCOP)可以捕获交互的局部性,但不能捕获不确定性下的计划。在以前的工作中,Nair等人。提出了一种由分布式POMDP和DCOP合成的模型,称为网络分布式POMDP(ND-POMDP)。他们还提出了LID-JESP,这是一种基于本地交互的基于联合联合均衡的策略搜索:一种基于DBA的分布式策略生成算法(分布式突破算法)。在本文中,我介绍了它的两个变体。第一个变体SLID-JESP基于DSA(分布式随机算法),它允许相邻代理在同一周期内更改其策略。我还介绍了一种称为基于超链接的分解的技术,该技术使我们可以进一步利用交互的局部性,从而使LID-JESP及其随机变量的运行时间更快,而解决方案质量却没有任何损失。最后,我针对未利用交互局部性的基准算法对LID-JESP及其变体进行了全面的实验分析。此外,为了提高解决方案的质量,我提供了一种启发式方法,以选择更好的初始策略。

著录项

  • 作者

    Kim, Yoonheui.;

  • 作者单位

    University of Southern California.;

  • 授予单位 University of Southern California.;
  • 学科 Computer science.
  • 学位 M.S.
  • 年度 2006
  • 页码 46 p.
  • 总页数 46
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号