首页> 外文学位 >Exploiting domain structure in multiagent decision-theoretic planning and reasoning.
【24h】

Exploiting domain structure in multiagent decision-theoretic planning and reasoning.

机译:在多主体决策理论规划和推理中利用领域结构。

获取原文
获取原文并翻译 | 示例

摘要

This thesis focuses on decision-theoretic reasoning and planning problems that arise when a group of collaborative agents are tasked to achieve a goal that requires collective effort. The main contribution of this thesis is the development of effective, scalable and quality-bounded computational approaches for multiagent planning and coordination under uncertainty. This is achieved by a synthesis of techniques from multiple areas of artificial intelligence, machine learning and operations research. Empirically, each algorithmic contribution has been tested rigorously on common benchmark problems and, in many cases, real-world applications from machine learning and operations research literature.;The first part of the thesis addresses multiagent single-step decision making problems where a single joint-decision is required for the plan. We examine these decision-theoretic problems within the broad frameworks of distributed constraint optimization and Markov random fields. Such models succinctly capture the structure of interaction among different decision variables, which is subsequently exploited by algorithms to enhance scalability. The algorithms presented in this thesis are rigorously grounded on concepts from mathematical programming and optimization.;The second part of the thesis addresses multiagent sequential decision making problems under uncertainty and partial observability. We use the decentralized partially observable Markov decision processes (Dec-POMDPs) to formulate multiagent planning problems. To address the challenge of NEXP-Hard complexity and yet push the envelope of scalability, we represent the domain structure in a multiagent system using graphical models such as dynamic Bayesian networks and constraint networks. By exploiting such graphical planning representation in an algorithmic framework composed of techniques from different sub-areas of artificial intelligence, machine learning and operations research, we show impressive gains in increasing the scalability, the range of problems addressed and enabling quality-bounded solutions for multiagent decision theoretic planning.;Our contributions for sequential decision making include a) development of efficient dynamic programming algorithms for finite-horizon decision making, resulting in significantly increased scalability w.r.t. the number of agents and multiple orders-of-magnitude speedup over previous best approaches; b) development of probabilistic inference based algorithms for infinite-horizon decision making, resulting in new insights connecting inference techniques from the machine learning literature to multiagent systems; c) development of mathematical programming based scalable techniques for quality bounded solutions in multiagent systems, which has been considered intractable so far.;Several of our contributions are some of the first for the respective class of problems. For example, we show for the first time how machine learning is closely related to multiagent decision making via a maximum likelihood formulation of the planning problem. We develop new graphical models and machine learning based inference algorithms for large factored planning problems. We also show for the first time how the problem of optimizing agents' policies can be formulated as a compact mixed-integer program, resulting in optimal solution for a range of Dec-POMDP benchmarks.;In summary, we present a synthesis of different techniques from multiple sub-areas of AI, ML and OR to address the scalability and efficiency of algorithms for decision-theoretic reasoning and planning in multiagent systems. Such advances have already shown great promise to bridge the gap between multiagent systems and real-world applications.
机译:本文着重于决策理论推理和计划问题,这些问题是在一组协作代理受命实现需要集体努力的目标时出现的。本文的主要贡献是开发了有效,可扩展且有质量限制的计算方法,用于不确定性下的多主体规划和协调。这是通过综合来自人工智能,机器学习和运筹学等多个领域的技术来实现的。从经验上讲,每种算法的贡献都经过了通用基准问题的严格测试,并且在许多情况下,还来自机器学习和运筹学文献中的实际应用。;论文的第一部分解决了多智能体单步决策问题,其中单个联合-该计划需要决策。我们在分布式约束优化和马尔可夫随机域的广泛框架内研究了这些决策理论问题。这样的模型简洁地捕获了不同决策变量之间的交互结构,随后被算法利用以增强可伸缩性。本文提出的算法严格基于数学编程和优化的概念。本文的第二部分解决了不确定性和局部可观性下的多主体顺序决策问题。我们使用分散的可部分观察的马尔可夫决策过程(Dec-POMDPs)来制定多主体规划问题。为了解决NEXP-Hard复杂性的挑战并提高可扩展性,我们使用图形模型(例如动态贝叶斯网络和约束网络)在多主体系统中表示域结构。通过在由来自人工智能,机器学习和运筹学等不同子领域的技术组成的算法框架中利用这种图形化的规划表示形式,我们在增加可扩展性,解决的问题范围以及实现多代理的质量受限解决方案方面显示出令人瞩目的成就决策理论规划。;我们对顺序决策的贡献包括:a)开发用于有限水平决策的有​​效动态编程算法,从而显着提高可扩展性与以前的最佳方法相比,代理的数量和多个数量级的加速; b)开发基于概率推理的算法,用于无限水平决策,从而产生了新的见解,将推理技术从机器学习文献连接到多智能体系统; c)开发基于数学编程的可扩展技术以解决多主体系统中的质量有界解决方案,到目前为止,这被认为是棘手的。;我们的若干贡献是针对各个问题类别的第一批贡献。例如,我们首次展示了通过最大似然性规划问题,机器学习如何与多主体决策密切相关。我们针对大型因素规划问题开发了新的图形模型和基于机器学习的推理算法。我们还首次展示了如何将优化代理策略的问题描述为紧凑的混合整数程序,从而为一系列Dec-POMDP基准提供了最佳解决方案。总之,我们提出了各种技术的综合来自AI,ML和OR的多个子区域,以解决用于多智能体系统中的决策理论推理和计划的算法的可伸缩性和效率。这样的进步已经显示出弥合多代理系统与实际应用程序之间鸿沟的巨大希望。

著录项

  • 作者

    Kumar, Akshat.;

  • 作者单位

    University of Massachusetts Amherst.;

  • 授予单位 University of Massachusetts Amherst.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2013
  • 页码 207 p.
  • 总页数 207
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号