
Value-Directed Belief State Approximation for POMDPs


Abstract

We consider the problem of belief-state monitoring for the purposes of implementing a policy for a partially observable Markov decision process (POMDP), specifically how one might approximate the belief state. Other schemes for belief-state approximation (e.g., based on minimizing a measure such as KL-divergence between the true and estimated state) are not necessarily appropriate for POMDPs. Instead, we consider approximation quality as determined by the expected error in utility rather than by the error in the belief state itself. We propose heuristic methods for finding good projection schemes for belief state estimation, exhibiting anytime characteristics, given a POMDP value function. We also describe several algorithms for constructing bounds on the error in decision quality (expected utility) associated with acting in accordance with a given belief state approximation.
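The abstract's core idea can be illustrated with a minimal sketch. Assuming the POMDP value function is represented by a set of alpha-vectors (so V(b) = max over actions of the dot product of the action's alpha-vector with belief b), the loss in expected utility from acting greedily on an approximate belief is bounded by twice the largest discrepancy any alpha-vector sees between the true and approximate beliefs. The projection used below (keep the k most probable states and renormalize) is a simplified stand-in, not the paper's value-directed projection scheme:

```python
import numpy as np

def project_belief(b, k):
    """Illustrative projection: keep the k largest-probability states
    and renormalize. (A stand-in for a value-directed projection.)"""
    b_hat = np.zeros_like(b)
    top = np.argsort(b)[-k:]
    b_hat[top] = b[top]
    return b_hat / b_hat.sum()

def decision_quality_bound(alphas, b, b_hat):
    """Bound on lost expected utility from acting greedily on b_hat:
    V(b) - alpha_chosen . b <= 2 * max_a |alpha_a . (b - b_hat)|."""
    return 2.0 * max(abs(a @ (b - b_hat)) for a in alphas)

def actual_loss(alphas, b, b_hat):
    """Utility lost by choosing the action that looks best under b_hat."""
    v_true = max(a @ b for a in alphas)
    chosen = max(alphas, key=lambda a: a @ b_hat)
    return v_true - chosen @ b

# Hypothetical 4-state, 3-action example.
alphas = [np.array([1.0, 0.2, 0.1, 0.0]),
          np.array([0.0, 0.9, 0.3, 0.2]),
          np.array([0.1, 0.1, 0.8, 0.9])]
b = np.array([0.5, 0.3, 0.15, 0.05])
b_hat = project_belief(b, 2)
loss = actual_loss(alphas, b, b_hat)
bound = decision_quality_bound(alphas, b, b_hat)
```

The bound follows from a standard three-way split of V(b) minus the achieved utility; it holds for any projection, which is why the paper can search over projection schemes that make it small for a given value function.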
