Journal: IEEE Transactions on Cybernetics

Task-Based Decomposition of Factored POMDPs



Abstract

Recently, partially observable Markov decision process (POMDP) solvers have shown the ability to scale up significantly by exploiting domain structure, such as factored representations. In many domains, the agent is required to complete a set of independent tasks. We propose to decompose a factored POMDP into a set of restricted POMDPs over subsets of task-relevant state variables. We solve each such model independently, acquiring a value function. The combination of the value functions of the restricted POMDPs is then used to form a policy for the complete POMDP. We explain the process of identifying variables that correspond to tasks, and how to create a model restricted to a single task, or to a subset of tasks. We demonstrate our approach on a number of benchmarks from the factored POMDP literature, showing that our methods are applicable to models with more than 100 state variables.
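The policy-formation step described above — scoring each action with the value functions of the independently solved restricted POMDPs and acting greedily on their combination — can be sketched as follows. This is a minimal illustration, not the authors' implementation; the additive combination of per-task Q-values and all names (`combine_policies`, `task_q_functions`) are assumptions for the example.

```python
def combine_policies(task_q_functions, belief, actions):
    """Pick an action for the full POMDP from per-task value functions.

    task_q_functions: one Q(belief, action) callable per restricted POMDP;
    each restricted model evaluates actions using only its own
    task-relevant state variables. Here we assume the combination is a
    simple sum of per-task Q-values (an illustrative choice).
    """
    best_action, best_value = None, float("-inf")
    for a in actions:
        # Score the action by combining the value estimates of all tasks.
        total = sum(q(belief, a) for q in task_q_functions)
        if total > best_value:
            best_action, best_value = a, total
    return best_action


# Toy usage with two hand-written Q-functions (hypothetical values):
q_task1 = lambda b, a: 1.0 if a == "left" else 0.0
q_task2 = lambda b, a: 0.5  # indifferent between actions
action = combine_policies([q_task1, q_task2], belief=None,
                          actions=["left", "right"])
# "left" dominates because task 1 strictly prefers it.
```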
