首页> 外文会议>IEEE Data Science Workshop >Learning Partially Observable Markov Decision Processes Using Coupled Canonical Polyadic Decomposition
【24h】

Learning Partially Observable Markov Decision Processes Using Coupled Canonical Polyadic Decomposition

机译:使用耦合规范多态分解学习部分可观察的马尔可夫决策过程

获取原文

摘要

We propose a new algorithm for learning the model parameters of a partially observable Markov decision process (POMDP) based on coupled canonical polyadic decomposition (CPD). Coupled CPD for a set of tensors is an extension to CPD for individual tensors, which has improved identifiability properties, as well as an analogous simultaneous diagonalization (SD) algorithm for uniquely recovering the latent factors efficiently. We explain how to form a set of three-way tensors from the trajectory of a POMDP under a stationary memoryless policy, so that coupled CPD can be applied afterwards to recover the model parameters, with identifiability and computational guarantees.
机译:我们提出了一种新的算法,用于基于耦合规范多Adadic分解(CPD)的部分可观察的马尔可夫决策过程(POMDP)的模型参数。一组张量的耦合CPD是单个张量的CPD的扩展,它具有改进的可识别性,以及类似的同时对角化(SD)算法,可以有效地唯一地恢复潜在因子。我们解释了如何在平稳的无记忆策略下从POMDP的轨迹形成一组三张量,以便随后可以使用耦合CPD恢复模型参数,并且具有可识别性和计算保证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号