首页> 外文会议>Conference on uncertainty in artificial intelligence >Hilbert Space Embeddings of Predictive State Representations
【24h】

Hilbert Space Embeddings of Predictive State Representations

机译:预测状态表示的希尔伯特空间嵌入

获取原文
获取外文期刊封面目录资料

摘要

Predictive State Representations (PSRs) are an expressive class of models for controlled stochastic processes. PSRs represent state as a set of predictions of future observable events. Because PSRs are defined entirely in terms of observable data, statistically consistent estimates of PSR parameters can be learned efficiently by manipulating moments of observed training data. Most learning algorithms for PSRs have assumed that actions and observations are finite with low cardinality. In this paper, we generalize PSRs to infinite sets of observations and actions, using the recent concept of Hilbert space embeddings of distributions. The essence is to represent the state as one or more nonparamet-ric conditional embedding operators in a Reproducing Kernel Hilbert Space (RKHS) and leverage recent work in kernel methods to estimate, predict, and update the representation. We show that these Hilbert space em-beddings of PSRs are able to gracefully handle continuous actions and observations, and that our learned models outperform competing system identification algorithms on several prediction benchmarks.
机译:预测状态表示(PSR)是用于控制随机过程的一种富有表现力的模型。 PSR将状态表示为对未来可观察事件的一组预测。由于PSR是完全根据可观察的数据定义的,因此可以通过操纵观察到的训练数据的时刻来有效地学习PSR参数的统计上一致的估计。大多数针对PSR的学习算法都假定动作和观察值是有限的,且基数较低。在本文中,我们使用分布的希尔伯特空间嵌入的最新概念,将PSR推广到无限的观察和行动集。本质是将状态表示为一个再生内核希尔伯特空间(RKHS)中的一个或多个非参数条件嵌入运算符,并利用内核方法中的最新工作来估计,预测和更新该表示。我们证明,这些PSR的希尔伯特空间嵌入能够很好地处理连续的动作和观察结果,并且我们的模型在几个预测基准上均优于竞争性系统识别算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号