【24h】

Compact Spectral Bases for Value Function Approximation Using Kronecker Factorization

机译:使用Kronecker因子分解的值函数逼近的紧凑谱库

获取原文
获取原文并翻译 | 示例

摘要

A new spectral approach to value function approximation has recently been proposed to automatically construct basis functions from samples. Global basis functions called proto-value functions are generated by di-agonalizing a diffusion operator, such as a reversible random walk or the Laplacian, on a graph formed from connecting nearby samples. This paper addresses the challenge of scaling this approach to large domains. We propose using Kronecker factorization coupled with the Metropolis-Hastings algorithm to decompose reversible transition matrices. The result is that the basis functions can be computed on much smaller matrices and combined to form the overall bases. We demonstrate that in several continuous Markov decision processes, compact basis functions can be constructed without significant loss in performance. In one domain, basis functions were compressed by a factor of 36. A theoretical analysis relates the quality of the approximation to the spectral gap. Our approach generalizes to other basis constructions as well.
机译:最近,有人提出了一种新的频谱方法来近似值函数,以从样本中自动构建基函数。通过对角化扩散算子(例如可逆随机游走或拉普拉斯算子)在对角线附近生成的图形上生成的全局基础函数,称为原型值函数。本文解决了将这种方法扩展到大域的挑战。我们建议结合Kronecker分解和Metropolis-Hastings算法来分解可逆转换矩阵。结果是,可以在小得多的矩阵上计算基函数,并将其组合以形成整体基。我们证明,在几个连续的马尔可夫决策过程中,可以构造紧凑的基函数而不会显着降低性能。在一个域中,基函数被压缩了36倍。理论分析将逼近的质量与光谱间隙相关联。我们的方法也可以推广到其他基础结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号