首页> 外文会议>AAAI Conference on Artificial Intelligence >Compact Spectral Bases for Value Function Approximation Using Kronecker Factorization
【24h】

Compact Spectral Bases for Value Function Approximation Using Kronecker Factorization

机译:使用Kronecker分解的值函数近似的紧凑型光谱碱基

获取原文

摘要

A new spectral approach to value function approximation has recently been proposed to automatically construct basis functions from samples. Global basis functions called proto-value functions are generated by di-agonalizing a diffusion operator, such as a reversible random walk or the Laplacian, on a graph formed from connecting nearby samples. This paper addresses the challenge of scaling this approach to large domains. We propose using Kronecker factorization coupled with the Metropolis-Hastings algorithm to decompose reversible transition matrices. The result is that the basis functions can be computed on much smaller matrices and combined to form the overall bases. We demonstrate that in several continuous Markov decision processes, compact basis functions can be constructed without significant loss in performance. In one domain, basis functions were compressed by a factor of 36. A theoretical analysis relates the quality of the approximation to the spectral gap. Our approach generalizes to other basis constructions as well.
机译:最近已经提出了一种新的值函数近似的谱方法,以自动构建来自样本的基本功能。通过Di-Anonalizing在由连接附近样品形成的曲线图上的漫射算子(例如可逆随机步道或拉普拉斯)等漫射算子而产生的全局基础函数。本文涉及将这种方法缩放到大域的挑战。我们建议使用与Metropolis-Hastings算法耦合的Kronecker分解,以分解可逆转换矩阵。结果是可以在更小的矩阵上计算基本函数并组合以形成整体基础。我们证明,在若干连续马尔可夫决策过程中,可以构建紧凑的基本功能,而无需表现显着损失。在一个域中,基本函数被压缩为36倍。理论分析将近似的质量与光谱间隙相关。我们的方法也推广到其他基础结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号