首页> 外文会议>Annual Conference on Neural Information Processing Systems(NIPS); 20051205-10; British Columbia(CA) >Value Function Approximation with Diffusion Wavelets and Laplacian Eigenfunctions
【24h】

Value Function Approximation with Diffusion Wavelets and Laplacian Eigenfunctions

机译:扩散小波和拉普拉斯特征函数的值函数逼近

获取原文
获取原文并翻译 | 示例

摘要

We investigate the problem of automatically constructing efficient representations or basis functions for approximating value functions based on analyzing the structure and topology of the state space. In particular, two novel approaches to value function approximation are explored based on automatically constructing basis functions on state spaces that can be represented as graphs or manifolds: one approach uses the eigen-functions of the Laplacian, in effect performing a global Fourier analysis on the graph; the second approach is based on diffusion wavelets, which generalize classical wavelets to graphs using multiscale dilations induced by powers of a diffusion operator or random walk on the graph. Together, these approaches form the foundation of a new generation of methods for solving large Markov decision processes, in which the underlying representation and policies are simultaneously learned.
机译:我们在分析状态空间的结构和拓扑的基础上,研究了自动构造有效的表示形式或基函数以逼近值函数的问题。特别是,基于在状态空间上自动构建可表示为图形或流形的状态函数的基础函数,探索了两种新颖的值函数逼近方法:一种方法使用拉普拉斯算子的本征函数,实际上对拉普拉斯算子进行了全局傅里叶分析。图形;第二种方法基于扩散小波,该扩散小波使用由扩散算子或图上的随机游走的幂所诱发的多尺度膨胀将经典小波泛化为图。这些方法共同构成了用于解决大型马尔可夫决策过程的新一代方法的基础,在该方法中同时学习了基础表示和策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号