首页> 外文会议>Annual Conference on Neural Information Processing Systems >Value Function Approximation with Diffusion Wavelets and Laplacian Eigenfunctions
【24h】

Value Function Approximation with Diffusion Wavelets and Laplacian Eigenfunctions

机译:与扩散小波和拉普拉斯特征函数的价值函数近似

获取原文

摘要

We investigate the problem of automatically constructing efficient representations or basis functions for approximating value functions based on analyzing the structure and topology of the state space. In particular, two novel approaches to value function approximation are explored based on automatically constructing basis functions on state spaces that can be represented as graphs or manifolds: one approach uses the eigen-functions of the Laplacian, in effect performing a global Fourier analysis on the graph; the second approach is based on diffusion wavelets, which generalize classical wavelets to graphs using multiscale dilations induced by powers of a diffusion operator or random walk on the graph. Together, these approaches form the foundation of a new generation of methods for solving large Markov decision processes, in which the underlying representation and policies are simultaneously learned.
机译:我们研究了基于分析状态空间的结构和拓扑结构和近似值函数的高效陈述或基础函数的问题。特别地,基于在可以表示为图形或歧管的状态空间上自动构建基本函数来探索两种重大函数近似的新方法:一种方法使用拉普拉斯的eIgen函数,实际上是对...进行全局傅立叶分析图形;第二种方法是基于扩散小波,其使用由扩散操作员的功率或随机步行诱导的多尺度扩张来推广经典小波。这些方法在一起形成了解决大型马尔可夫决策过程的新一代方法的基础,其中潜在的代表和政策是同时学习的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号