A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning

David Choi; Benjamin Van Roy


Abstract

The traditional Kalman filter can be viewed as a recursive stochastic algorithm that approximates an unknown function via a linear combination of prespecified basis functions given a sequence of noisy samples. In this paper, we generalize the algorithm to one that approximates the fixed point of an operator that is known to be a Euclidean norm contraction. Instead of noisy samples of the desired fixed point, the algorithm updates parameters based on noisy samples of functions generated by application of the operator, in the spirit of Robbins–Monro stochastic approximation. The algorithm is motivated by temporal-difference learning, and our developments lead to a possibly more efficient variant of temporal-difference learning. We establish convergence of the algorithm and explore efficiency gains through computational experiments involving optimal stopping and queueing problems.
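
The first and third sentences of the abstract can be illustrated with a small sketch. This is not the algorithm or the experiments from the paper. It is standard recursive least squares (a Kalman filter for a static weight vector) written in plain Python, with one assumed twist: the regression target may depend on the current approximation. The names recursive_fit, target, approx, cost and alpha, the basis [1, x], and the toy operator (F J)(x) = cost(x) + alpha * J(x) are all hypothetical choices made for this illustration.

```python
import random


def recursive_fit(xs, target, basis, prior_scale=1.0):
    """Recursive least-squares update (a Kalman filter for static weights).

    target(x, approx) returns a noisy regression target at x. approx is the
    current approximation; a plain regression target simply ignores it.
    """
    k = len(basis)
    r = [0.0] * k  # weights of the linear combination of basis functions
    # P plays the role of the filter's covariance matrix.
    P = [[prior_scale if i == j else 0.0 for j in range(k)] for i in range(k)]
    for x in xs:
        phi = [f(x) for f in basis]
        # Freeze the current weights so the target sees the current approximation.
        approx = lambda z, w=tuple(r): sum(wi * f(z) for wi, f in zip(w, basis))
        y = target(x, approx)
        P_phi = [sum(P[i][j] * phi[j] for j in range(k)) for i in range(k)]
        denom = 1.0 + sum(phi[i] * P_phi[i] for i in range(k))
        gain = [v / denom for v in P_phi]
        err = y - sum(phi[i] * r[i] for i in range(k))
        r = [r[i] + gain[i] * err for i in range(k)]
        # Sherman-Morrison update of P (P is symmetric, so P_phi is also phi^T P).
        P = [[P[i][j] - gain[i] * P_phi[j] for j in range(k)] for i in range(k)]
    return r


if __name__ == "__main__":
    random.seed(0)
    basis = [lambda x: 1.0, lambda x: x]          # assumed basis: 1 and x
    xs = [random.random() for _ in range(20000)]  # sample points in [0, 1)
    cost = lambda x: 1.0 + 2.0 * x                # assumed toy function

    # (a) Noisy samples of the unknown function itself.
    # The weights should approach [1, 2].
    print(recursive_fit(xs, lambda x, approx: cost(x) + random.gauss(0.0, 0.1), basis))

    # (b) Noisy samples of the operator applied to the current approximation.
    # F is a contraction for alpha < 1 with fixed point cost / (1 - alpha),
    # so the weights should approach [1 / 0.7, 2 / 0.7].
    alpha = 0.3
    print(recursive_fit(
        xs, lambda x, approx: cost(x) + alpha * approx(x) + random.gauss(0.0, 0.1), basis))
```

In case (a) the target never looks at the approximation, which is the ordinary filtering view. In case (b) the only change is the target, which is the sense in which the update is driven by samples of the operator's output rather than samples of the fixed point. The toy operator acts pointwise and was picked only because its fixed point is easy to write down; convergence in this sketch is also noticeably slower in case (b) than in case (a).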


Bibliographic details

Source: Discrete event dynamic systems: Theory and applications, 2006, Issue 2, 33 pages
Authors: David Choi; Benjamin Van Roy
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology
Keywords: Dynamic programming; Kalman filter; Optimal stopping; Queueing; Recursive least-squares; Reinforcement learning; Temporal-difference learning


Similar literature

1. A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning [J]. David Choi, Benjamin Van Roy. Discrete event dynamic systems: Theory and applications, 2006, Issue 2.
2. Evolutionary Deep Learning with Extended Kalman Filter for Effective Prediction Modeling and Efficient Data Assimilation [J]. Li Qiao, Wu Zheng Yi, Rahman Atiqur. Journal of Computing in Civil Engineering, 2019, Issue 3.
3. A generalized Kalman filter for fixed point approximation and efficient temporal-difference learning [C]. David Choi, Benjamin Van Roy. International Conference on Machine Learning, 2001.
4. Efficient multiplierless filter designs for fixed-coefficient and adaptive filtering [D]. Chen, Chao-Liang. 2000.
5. Measurement Noise Recommendation for Efficient Kalman Filtering over a Large Amount of Sensor Data [O]. Sebin Park, Myeong-Seon Gil, Hyeonseung Im. 2019.
6. Efficient Approximation of the Mahalanobis Distance for Tracking with the Kalman Filter [O]. R. R. Pinho, J. M. R. S Tavares, M. F. V. Correia. 2007.
