IEEE Transactions on Neural Networks and Learning Systems

Algorithmic Survey of Parametric Value Function Approximation



Abstract

Reinforcement learning (RL) is a machine learning answer to the optimal control problem. It consists of learning an optimal control policy through interactions with the system to be controlled, the quality of this policy being quantified by the so-called value function. A recurrent subtopic of RL concerns computing an approximation of this value function when the system is too large for an exact representation. This survey reviews state-of-the-art methods for (parametric) value function approximation by grouping them into three main categories: bootstrapping, residual, and projected fixed-point approaches. Related algorithms are derived by considering one of the associated cost functions and a specific minimization method, generally a stochastic gradient descent or a recursive least-squares approach.
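To make the abstract's categories concrete, the following is a minimal sketch of one member of the "bootstrapping" family it mentions: semi-gradient TD(0) with a linear parametric value function V_theta(s) = theta · phi(s), minimized by stochastic gradient descent. The 3-state chain MDP, the one-hot features, and the function names are illustrative assumptions, not taken from the survey itself.

```python
import numpy as np

def phi(s, n_states=3):
    """One-hot feature vector for state s (tabular features as a special case
    of a linear parametric architecture). Illustrative choice."""
    f = np.zeros(n_states)
    f[s] = 1.0
    return f

def td0_linear(episodes, alpha=0.1, gamma=0.9, n_states=3):
    """Evaluate a fixed policy on a toy deterministic chain: state s moves to
    s + 1, with reward 1 on entering the terminal state (s = n_states - 1).

    This is semi-gradient TD(0): the target r + gamma * V(s') is treated as a
    constant (bootstrapping), and theta follows a stochastic gradient step on
    the resulting squared TD error."""
    theta = np.zeros(n_states)
    for _ in range(episodes):
        s = 0
        while s < n_states - 1:
            s_next = s + 1
            r = 1.0 if s_next == n_states - 1 else 0.0
            # Terminal states have value zero by convention.
            v_next = 0.0 if s_next == n_states - 1 else theta @ phi(s_next)
            # TD error: bootstrapped target minus current estimate.
            delta = r + gamma * v_next - theta @ phi(s)
            # Semi-gradient SGD update: theta += alpha * delta * grad V_theta(s).
            theta += alpha * delta * phi(s)
            s = s_next
    return theta
```

On this chain the exact values are V(1) = 1 and V(0) = gamma * V(1) = 0.9, and `td0_linear(1000)` converges to them; residual and projected fixed-point methods in the survey start from the same TD error but minimize different cost functions over it.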
