首页> 外文OA文献 >A unified framework for linear function approximation of value functions in stochastic control

【2h】

A unified framework for linear function approximation of value functions in stochastic control

机译：随机控制中值函数的线性函数逼近的统一框架

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper contributes with a unified formulation that merges previ- ous analysis on the prediction of the performance ( value function ) of certain sequence of actions ( policy ) when an agent operates a Markov decision process with large state-space. When the states are represented by features and the value function is linearly approxi- mated, our analysis reveals a new relationship between two common cost functions used to obtain the optimal approximation. In addition, this analysis allows us to propose an efficient adaptive algorithm that provides an unbiased linear estimate. The performance of the pro- posed algorithm is illustrated by simulation, showing competitive results when compared with the state-of-the-art solutions.

机译：本文为统一的公式做出了贡献，该公式将先前对代理人在状态空间较大的情况下执行马尔可夫决策过程的行为（策略）的绩效（价值函数）的预测进行的分析合并在一起。当状态由特征表示并且值函数线性近似时，我们的分析揭示了用于获得最佳近似的两个常用成本函数之间的新关系。此外，该分析使我们能够提出一种有效的自适应算法，该算法可提供无偏线性估计。仿真结果说明了所提出算法的性能，与最新解决方案相比，该算法具有竞争优势。

著录项

作者
Sánchez Fernández Matilde; Valcarcel Macua Sergio; Zazo Bello Santiago;
展开▼
作者单位

展开▼
年度 2013
总页数
原文格式 PDF
正文语种 eng
中图分类

相似文献

外文文献
中文文献
专利

1. Adaptive neural control for a class of stochastic nonlinear systems with unknown parameters, unknown nonlinear functions and stochastic disturbances [J] . Chen Chao-Yang, Gui Wei-Hua, Guan Zhi-Hong, Neurocomputing . 2017,第FEBa22期

机译：一类参数未知，非线性函数未知和随机干扰的随机非线性系统的自适应神经控制
2. Approximation of sparse controls in semilinear equations by piecewise linear functions [J] . Casas E., Herzog R., Wachsmuth G. Numerische Mathematik . 2012,第4期

机译：半线性方程组中稀疏控制的分段线性函数逼近
3. Approximation of sparse controls in semilinear equations by piecewise linear functions [J] . Eduardo Casas, Roland Herzog, Gerd Wachsmuth Numerische Mathematik . 2012,第4期

机译：半线性方程组中稀疏控制的分段线性函数逼近
4. A unified framework for linear function approximation of value functions in stochastic control [C] . Sanchez-Fernandez Matilde, Valcarcel Sergio, Zazo Santiago European Signal Processing Conference . 2013

机译：随机控制中值函数的线性函数逼近的统一框架
5. Convex Hulls, Relaxations, and Approximations of General Monomials and Multilinear Functions [D] . Xu, Yibo. 2018

机译：一般单项式和多重线性函数的凸包，松弛和逼近
6. Stochasticity Nonlinear Value Functions and Update Rules in Learning Aesthetic Biases [O] . Norberto M. Grzywacz 2021

机译：学习审美偏差中的瞬极非线性值函数和更新规则
7. A Unified Framework for Linear Function Approximation of Value Functions in Stochastic Control [O] . Sánchez-Fernández Matilde, Valcarcel Macua Sergio, Zazo Santiago 2013

机译：随机控制中值函数的线性函数逼近的统一框架
8. Sampling Representations and Approximations for Certain Functions and Stochastic Processes [R] . Habib, M. K. 1980

机译：某些函数和随机过程的抽样表示和近似

A unified framework for linear function approximation of value functions in stochastic control

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅