Reinforcement Learning in Continuous Time and Space: Interference and Not Ill Conditioning Is the Main Problem When Using Distributed Function Approximators

Baddeley B.

首页> 外文期刊>IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics >Reinforcement Learning in Continuous Time and Space: Interference and Not Ill Conditioning Is the Main Problem When Using Distributed Function Approximators

【24h】

Reinforcement Learning in Continuous Time and Space: Interference and Not Ill Conditioning Is the Main Problem When Using Distributed Function Approximators

机译：连续时间和空间中的强化学习：使用分布式函数逼近器时的主要问题是干扰和非病态调节

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Many interesting problems in reinforcement learning (RL) are continuous and/or high dimensional, and in this instance, RL techniques require the use of function approximators for learning value functions and policies. Often, local linear models have been

机译：强化学习（RL）中许多有趣的问题是连续的和/或高维的，在这种情况下，RL技术需要使用函数逼近器来学习价值函数和策略。通常，局部线性模型

著录项

来源
《IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics》 |2008年第4期|p.950-956|共7页
作者
Baddeley B.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化基础理论;
关键词

相似文献

外文文献
中文文献
专利

1. Experiments of conditioned reinforcement learning in continuous space control tasks [J] . Borja Fernandez-Gauna, Juan Luis Osa, Manuel Graña Neurocomputing . 2018,第JANa3期

机译：连续空间控制任务中条件强化学习的实验
2. Reinforcement Learning in Continuous Time and Space: A Stochastic Control Approach [J] . Haoran Wang, Thaleia Zariphopoulou, Xun Yu Zhou Journal of machine learning research . 2020,第a期

机译：连续时间和空间的加固学习：随机控制方法
3. Reinforcement Learning in Continuous Time and Space [J] . Kenji Doya Neural computation . 2000,第1期

机译：连续时间和空间中的强化学习
4. Reinforcement learning in continuous time and space: Interference and not ill-conditioning is the main problem when using distributed function approximators [C] . Bart Baddeley Symposium on Artificial Immune Systems and Immune System Modelling . 2007

机译：在连续时间和空间中加强学习：干扰且不良状态是使用分布式功能近似器时的主要问题
5. Time-independent and time-dependent wavepacket approaches to quantum dynamics using distributed approximating functionals. [D] . Zhu, Wei. 1995

机译：时间无关和时间相关的小波包方法使用分布式近似函数进行量子动力学。
6. Correction: Spike-Based Reinforcement Learning in Continuous State and Action Space: When Policy Gradient Methods Fail [O] . Eleni Vasilaki, Nicolas Frémaux, Robert Urbanczik, 2009

机译：更正：在连续状态和动作空间中基于峰值的强化学习：当策略梯度方法失败时
7. Approximating the Value Function for Continuous Space Reinforcement Learning in Robot Control [O] . Sebastian Buck, Michael Beetz, Thorsten Schmitt 2002

机译：机器人控制中连续空间强化学习的值函数逼近
8. Optimal Reward Functions in Distributed Reinforcement Learning [R] . Wolpert, David H., Tumer, Kagan 2000

机译：分布式强化学习中的最优奖励函数

Reinforcement Learning in Continuous Time and Space: Interference and Not Ill Conditioning Is the Main Problem When Using Distributed Function Approximators

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅