Efficient exploration through active learning for value function approximation in reinforcement learning.

Neural Networks: The Official Journal of the International Neural Network Society

Abstract

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares policy iteration (LSPI) framework allows us to employ statistical active learning methods for linear regression. We then propose a method for designing good sampling policies for efficient exploration, which is particularly useful when the sampling cost of immediate rewards is high. The effectiveness of the proposed method, which we call active policy iteration (API), is demonstrated through simulations with a batting robot.
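
The two ingredients named in the abstract can be sketched concretely. Below is a minimal Python sketch, assuming a polynomial feature map and a trace-of-the-inverse-design-matrix variance proxy as the active-learning score; the paper derives its own generalization-error estimator for LSPI, so the feature map, the `sampling_score` criterion, and all names here are illustrative assumptions, not the authors' method.

```python
# Minimal sketch (not the paper's exact method): linear least-squares
# Q-function fitting as in LSPI, plus a simple variance-based score for
# comparing candidate sampling policies before spending the reward budget.
# The feature map and the trace criterion are illustrative assumptions.
import numpy as np

N_ACTIONS, DEGREE = 2, 2

def features(s, a):
    """Polynomial state features, one block per discrete action."""
    phi = np.zeros((DEGREE + 1) * N_ACTIONS)
    phi[a * (DEGREE + 1):(a + 1) * (DEGREE + 1)] = [s**d for d in range(DEGREE + 1)]
    return phi

def lstd_q(samples, policy, gamma=0.95, reg=1e-3):
    """LSTD-Q: fit Q(s, a) = phi(s, a)^T theta from (s, a, r, s') samples."""
    dim = features(0.0, 0).size
    A, b = reg * np.eye(dim), np.zeros(dim)
    for s, a, r, s_next in samples:
        phi = features(s, a)
        phi_next = features(s_next, policy(s_next))
        A += np.outer(phi, phi - gamma * phi_next)
        b += r * phi
    return np.linalg.solve(A, b)

def sampling_score(samples, reg=1e-3):
    """Proxy for the variance of the least-squares estimate under this
    sample set: trace of the inverse design matrix (smaller is better).
    The paper instead uses a generalization-error estimator for LSPI."""
    X = np.array([features(s, a) for s, a, _, _ in samples])
    dim = X.shape[1]
    return np.trace(np.linalg.inv(X.T @ X + reg * np.eye(dim)))
```

Under this sketch, one would roll out a small pilot set under each candidate sampling policy, keep the policy with the lowest `sampling_score`, collect the full (reward-expensive) batch with it, and fit the Q-function with `lstd_q` inside the usual policy-iteration loop.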