Acceleration of Reinforcement Learning with Incomplete Prior Information

Kento Terashima; Hirotaka Takano; Junichi Murata


Abstract

Reinforcement learning is applicable to complex or unknown problems because it searches for a solution by trial and error. However, the computation time of this search grows with the scale of the problem. To reduce it, several methods that exploit prior information about the problem have been proposed. This paper improves a previously proposed method that uses options as prior information. To speed up learning even when the given options are wrong, it proposes correcting options by forgetting their policies and extending their initiation sets.
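
The abstract does not give the details of the two correction methods, so the following is only a rough sketch of what an option and its correction might look like. It assumes the usual description of an option as an initiation set, a policy, and a termination set. The `trust` table, the `factor` and `threshold` parameters, and the functions `forget` and `extend_initiation` are hypothetical names introduced for illustration and are not taken from the paper.

```python
from dataclasses import dataclass, field


@dataclass
class Option:
    """An option: where it may start, what it does, and where it stops."""
    initiation_set: set      # states in which the option may be invoked
    policy: dict             # state -> primitive action
    termination_set: set     # states in which the option terminates
    trust: dict = field(default_factory=dict)  # hypothetical per-state trust in the policy

    def available(self, state):
        return state in self.initiation_set


def forget(option, state, factor=0.5, threshold=0.1):
    """Hypothetical correction: decay the trust in the option's action at
    `state` by a forgetting factor, and drop that action (and the state from
    the initiation set) once the trust falls below `threshold`."""
    trust = option.trust.get(state, 1.0) * factor
    option.trust[state] = trust
    if trust < threshold and state in option.policy:
        del option.policy[state]
        option.initiation_set.discard(state)


def extend_initiation(option, state, action):
    """Hypothetical correction: allow the option to start from a new state
    by adding the state to the initiation set together with an action."""
    option.initiation_set.add(state)
    option.policy[state] = action
    option.trust[state] = 1.0


# Usage: an option that says "go right" in states 0..2 and ends in state 3.
opt = Option({0, 1, 2}, {0: "right", 1: "right", 2: "right"}, {3})
for _ in range(4):            # the action in state 1 keeps turning out wrong
    forget(opt, 1)
extend_initiation(opt, 5, "left")
```

In this sketch a wrong part of an option is discarded gradually rather than at once, and a useful option can be made available in more states. How the paper actually decides when to forget or extend is not stated in the abstract.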


Bibliographic details

Source: Journal of Advanced Computational Intelligence and Intelligent Informatics, 2013, Issue 100, 10 pages
Authors: Kento Terashima; Hirotaka Takano; Junichi Murata
Original format: PDF
Language: English
CLC classification: Other computers
Keywords: Reinforcement learning; Q-learning; Option; Prior information; Forgetting factor

