Memory-guided exploration in reinforcement learning

机译：记忆学习中的强化学习探索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We focus on the task transfer in reinforcement learning and specifically in Q-learning. There are three main model free methods for performing task transfer in Q-learning: direct transfer, soft transfer and memory-guided exploration. In direct transfer, the Q-values from a previous task are used to initialize the Q-values of the next task. The soft transfer initializes the Q-values of the new task with a weighted average of the standard initialization value and the Q-values of the previous task. In memory-guided exploration the Q-values of previous tasks are used as a guide in the initial exploration of the agent. The weight that the agent gives to its past experience decreases over time. We explore stability issues related to the off-policy nature of memory-guided exploration and compare memory-guided exploration to soft transfer and direct transfer in three different environments.

机译：我们专注于强化学习中的任务转移，尤其是Q学习中的任务转移。在Q学习中执行任务转移有三种主要的无模型方法：直接转移，软转移和内存引导的探索。在直接传输中，上一个任务的Q值用于初始化下一个任务的Q值。软传输使用标准初始化值和先前任务的Q值的加权平均值来初始化新任务的Q值。在以内存为指导的探索中，先前任务的Q值将用作对代理进行初始探索的指导。代理赋予其过去经验的权重会随着时间的推移而降低。我们探讨了与内存引导的探索的非策略性相关的稳定性问题，并将内存引导的探索与软传输和直接传输在三种不同环境中进行了比较。

著录项

来源
《》|2001年|P.1002-1007|共6页
会议地点
作者
Carroll; J.L.; Peterson; T.S.; Owens; N.E.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Learning to soar: Resource-constrained exploration in reinforcement learning [J] . Jen Jen Chung, Nicholas R.J. Lawrance, Salah Sukkarieh The International journal of robotics research . 2015,第2期

机译：学会腾飞：强化学习中资源受限的探索
2. Learning Exploration/Exploitation Strategies for Single Trajectory Reinforcement Learning [J] . Damien Ernst, Francis Maes, Michael Castronovo, JMLR: Workshop and Conference Proceedings . 2012,第2012期

机译：单轨强化学习的学习探索/开发策略
3. Efficient exploration through active learning for value function approximation in reinforcement learning. [J] . Akiyama T, Hachiya H, Sugiyama M Neural Networks: The Official Journal of the International Neural Network Society . 2010,第5期

机译：通过主动学习对强化学习中的价值函数近似进行有效探索。
4. Memory-guided exploration in reinforcement learning [C] . James L. Carroll, Todd S. Peterson, Nancy E. Owens International Joint Conference on Neural Networks . 2001

机译：钢筋学习中的记忆引导探索
5. Memory-Guided Planning: Contributions of the Hippocampus and Episodic Memory to Model-Based Reinforcement Learning [D] . Vikbladh, Oliver Mattias. 2019

机译：记忆导向规划：海马和集体记忆对基于模型的增强学习的贡献
6. An exploration strategy improves the diversity of de novo ligands using deep reinforcement learning: a case for the adenosine A2A receptor [O] . Xuhan Liu, Kai Ye, Herman W. T. van Vlijmen, 2019

机译：探索策略通过深度强化学习来改善从头配体的多样性：腺苷A2A受体的情况
7. Memory-guided exploration in reinforcement learning [O] . James L. Carroll, Todd S. Peterson, Nancy E. Owens 2001

机译：强化学习中的记忆导向探索

Memory-guided exploration in reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅