Principled Methods for Biasing Reinforcement Learning Agents

机译：偏置强化学习代理的原则方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reinforcement learning (RL) is a powerful technique for learning in domains where there is no instructive feedback but only evaluative feedback and is rapidly expanding in industrial and research fields. One of the main limitations of RL is the slowness in convergence. Thus, several methods have been proposed to speed up RL. They involve the incorporation of prior knowledge or bias into RL. In this paper, we present a new method for incorporating bias into RL. This method extends the choosing initial Q-values method proposed by Hailu G. and Sommer G. and one kind of learning mechanism is introduced into agent. This allows for much more specific information to guide the agent which action to choose and meanwhile it is helpful to reduce the state research space. So it improves the learning performance and speed up the convergence of the learning process greatly.

机译：强化学习（RL）是一种强大的技术，可用于没有指导性反馈但只有评估性反馈的领域中进行学习，并且在工业和研究领域中正在迅速扩展。 RL的主要限制之一是收敛速度慢。因此，已经提出了几种方法来加速RL。它们涉及将现有知识或偏见纳入RL。在本文中，我们提出了一种将偏差纳入RL的新方法。该方法扩展了Hailu G.和Sommer G.提出的选择初始Q值方法，并将一种学习机制引入到agent中。这样可以提供更多具体信息来指导代理选择哪种操作，同时有助于减少状态研究空间。这样可以大大提高学习效果，大大加快学习过程的收敛速度。

著录项

来源
《AICI 2011;International conference on artificial intelligence and computational intelligence》|2011年|p.703-709|共7页
会议地点
作者
Zhi Li; Kun Hu; Zengrong Liu; Xueli Yu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
reinforcement learning; prior knowledge; bias; Q-learning; biasing Q-learning;

机译：强化学习;先验知识;偏压; Q学习偏向Q学习;

相似文献

外文文献
中文文献
专利

1. A multi-agent reinforcement learning method with learning of other agents for competitive game [J] . Yoichiro Matsuno, Tatsuya Yamazaki, Jun Matsuda, 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2000,第688期

机译：一种多智能体强化学习方法，结合其他智能体进行竞技游戏学习
2. A multi-agent reinforcement learning method with learning of other agents for competitive game [J] . Yoichiro Matsuno, Tatsuya Yamazaki, Jun Matsuda, 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2000,第688期

机译：一种多智能体强化学习方法，结合其他智能体进行竞技游戏学习
3. A multi-agent reinforcement learning method with learning of other agents for competitive game [J] . Yoichiro Matsuno, Tatsuya Yamazaki, Jun Matsuda, 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2000,第688期

机译：一种多智能体增强学习方法，具有竞争游戏的其他代理商
4. Principled Methods for Biasing Reinforcement Learning Agents [C] . Zhi Li, Kun Hu, Zengrong Liu, International Conference on Artificial Intelligence and Computational Intelligence . 2011

机译：偏置强化学习代理的原理方法
5. Hybrid learning approach based on adaptive resonance theory and reinforcement learning for computer generated agents. [D] . Ninomiya, Susumu. 2002

机译：基于自适应共振理论和针对计算机生成的主体的强化学习的混合学习方法。
6. Co-Evolution of Predator-Prey Ecosystems by Reinforcement Learning Agents [O] . Jeongho Park, Juwon Lee, Taehwan Kim, 2021

机译：加固学习代理捕食者 - 猎物生态系统的共同演变
7. Evaluating persuasion strategies and deep reinforcement learning methods for negotiation dialogue agents [O] . Keizer Simon, Guhe Markus, Cuayahuitl Heriberto, 2017

机译：评估谈判对话者的说服策略和深度强化学习方法

Principled Methods for Biasing Reinforcement Learning Agents

摘要

著录项

相似文献

相关主题

期刊订阅