Agents Teaching Agents in Reinforcement Learning (Nectar Abstract)

机译：强化学习中的代理商教学代理商（花蜜摘要）

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Using reinforcement learning(RL), agents can autonomously learn a control policy to master sequential-decision tasks. Rather than always learning tabula rasa, our recent work considers how an experienced RL agent, the teacher, can help another RL agent, the student, to learn. As a motivating example, consider a household robot that has learned to perform tasks in a household. When the consumer purchases a new robot, she would like the student robot to quickly learn to perform the same tasks as the teacher robot, even if the new robot has different state representation, learning method, or manufacturer. Our goals are to: 1) Allow the student to learn faster with the teacher than without it, 2) Allow the student and teacher to have different learning methods and knowledge representations, 3) Not limit the student's performance when the teacher is sub-optimal, 4) Not require a complex, shared language, and 5) Limit the amount of communication required between the agents.

机译：使用钢筋学习（RL），代理可以自主地学习控制策略以掌握顺序决策任务。我们最近的工作而不是总是学习塔杜RASA，而是考虑了经验丰富的RL代理人，老师可以帮助另一个RL代理人，学生学习。作为一个激励例子，考虑一下已经学会在家庭中执行任务的家用机器人。当消费者购买新机器人时，她希望学生机器人能够快速学习与教师机器人一起执行相同的任务，即使新机器人有不同的状态表示，学习方法或制造商。我们的目标是：1）允许学生与老师更快地学习而不是没有它，2）允许学生和老师有不同的学习方法和知识表示，3）当老师是次优时，没有限制学生的表现4）不需要复杂，共享语言和5）限制代理商所需的通信量。

著录项

来源
《European conference on machine learning and knowledge discovery in databases》|2014年|524-528|共5页
会议地点
作者
Matthew E. Taylor; Lisa Torrey;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Reinforcement learning technique using agent state occurrence frequency with analysis of knowledge sharing on the agent’s learning process in multiagent environments [J] . H. S. Al-Dayaa, D. B. Megherbi The Journal of Supercomputing . 2012,第1期

机译：使用座席状态发生频率并分析多座席环境中座席学习过程中的知识共享的强化学习技术
2. Reinforcement learning technique using agent state occurrence frequency with analysis of knowledge sharing on the agent's learning process in multiagent environments [J] . H.S. Al-Dayaa, D.B. Megherbi Journal of supercomputing . 2012,第1期

机译：使用代理状态发生频率并分析多代理环境中代理学习过程中的知识共享的强化学习技术
3. A multi-agent reinforcement learning method with learning of other agents for competitive game [J] . Yoichiro Matsuno, Tatsuya Yamazaki, Jun Matsuda, 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2000,第688期

机译：一种多智能体强化学习方法，结合其他智能体进行竞技游戏学习
4. Agents Teaching Agents in Reinforcement Learning (Nectar Abstract) [C] . Matthew E. Taylor, Lisa Torrey European Conference on Machine Learning and Knowledge Discovery in Databases . 2014

机译：强化学习的代理教学代理（Nectar Abstract）
5. Hybrid learning approach based on adaptive resonance theory and reinforcement learning for computer generated agents. [D] . Ninomiya, Susumu. 2002

机译：基于自适应共振理论和针对计算机生成的主体的强化学习的混合学习方法。
6. Implementing Service-Learning Programs in Physical Education; Teacher Education as Teaching and Learning Models for All the Agents Involved: A Systematic Review [O] . Raquel Pérez-Ordás, Alberto Nuviala, Alberto Grao-Cruces, 2021

机译：在体育教育中实施服务学习计划;教师教育作为涉及所有代理商的教学和学习模式：系统审查
7. Self-improving reactive agents based on reinforcement learning, planning and teaching [O] . Long-ji Lin 1992

机译：基于强化学习，计划和教学的自我改善反应剂

Agents Teaching Agents in Reinforcement Learning (Nectar Abstract)

摘要

著录项

相似文献

相关主题

期刊订阅