首页> 外文会议>European conference on machine learning and knowledge discovery in databases >Agents Teaching Agents in Reinforcement Learning (Nectar Abstract)
【24h】

Agents Teaching Agents in Reinforcement Learning (Nectar Abstract)

机译:强化学习中的代理商教学代理商(花蜜摘要)

获取原文

摘要

Using reinforcement learning(RL), agents can autonomously learn a control policy to master sequential-decision tasks. Rather than always learning tabula rasa, our recent work considers how an experienced RL agent, the teacher, can help another RL agent, the student, to learn. As a motivating example, consider a household robot that has learned to perform tasks in a household. When the consumer purchases a new robot, she would like the student robot to quickly learn to perform the same tasks as the teacher robot, even if the new robot has different state representation, learning method, or manufacturer. Our goals are to: 1) Allow the student to learn faster with the teacher than without it, 2) Allow the student and teacher to have different learning methods and knowledge representations, 3) Not limit the student's performance when the teacher is sub-optimal, 4) Not require a complex, shared language, and 5) Limit the amount of communication required between the agents.
机译:使用钢筋学习(RL),代理可以自主地学习控制策略以掌握顺序决策任务。我们最近的工作而不是总是学习塔杜RASA,而是考虑了经验丰富的RL代理人,老师可以帮助另一个RL代理人,学生学习。作为一个激励例子,考虑一下已经学会在家庭中执行任务的家用机器人。当消费者购买新机器人时,她希望学生机器人能够快速学习与教师机器人一起执行相同的任务,即使新机器人有不同的状态表示,学习方法或制造商。我们的目标是:1)允许学生与老师更快地学习而不是没有它,2)允许学生和老师有不同的学习方法和知识表示,3)当老师是次优时,没有限制学生的表现4)不需要复杂,共享语言和5)限制代理商所需的通信量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号