Active Imitation Learning

机译：主动模仿学习

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has shown the value of imitation in domains where a single mentor demonstrates execution of a known optimal policy for the benefit of a learning agent. We consider the more general scenario of learning from mentors who are themselves agents seeking to maximize their own rewards. We propose a new algorithm based on the concept of transferable utility for ensuring that an observer agent can learn efficiently in the context of a selfish, not necessarily helpful, mentor. We also address the questions of when an imitative agent should request help from a mentor, and when the mentor can be expected to acknowledge a request for help. In analogy with other types of active learning, we call the proposed approach active imitation learning.

机译：模仿学习，也称为观看学习或示范编程，已成为加速许多强化学习任务的一种手段。先前的工作已经表明了模仿的价值，在该领域中，单个指导者演示了为学习代理的利益而执行已知的最佳策略。我们考虑从导师那里学习的更一般情况，导师本身就是寻求最大化自己的报酬的代理商。我们提出了一种基于可转移效用的概念的新算法，以确保观察者代理可以在自私，不一定有用的导师的情况下有效学习。我们还将解决以下问题：模拟代理人何时应向导师寻求帮助，以及何时可以期望导师确认求助请求。与其他类型的主动学习类似，我们将提出的方法称为主动模仿学习。

著录项

来源
《AAAI Conference on Artificial Intelligence(AAAI-07); Innovative Applications of Artificial Intelligence Conference(IAAI-07); 20070722-26; 20070722-26; Vancouver(CA); Vancouver(CA)》|2007年|P.756-762|共7页
会议地点 Vancouver(CA);Vancouver(CA)
作者
Aaron P. Shon; Deepak Verma; Rajesh P. N. Rao;
展开▼
作者单位

Department of Computer Science and Engineering University of Washington Seattle, WA 98195;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Active Imitation Learning: Formal and Practical Reductions to I.I.D. Learning [J] . Kshitij Judah, Alan P. Fern, Thomas G. Dietterich, Journal of machine learning research . 2014,第Nov期

机译：主动模仿学习：减少I.I.D.的形式和实践学习
2. Imitation or innovation: To what extent do exploitative learning and exploratory learning foster imitation strategy and innovation strategy for sustained competitive advantage? [J] . Ali Murad Technological forecasting and social change . 2021,第Apra期

机译：模仿或创新：利用竞争优势的剥削策略和创新策略在多大程度上？
3. Evaluation of Occupational Performance Imitation Intervention on Three Imitation Learnings among Autism: Case Series [J] . Smily Jesu Priya Victor Paulraj, Ruwinah Abdul Karim, Jayachandran Vetrayan Procedia - Social and Behavioral Sciences . 2015,第2期

机译：孤独症患者三种模仿学习的职业绩效模仿干预评估：案例系列
4. Active Imitation Learning via Reduction to I.I.D. Active Learning [C] . Kshitij Judah, Alan P. Fern, Thomas G. Dietterich Conference on uncertainty in artificial intelligence . 2012

机译：通过减少I.I.D.进行主动模仿学习主动学习
5. Learning to search: Structured prediction techniques for imitation learning. [D] . Ratliff, Nathan D. 2009

机译：学习搜索：模仿学习的结构化预测技术。
6. Learning for a Robot: Deep Reinforcement Learning Imitation Learning Transfer Learning [O] . Jiang Hua, Liangcai Zeng, Gongfa Li, 2021

机译：学习机器人：深增强学习仿制学习转移学习
7. Learning How to Actively Learn: A Deep Imitation Learning Approach [O] . Ming Liu, Wray Buntine, Gholamreza Haffari 2018

机译：学习如何积极学习：深度模仿学习方法

Active Imitation Learning

摘要

著录项

相似文献

相关主题

期刊订阅