Automatic State Abstraction from Demonstration

机译：演示中的自动状态抽象

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Learning from Demonstration (LfD) is a popular technique for building decision-making agents from human help. Traditional LfD methods use demonstrations as training examples for supervised learning, but complex tasks can require more examples than is practical to obtain. We present Abstraction from Demonstration (AfD), a novel form of LfD that uses demonstrations to infer state abstractions and reinforcement learning (RL) methods in those abstract state spaces to build a policy. Empirical results show that AfD is greater than an order of magnitude more sample efficient than just using demonstrations as training examples, and exponentially faster than RL alone.

机译：从演示中学习（LfD）是一种流行的技术，可以通过人的帮助来建立决策者。传统的LfD方法使用演示作为监督学习的训练示例，但是复杂的任务可能需要比实际获得更多的示例。我们介绍了演示的抽象形式（AfD），它是LfD的一种新颖形式，它使用演示来推断那些抽象状态空间中的状态抽象和强化学习（RL）方法以构建策略。实验结果表明，与仅使用演示作为训练示例相比，AfD的采样效率要高出一个数量级，并且比单独的RL快几倍。

著录项

来源
《》|2012年|p.1243-1248|共6页
会议地点
作者
Luis C. Cobo; Peng Zang; Charles L. Isbell Jr.; Andrea L. Thomaz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;人工智能理论;
关键词
入库时间 2022-08-26 15:09:16

相似文献

外文文献
中文文献
专利

1. Abstraction Assistant: An Automatic Text Abstraction System [J] . Wendy Wang, Drew Hwang Journal of the American Society for Information Science and Technology . 2010,第9期

机译：抽象助手：自动文本抽象系统
2. Automatic Processes in Database Building and Subsequent Automatic Abstractions [J] . D.E.RICHARDSON Cartographica . 1996,第1期

机译：数据库构建中的自动过程和后续的自动抽象
3. Stepping Back: Reflections on a Pedagogical Demonstration of Reflective Abstraction [J] . Allen Jedediah W. P., Bickhard Mark H. Human Development . 2015,第4a5期

机译：退后一步：对反思性抽象的教学论的反思
4. Automatic Task Decomposition and State Abstraction from Demonstration [C] . Luis C. Cobo, Charles L. Isbell, Andrea L. Thomaz International Conference on Autonomous Agents and Multiagent Systems . 2012

机译：自动任务分解与示范的抽象
5. Learning Hierarchical Abstractions from Human Demonstrations for Application-Scale Domains [D] . Leece, Michael. 2019

机译：从人类演示为应用程序规模域学习分层抽象
6. A research and demonstration procedure in stimulus control abstraction and environmental programming [O] . Israel Goldiamond 1964

机译：刺激控制抽象和环境程序设计的研究和演示程序
7. A research and demonstration procedure in stimulus control, abstraction, and environmental programming1 [O] . Goldiamond, Israel 1964

机译：刺激控制，抽象和环境程序设计的研究和演示程序1
8. Construct Abstraction for Automatic Information Abstraction from Digital Images [R] . Sugisaka, M. , Johnson, J. 2006

机译：构建数字图像自动信息抽象的抽象

Automatic State Abstraction from Demonstration

摘要

著录项

相似文献

相关主题

期刊订阅