LEARNING IMITATION STRATEGIES USING COST-BASED POLICY MAPPING AND TASK REWARDS

机译：使用基于成本的策略映射和任务奖励学习仿制策略

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Learning by imitation represents a powerful approach for efficient learning and low-overhead programming. An important part of the imitation process is the mapping of observations to an executable control strategy. This is particularly important if the capabilities of the imitating and the demonstrating agent differ significantly. This paper presents an approach that addresses this problem by optimizing a cost function. The result is an executable strategy that as closely as possible resembles the observed effects of the demonstrator on the environment. To ensure that the imitating agent replicates the important aspects of the observed task, a learning component is introduced which learns the appropriate cost function from rewards obtained while executing the imitation strategy. The performance of this approach is illustrated within the context of a simulated multi-agent environment.

机译：仿真学习代表了有效学习和低开销编程的强大方法。仿制过程的一个重要部分是对可执行控制策略的观察映射。如果模拟的能力和说明剂显着差异，则这尤其重要。本文介绍了一种方法，通过优化成本函数来解决这个问题。结果是一种可执行的策略，尽可能地就像示威者对环境的观察到的效果一样。为了确保模仿代理复制观察到的任务的重要方面，介绍了一个学习组件，其在执行仿制策略时从获得的奖励中了解了适当的成本函数。在模拟的多代理环境的上下文中示出了这种方法的性能。

著录项

来源
《IASTED International Conference on Intelligent Systems and Control》|2006年||共6页
会议地点
作者
Srichandan V. Gudla; Manfred Huber;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Imitation; Reinforcement Learning; Policy Mapping;

机译：模仿;加强学习;政策映射;

相似文献

外文文献
中文文献
专利

1. Integration of imitation learning using GAIL and reinforcement learning using task-achievement rewards via probabilistic graphical model [J] . Kinose Akira, Taniguchi Tadahiro Advanced Robotics: The International Journal of the Robotics Society of Japan . 2020,第15a16期

机译：通过概率图形模型使用任务成就奖励使用盖尔和强化学习的模仿学习
2. Imitation or innovation: To what extent do exploitative learning and exploratory learning foster imitation strategy and innovation strategy for sustained competitive advantage? [J] . Ali Murad Technological forecasting and social change . 2021,第Apra期

机译：模仿或创新：利用竞争优势的剥削策略和创新策略在多大程度上？
3. The effect of learning by imitation on a multi-robot system based on the coupling of low-level imitation strategy and online learning for cognitive map building [J] . Abdelhak Chatty, Philippe Gaussier, Syed Khursheed Hasnain, Advanced Robotics: The International Journal of the Robotics Society of Japan . 2014,第11a12期

机译：基于低级模仿策略与在线学习相结合的模仿学习对多机器人系统的影响
4. LEARNING IMITATION STRATEGIES USING COST-BASED POLICY MAPPING AND TASK REWARDS [C] . Srichandan V. Gudla, Manfred Huber IASTED International Conference on Intelligent Systems and Control . 2006

机译：使用基于成本的策略映射和任务奖励学习仿制策略
5. Adaptive cost-based policy mapping for imitation. [D] . Gudla, Srichandan Venkat. 2003

机译：用于模仿的基于成本的自适应策略映射。
6. Structure-Preserving Imitation Learning With Delayed Reward: An Evaluation Within the RoboCup Soccer 2D Simulation Environment [O] . Quang Dang Nguyen, Mikhail Prokopenko 2020

机译：延迟奖励的结构保留模仿学习：Robocup Soccer 2D模拟环境中的评估
7. Integration of imitation learning using GAIL and reinforcement learning using task-achievement rewards via probabilistic graphical model [O] . Akira Kinose, Tadahiro Taniguchi 2020

机译：概率图形模型使用盖爪和加固学习的仿制学习的集成

LEARNING IMITATION STRATEGIES USING COST-BASED POLICY MAPPING AND TASK REWARDS

摘要

著录项

相似文献

相关主题

期刊订阅