Cost-Based Policy Mapping for Imitation


Abstract

Imitation represents a powerful approach for programming and autonomous learning in robot and computer systems. An important aspect of imitation is the mapping of observations to an executable control strategy. This is particularly important if the behavioral capabilities of the observed and imitating agents differ significantly. This paper presents an approach that addresses this problem by locally optimizing a cost function representing the deviation from the observed state sequence and the cost of the actions required to perform the imitation. The results are imitation strategies that can be performed by the imitating agent and that resemble the observations of the demonstrating agent as closely as possible. The performance of this approach is illustrated within the context of a simulated multi-agent environment.
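The abstract does not give the paper's cost function or optimizer, so the following is only a minimal sketch of the general idea, under assumptions that are not from the paper: a 2-D grid world, an imitator limited to axis-aligned moves, Manhattan distance as the deviation measure, a hypothetical `action_weight` trade-off parameter, and simple hill climbing as the local optimizer. The demonstrator's observed states move diagonally, which the imitator cannot reproduce exactly.

```python
# Illustrative sketch only; all names, costs, and the grid world are assumptions.

# Hypothetical imitator actions: axis-aligned unit moves (no diagonals).
ACTIONS = {"stay": (0, 0), "up": (0, 1), "down": (0, -1),
           "left": (-1, 0), "right": (1, 0)}
ACTION_COST = {"stay": 0.0, "up": 1.0, "down": 1.0, "left": 1.0, "right": 1.0}


def step(state, action):
    """Apply one imitator action to an (x, y) state."""
    dx, dy = ACTIONS[action]
    return (state[0] + dx, state[1] + dy)


def rollout(start, actions):
    """Return the states visited after each action in the sequence."""
    states, s = [], start
    for a in actions:
        s = step(s, a)
        states.append(s)
    return states


def total_cost(start, actions, observed, action_weight=0.1):
    """Deviation from the observed state sequence plus weighted action cost."""
    states = rollout(start, actions)
    deviation = sum(abs(s[0] - o[0]) + abs(s[1] - o[1])
                    for s, o in zip(states, observed))
    effort = sum(ACTION_COST[a] for a in actions)
    return deviation + action_weight * effort


def local_search(start, observed, action_weight=0.1):
    """Hill climbing: change one action at a time while the cost decreases."""
    actions = ["stay"] * len(observed)
    best = total_cost(start, actions, observed, action_weight)
    improved = True
    while improved:
        improved = False
        for i in range(len(actions)):
            for a in ACTIONS:
                if a == actions[i]:
                    continue
                candidate = actions[:i] + [a] + actions[i + 1:]
                cost = total_cost(start, candidate, observed, action_weight)
                if cost < best:  # accept only strict improvements
                    actions, best = candidate, cost
                    improved = True
    return actions, best


# The demonstrator moves diagonally; the imitator must approximate this.
start = (0, 0)
observed = [(1, 1), (2, 2), (3, 3)]
plan, plan_cost = local_search(start, observed)
print(plan, plan_cost)
```

Because only strict improvements are accepted and the set of action sequences is finite, the search terminates, but it may stop at a local optimum rather than the best possible plan. Raising `action_weight` in this sketch makes the imitator favor cheaper plans over closer tracking of the observed states.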


Bibliographic record

Source: International Florida Artificial Intelligence Research Society Conference and International Flairs Conference: Recent Advances in Artificial Intelligence; 2003 | 2003 | pp. 17-21 | 5 pages
Authors: Srichandan V. Gudla; Manfred Huber
Affiliation: Department of Computer Science Engineering, University of Texas at Arlington, Arlington, TX 76019-0015
Original format: PDF
Language: English
Chinese Library Classification: Automation system theory; Engineering simulation

