Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules

Abstract

This paper proposes Cooperative and competitive Reinforcement And Imitation Learning (CRAIL) for selecting an appropriate policy from a set of multiple heterogeneous modules and training all of them in parallel. Each learning module has its own network architecture and improves the policy based on an off-policy reinforcement learning algorithm and behavior cloning from samples collected by a behavior policy that is constructed by a combination of all the policies. Since the mixing weights are determined by the performance of the module, a better policy is automatically selected based on the learning progress. Experimental results on a benchmark control task show that CRAIL successfully achieves fast learning by allowing modules with complicated network structures to exploit task-relevant samples for training.
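The abstract states only that the behavior policy combines all module policies and that the mixing weights depend on each module's performance; it does not give the formula. As a minimal sketch of that idea, the Python below assumes the weights are a softmax over per-module performance scores and that one module is sampled to act at each step. The function names, the inverse temperature `beta`, and the example scores are all hypothetical and are not taken from the paper.

```python
import math
import random


def mixing_weights(performances, beta=1.0):
    """Softmax over per-module performance scores (an assumed form, not the paper's)."""
    top = max(performances)  # subtract the maximum for numerical stability
    exps = [math.exp(beta * (p - top)) for p in performances]
    total = sum(exps)
    return [e / total for e in exps]


def select_module(weights, rng=random):
    """Sample the index of the module whose policy acts next."""
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]


# Hypothetical running-average returns for three heterogeneous modules.
performances = [10.0, 12.0, 11.0]
weights = mixing_weights(performances, beta=1.0)
chosen = select_module(weights, random.Random(0))
```

Under this assumption, a module whose score improves receives a larger weight and is chosen more often, which matches the abstract's claim that a better policy is selected automatically as learning progresses. A larger `beta` makes the selection greedier; a smaller one keeps it closer to uniform.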

Bibliographic details

Journal: Frontiers in Neurorobotics
Author: Eiji Uchibe
Year (volume), issue: 2018 (12), -1
Year: 2018
Page: 61
Total pages: 11
Original format: PDF
Chinese Library Classification: Information science
Keywords: reinforcement learning; imitation learning; modular architecture; parallel learning; entropy-regularization; multiple importance sampling

Similar literature

1. Symbolization and Imitation Learning of Motion Sequence Using Competitive Modules [J]. Kazuyuki Samejima, Kenichi Katagiri, Kenji Doya. Electronics and Communications in Japan, Part 3: Fundamental Electronic Science. 2006, No. 9.
2. Imitation or innovation: To what extent do exploitative learning and exploratory learning foster imitation strategy and innovation strategy for sustained competitive advantage? [J]. Ali Murad. Technological Forecasting and Social Change. 2021, No. Apra.
3. Annealed cooperative-competitive learning of Mahalanobis-NRBF neural modules for nonlinear and chaotic differential function approximation [J]. Jiann-Ming Wu, Chun-Chang Wu, Ching-Wen Huang. Neurocomputing. 2014, No. JULa20.
4. Reinforcement Learning with Multiple Heterogeneous Modules: A Framework for Developmental Robot Learning [C]. Uchibe, E., Doya. 2005.
5. A comparison of cooperative-cooperative and cooperative-competitive goal structures and their effect on group problem-solving performance and student attitudes toward their learning environment [D]. Shumway, Steven LeRoy. 1999.
6. Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning [O]. Jiang Hua, Liangcai Zeng, Gongfa Li. 2021.
7. Anticipatory Model of Musical Style Imitation using Collaborative and Competitive Reinforcement Learning [O]. Cont, Arshia, Dubnov, Shlomo, Assayag, Gerard. 2007.