Learning modular neural network policies for multi-task and multi-robot transfer

机译：学习用于多任务和多机器人传输的模块化神经网络策略

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reinforcement learning (RL) can automate a wide variety of robotic skills, but learning each new skill requires considerable real-world data collection and manual representation engineering to design policy classes or features. Using deep reinforcement learning to train general purpose neural network policies alleviates some of the burden of manual representation engineering by using expressive policy classes, but exacerbates the challenge of data collection, since such methods tend to be less efficient than RL with low-dimensional, hand-designed representations. Transfer learning can mitigate this problem by enabling us to transfer information from one skill to another and even from one robot to another. We show that neural network policies can be decomposed into “task-specific” and “robot-specific” modules, where the task-specific modules are shared across robots, and the robot-specific modules are shared across all tasks on that robot. This allows for sharing task information, such as perception, between robots and sharing robot information, such as dynamics and kinematics, between tasks. We exploit this decomposition to train mix-and-match modules that can solve new robot-task combinations that were not seen during training. Using a novel approach to train modular neural networks, we demonstrate the effectiveness of our transfer method for enabling zero-shot generalization with a variety of robots and tasks in simulation for both visual and non-visual tasks.

机译：强化学习（RL）可以自动化各种机器人技能，但学习每项新技能需要相当多的真实数据收集和手动表示工程来设计策略类或功能。使用深度加强学习培训通用目的的神经网络政策通过使用表现力的政策课程减轻了手动表示工程的一些负担，但加剧了数据收集的挑战，因为这些方法往往比具有低维，手的RL效率低于RL - 指定的表示。转移学习可以通过使我们将信息从一个技能转移到另一个技能，甚至从一个机器人转移到另一个技能来缓解这个问题。我们表明，神经网络策略可以分解为“特定于任务特定”和“机器人特定的”模块，其中任务特定的模块在机器人中共享，并且机器人特定的模块在该机器人上的所有任务中共享。这允许在任务之间共享任务信息，例如感知，例如在任务之间共享机器人信息，例如动态和运动学。我们利用这种分解来训练混合和匹配模块，可以解决在培训期间没有看到的新机器人任务组合。使用一种新颖的培训模块化神经网络的方法，我们展示了传递方法的有效性，使零拍摄的常规通过各种机器人和任务在仿真中实现了视觉和非视觉任务。

著录项

来源
《IEEE International Conference on Robotics and Automation》|2017年|1 v.|共8页
会议地点
作者
Coline Devin; Abhishek Gupta; Trevor Darrell; Pieter Abbeel; Sergey Levine;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类机器人技术;
关键词
Neural networks; Robot sensing systems; Learning (artificial intelligence); Games; Training; Visualization;

机译：神经网络;机器人传感系统;学习（人工智能）;游戏;培训;可视化;

相似文献

外文文献
中文文献
专利

1. Evolutionary Multi-task Learning for Modular Knowledge Representation in Neural Networks [J] . Rohitash Chandra, Abhishek Gupta, Yew-Soon Ong, Neural processing letters . 2018,第3期

机译：神经网络中模块化知识表示的进化多任务学习
2. Multi-task transfer learning deep convolutional neural network: application to computer-aided diagnosis of breast cancer on mammograms [J] . Samala Ravi K., Chan Heang-Ping, Hadjiiski Lubomir M., Physics in medicine and biology. . 2017,第23期

机译：多任务转移学习深卷积神经网络：应用于乳房X线图乳腺癌的计算机辅助诊断
3. Learning programs is better than learning dynamics: A programmable neural network hierarchical architecture in a multi-task scenario [J] . Donnarumma Francesco, Prevete Roberto, de Giorgio Andrea, Adaptive Behavior . 2016,第1期

机译：学习程序胜于学习动力学：多任务场景中的可编程神经网络分层体系结构
4. Learning modular neural network policies for multi-task and multi-robot transfer [C] . Coline Devin, Abhishek Gupta, Trevor Darrell, IEEE International Conference on Robotics and Automation . 2017

机译：学习用于多任务和多机器人传输的模块化神经网络策略
5. Multi-task learning deep neural networks for automatic speech recognition [D] . Chen, Dongpeng. 2015

机译：多任务学习深度神经网络自动语音识别
6. Multi-task transfer learning deep convolutional neural network: Application to computer-aided diagnosis of breast cancer on mammograms [O] . Ravi K Samala, Heang-Ping Chan, Lubomir M Hadjiiski, -1

机译：多任务转移学习深度卷积神经网络：在乳腺钼靶计算机辅助诊断中的应用
7. Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer [O] . Devin, Coline, Gupta, Abhishek, Darrell, Trevor, 2016

机译：学习多任务和多机器人的模块化神经网络策略传递

Learning modular neural network policies for multi-task and multi-robot transfer

摘要

著录项

相似文献

相关主题

期刊订阅