Journal: IEEE Transactions on Robotics

Real-World Reinforcement Learning via Multifidelity Simulators

Abstract

Reinforcement learning (RL) can be a tool for designing policies and controllers for robotic systems. However, the cost of real-world samples remains prohibitive as many RL algorithms require a large number of samples before learning useful policies. Simulators are one way to decrease the number of required real-world samples, but imperfect models make deciding when and how to trust samples from a simulator difficult. We present a framework for efficient RL in a scenario where multiple simulators of a target task are available, each with varying levels of fidelity. The framework is designed to limit the number of samples used in each successively higher-fidelity/cost simulator by allowing a learning agent to choose to run trajectories at the lowest level simulator that will still provide it with useful information. Theoretical proofs of the framework's sample complexity are given and empirical results are demonstrated on a remote-controlled car with multiple simulators. The approach enables RL algorithms to find near-optimal policies in a physical robot domain with fewer expensive real-world samples than previous transfer approaches or learning without simulators.
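The core idea of the abstract, escalating to a more expensive simulator only when the cheaper one has no more useful information to offer, can be sketched with a toy example. The bandit-style abstraction below, the per-level bias/cost numbers, and the elimination rule are illustrative inventions, not the paper's actual MFRL algorithm: each "simulator" is a biased, noisy model of the same two-action task, and candidate actions are pruned at a cheap level before any samples are spent on them at the expensive "real robot" level.

```python
import random

# Hypothetical stand-in for a multifidelity simulator chain: model bias
# shrinks, and per-sample cost grows, as fidelity increases. The last
# level plays the role of the real robot.
SIM_BIAS = [0.30, 0.10, 0.0]              # worst-case model error per level
SIM_COST = [1, 10, 100]                   # relative cost of one sample
TRUE_REWARD = {"slow": 0.5, "fast": 0.8}  # true value of each action

def sample(level, action, rng):
    """Draw one noisy, biased reward from the simulator at `level`."""
    bias = SIM_BIAS[level] if action == "fast" else 0.0
    return TRUE_REWARD[action] - bias + rng.gauss(0, 0.05)

def learn_multifidelity(n_per_level=300, seed=0):
    rng = random.Random(seed)
    candidates = set(TRUE_REWARD)  # actions still worth testing
    estimates = {}
    total_cost = 0
    for level in range(len(SIM_BIAS)):
        # Re-estimate each surviving action's value at this fidelity.
        for action in sorted(candidates):
            rewards = [sample(level, action, rng) for _ in range(n_per_level)]
            estimates[action] = sum(rewards) / len(rewards)
            total_cost += n_per_level * SIM_COST[level]
        # Prune actions that cannot beat the current leader even after
        # granting them this level's worst-case bias plus a noise margin,
        # so the next (costlier) simulator sees fewer candidates.
        leader = max(candidates, key=estimates.get)
        slack = SIM_BIAS[level] + 0.05
        candidates = {a for a in candidates
                      if estimates[a] + slack >= estimates[leader]}
    best = max(candidates, key=estimates.get)
    return best, total_cost
```

With these numbers, the level-0 simulator cannot separate the two actions (its bias swamps the true gap), level 1 eliminates "slow", and only "fast" ever consumes samples at the expensive top level, which is the cost-saving pattern the framework formalizes.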

