首页> 外文会议> >Real time learning control of high d.o.f. robots: automatic generation of discrete states and learning transition models

【24h】

Real time learning control of high d.o.f. robots: automatic generation of discrete states and learning transition models

机译：高d.o.f.的实时学习控制机器人：自动生成离散状态和学习过渡模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a model-based RL approach to cope with continuous space of high D.O.F. robots, combining model learning and an actor-critic method. The model learner generates a discrete state-transition model that helps improvement of both the policy and state-representation. In general, model-based methods tends to fail in non-Markovian problems, but the proposed method, using actor-critic, can find good policies in such environments.

机译：我们提出了一种基于模型的RL方法来应对高D.O.F的连续空间。机器人，结合了模型学习和演员批评方法。模型学习器生成离散状态转换模型，该模型有助于改善策略和状态表示。通常，基于模型的方法往往会在非马尔可夫问题中失败，但是所提出的方法（使用行为批评家）可以在这种环境中找到良好的策略。

著录项

来源
《》|2003年|p.2316-2321|共6页
会议地点
作者
Kimura; H.; Kobayashi; S.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术 ;
关键词
position control; legged locomotion; real-time systems; discrete systems; learning (artificial intelligence); real time learning control; discrete states automatic generation; learning transition models; nonMarkovian problems; model-based methods; actor-critic methods; mobile robots;

机译：位置控制;腿部运动;实时系统;离散系统;学习（人工智能）;实时学习控制;离散状态自动生成;学习过渡模型;非马尔可夫问题;基于模型的方法;行为批评方法;移动机器人;

相似文献

外文文献
中文文献
专利

1. Discrete-time learning control for robotic manipulators [J] . TATSUYA SUZUKI, MASANORI YASUE, SHIGERU OKUMA, Advanced Robotics . 1995 ,第1期

机译：机器人的离散时间学习控制
2. Cartesian-based learning control for robots in discrete-time formulation [J] . Tso S.K., Ma Y.X. IEEE Transactions on Systems, Man, and Cybernetics . 1992 ,第5期

机译：离散时间下基于笛卡尔的机器人学习控制
3. Incremental online sparsification for model learning in real-time robot control [J] . Duy Nguyen-Tuong, Jan Peters Neurocomputing . 2011 ,第11期

机译：增量在线稀疏化，用于实时机器人控制中的模型学习
4. Real time learning control of high d.o.f. robots: Automatic generation of discrete states and learning transition models [C] . Hajime Kimura, Shigenobu Kobayashi SICE Annual Conference . 2003

机译：高D.O.F的实时学习控制。机器人：自动生成离散状态和学习过渡模型
5. From Model-based to Data-driven Discrete-time Iterative Learning Control [D] . Song, Bing 2019

机译：从基于模型到数据驱动的离散时间迭代学习控制
6. GadgetArm—Automatic Grasp Generation and Manipulation of 4-DOF Robot Arm for Arbitrary Objects Through Reinforcement Learning [O] . JoungMin Park, SangYoon Lee, JaeWoon Lee, 2020

机译：Gadgetarm-自动掌握4-DOF机器人手臂通过加固学习进行任意物体的生成和操纵
7. RTMBA: A Real-Time Model-Based Reinforcement Learning Architecture for Robot Control [O] . Todd Hester, Michael Quinlan, Peter Stone 2012

机译：RTMBA：基于实时模型的机器人控制强化学习架构

Real time learning control of high d.o.f. robots: automatic generation of discrete states and learning transition models

摘要

著录项

相似文献

相关主题

期刊订阅