Latent space policy search for robotics

机译：潜在空间策略搜索机器人

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Learning motor skills for robots is a hard task. In particular, a high number of degrees-of-freedom in the robot can pose serious challenges to existing reinforcement learning methods, since it leads to a high-dimensional search space. However, complex robots are often intrinsically redundant systems and, therefore, can be controlled using a latent manifold of much smaller dimensionality. In this paper, we present a novel policy search method that performs efficient reinforcement learning by uncovering the low-dimensional latent space of actuator redundancies. In contrast to previous attempts at combining reinforcement learning and dimensionality reduction, our approach does not perform dimensionality reduction as a preprocessing step but naturally combines it with policy search. Our evaluations show that the new approach outperforms existing algorithms for learning motor skills with high-dimensional robots.

机译：学习机器人的运动技能是一项艰巨的任务。特别是，机器人中的大量自由度会给现有的强化学习方法带来严峻的挑战，因为它会导致高维搜索空间。但是，复杂的机器人通常是本质上冗余的系统，因此，可以使用尺寸较小的潜在歧管进行控制。在本文中，我们提出了一种新颖的策略搜索方法，该方法通过发现执行器冗余的低维潜在空间来执行有效的强化学习。与之前尝试将强化学习和降维相结合的尝试相比，我们的方法不将降维作为预处理步骤，而是将其与策略搜索自然地结合在一起。我们的评估表明，该新方法优于使用高维机器人学习运动技能的现有算法。

著录项

来源
《IEEE/RSJ International Conference on Intelligent Robots and Systems》|2014年|1434-1440|共7页
会议地点
作者
Luck Kevin Sebastian; Neumann Gerhard; Berger Erik; Peters Jan; Amor Heni Ben;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Aerospace electronics; Equations; Joints; Learning (artificial intelligence); Mathematical model; Robots; Vectors;

机译：航空电子;方程;关节;学习（人工智能）;数学模型;机器人;矢量;

相似文献

外文文献
中文文献
专利

1. Learning in robotic manipulation: The role of dimensionality reduction in policy search methods Comment on "Hand synergies: Integration of robotics and neuroscience for understanding the control of biological and artificial hands" by Marco Santello et al [J] . Ficuciello Fanny, Siciliano Bruno Physics of life reviews . 2016,第Null期

机译：在机器人操纵中学习：降维在策略搜索方法中的作用Marco Santello等人在评论“手的协同作用：机器人和神经科学的融合以理解生物和人工手的控制”时发表了评论。
2. Human-in-the-Loop Differential Subspace Search in High-Dimensional Latent Space [J] . CHIA-HSING CHIU, YUKI KOYAMA, YU-CHI LAI, ACM Transactions on Graphics . 2020,第4CD期

机译：在高维潜空间中的LOOP差分子空间搜索
3. Efficient Robotic Object Search Via HIEM: Hierarchical Policy Learning With Intrinsic-Extrinsic Modeling [J] . Ye Xin, Yang Yezhou IEEE Robotics and Automation Letters . 2021,第3期

机译：有效的机器人对象搜索通过HIEM：具有内在外在建模的分层策略学习
4. Latent space policy search for robotics [C] . Luck Kevin Sebastian, Neumann Gerhard, Berger Erik, IEEE/RSJ International Conference on Intelligent Robots and Systems . 2014

机译：潜在空间政策搜索机器人学
5. Code Similarity Search in a Latent Space [D] . Qi, Letao. 2017

机译：潜在空间中的代码相似性搜索
6. In search of induction and latency periods: Space-time interaction accounting for residential mobility risk factors and covariates [O] . Geoffrey M Jacquez, Jaymie Meliker, Andy Kaufmann 2007

机译：寻找归纳期和潜伏期：时空相互作用说明居民流动性风险因素和协变量
7. Latent Space Policy Search for Robotics [O] . Kevin Sebastian Luck, Gerhard Neumann, Erik Berger, 2015

机译：潜在空间政策搜索机器人

Latent space policy search for robotics

摘要

著录项

相似文献

相关主题

期刊订阅