【24h】

Latent space policy search for robotics

机译:潜在空间策略搜索机器人

获取原文

摘要

Learning motor skills for robots is a hard task. In particular, a high number of degrees-of-freedom in the robot can pose serious challenges to existing reinforcement learning methods, since it leads to a high-dimensional search space. However, complex robots are often intrinsically redundant systems and, therefore, can be controlled using a latent manifold of much smaller dimensionality. In this paper, we present a novel policy search method that performs efficient reinforcement learning by uncovering the low-dimensional latent space of actuator redundancies. In contrast to previous attempts at combining reinforcement learning and dimensionality reduction, our approach does not perform dimensionality reduction as a preprocessing step but naturally combines it with policy search. Our evaluations show that the new approach outperforms existing algorithms for learning motor skills with high-dimensional robots.
机译:学习机器人的运动技能是一项艰巨的任务。特别是,机器人中的大量自由度会给现有的强化学习方法带来严峻的挑战,因为它会导致高维搜索空间。但是,复杂的机器人通常是本质上冗余的系统,因此,可以使用尺寸较小的潜在歧管进行控制。在本文中,我们提出了一种新颖的策略搜索方法,该方法通过发现执行器冗余的低维潜在空间来执行有效的强化学习。与之前尝试将强化学习和降维相结合的尝试相比,我们的方法不将降维作为预处理步骤,而是将其与策略搜索自然地结合在一起。我们的评估表明,该新方法优于使用高维机器人学习运动技能的现有算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号