IEEE Robotics and Automation Letters

Self-Supervised Learning of State Estimation for Manipulating Deformable Linear Objects



Abstract

We demonstrate model-based, visual robot manipulation of deformable linear objects. Our approach is based on a state-space representation of the physical system that the robot aims to control. This choice has multiple advantages, including the ease of incorporating physics priors into the dynamics and perception models, and the ease of planning manipulation actions. In addition, physical states can naturally represent object instances of different appearances. Therefore, dynamics in the state space can be learned in one setting and directly used in other, visually different settings. This is in contrast to dynamics learned in pixel space or latent space, where generalization to visual differences is not guaranteed. The challenges of the state-space approach are estimating the high-dimensional state of a deformable object from raw images, for which annotations are very expensive on real data, and finding a dynamics model that is accurate, generalizable, and efficient to compute. We are the first to demonstrate self-supervised training of rope state estimation on real images, without requiring expensive annotations. This is achieved by our novel self-supervised learning objective, which generalizes across a wide range of visual appearances. With the estimated rope states, we train a fast and differentiable neural network dynamics model that encodes the physics of mass-spring systems. Our method predicts future states more accurately than models that do not involve explicit state estimation and do not use any physics prior, while using only 3% of the training data. We also show that our approach achieves more efficient manipulation, both in simulation and on a real robot, when used within a model predictive controller.
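
To make the abstract's "differentiable neural network dynamics model that encodes the physics of mass-spring systems" and its use inside a model predictive controller more concrete, the following is a minimal PyTorch sketch under assumptions of our own, not the authors' implementation: a 2D rope of 50 point masses, a learned residual correction on top of a semi-implicit Euler mass-spring step, and random-shooting MPC. All class names, shapes, and hyperparameters (MassSpringRopeDynamics, random_shooting_mpc, spring/damping constants, horizon) are illustrative assumptions.

import torch
import torch.nn as nn


class MassSpringRopeDynamics(nn.Module):
    """Predicts the next rope state (N node positions/velocities in 2D)
    from the current state and a displacement applied to the grasped node."""

    def __init__(self, n_nodes=50, rest_len=0.02, k_spring=500.0,
                 damping=0.1, dt=0.01, hidden=128):
        super().__init__()
        self.n, self.rest_len = n_nodes, rest_len
        self.k, self.c, self.dt = k_spring, damping, dt
        # Learned residual that corrects the analytic mass-spring prior.
        self.residual = nn.Sequential(
            nn.Linear(n_nodes * 4 + 2, hidden), nn.ReLU(),
            nn.Linear(hidden, n_nodes * 2),
        )

    def forward(self, pos, vel, action):
        """pos, vel: (B, N, 2) node positions/velocities; action: (B, 2)
        displacement of the grasped (last) node during this time step."""
        seg = pos[:, 1:] - pos[:, :-1]                    # (B, N-1, 2)
        length = seg.norm(dim=-1, keepdim=True).clamp(min=1e-6)
        # Hooke's-law spring force along each segment.
        f_seg = self.k * (length - self.rest_len) * seg / length
        force = torch.zeros_like(pos)
        force[:, :-1] += f_seg                            # pull node i toward i+1
        force[:, 1:] -= f_seg                             # reaction on node i+1
        force = force - self.c * vel                      # damping
        # Semi-implicit Euler step of the mass-spring prior (unit masses).
        vel_next = vel + self.dt * force
        pos_next = pos + self.dt * vel_next
        # Neural residual conditioned on the full state and the action.
        feat = torch.cat([pos.flatten(1), vel.flatten(1), action], dim=-1)
        pos_next = pos_next + self.residual(feat).view_as(pos)
        # The grasped node follows the commanded displacement exactly.
        pos_next = pos_next.clone()
        pos_next[:, -1] = pos[:, -1] + action
        return pos_next, (pos_next - pos) / self.dt


def random_shooting_mpc(model, pos, vel, goal, n_samples=256, horizon=5):
    """Return the first action of the sampled sequence whose rollout
    ends closest to the goal configuration (all rollouts in parallel)."""
    actions = 0.02 * torch.randn(n_samples, horizon, 2)
    p = pos.expand(n_samples, -1, -1).clone()
    v = vel.expand(n_samples, -1, -1).clone()
    with torch.no_grad():
        for t in range(horizon):
            p, v = model(p, v, actions[:, t])
    cost = (p - goal).norm(dim=-1).mean(dim=-1)           # mean node distance
    return actions[cost.argmin(), 0]                      # best first action


if __name__ == "__main__":
    model = MassSpringRopeDynamics()
    pos = torch.stack([torch.linspace(0, 1, 50), torch.zeros(50)], -1).unsqueeze(0)
    vel = torch.zeros_like(pos)
    goal = pos + 0.1                                      # toy target configuration
    print("chosen action:", random_shooting_mpc(model, pos, vel, goal))

In this sketch, the analytic mass-spring update plays the role of the physics prior, while the small residual network is the learned component; because every step is differentiable, the model can be trained end to end on estimated rope states and queried cheaply inside the planner.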
