International Joint Conference on Neural Networks

Speeding Up Affordance Learning for Tool Use, Using Proprioceptive and Kinesthetic Inputs



Abstract

End-to-end learning in deep reinforcement learning based on raw visual input has shown great promise in various tasks involving sensorimotor control. However, complex tasks such as tool use require recognition of affordance and a series of non-trivial subtasks such as reaching the tool, grasping the tool, and wielding the tool. In such tasks, end-to-end approaches with only the raw input (e.g. pixel-wise images) may fail to learn to perform the task or may take too long to converge. In this paper, inspired by the biological sensorimotor system, we explore the use of proprioceptive/kinesthetic inputs (internal inputs for body position and motion) alongside raw visual inputs (exteroception, external perception) for affordance learning in tool use tasks. We set up a reaching task in a simulated physics environment (MuJoCo), where the agent has to pick up a T-shaped tool to reach and drag a target object to a designated region in the environment. We used an Actor-Critic-based reinforcement learning algorithm called ACKTR (Actor-Critic using Kronecker-Factored Trust Region) and trained it under various input conditions to assess the utility of proprioceptive/kinesthetic inputs. Our results show that the inclusion of proprioceptive/kinesthetic inputs (position and velocity of the limb) greatly enhances the performance of the agent: a higher success rate and faster convergence to the solution. The lesson we learned is that exteroception and proprioception are tightly intertwined in sensorimotor learning, and that although end-to-end learning based on raw input may be appealing, separating the exteroceptive and proprioceptive/kinesthetic factors in the learner's input, and providing the necessary internal inputs, can lead to faster, more effective learning.
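The input conditions the abstract compares amount to feeding the policy either raw pixels alone or raw pixels concatenated with the limb's joint positions and velocities. A minimal sketch of that observation assembly is below; the function name, image size, and 7-DoF arm state are illustrative assumptions, not details from the paper.

```python
import numpy as np

def build_observation(pixels, joint_pos, joint_vel, include_proprioception=True):
    """Assemble the agent's input vector.

    pixels: (H, W, C) uint8 frame from the simulated camera (exteroception).
    joint_pos, joint_vel: 1-D arrays of limb joint positions and velocities
    (the proprioceptive/kinesthetic inputs the paper adds).
    """
    # Normalize and flatten the raw visual stream.
    visual = (pixels.astype(np.float32) / 255.0).ravel()
    if not include_proprioception:
        return visual
    # Concatenate the internal body signals onto the visual stream.
    return np.concatenate([visual,
                           joint_pos.astype(np.float32),
                           joint_vel.astype(np.float32)])

# Hypothetical example: a 32x32 RGB frame plus a 7-DoF arm state.
frame = np.zeros((32, 32, 3), dtype=np.uint8)
obs_vis_only = build_observation(frame, np.zeros(7), np.zeros(7),
                                 include_proprioception=False)
obs_full = build_observation(frame, np.zeros(7), np.zeros(7))
print(obs_vis_only.shape, obs_full.shape)  # (3072,) (3086,)
```

Either vector could then be fed to an actor-critic learner such as ACKTR; the paper's comparison is between these two input conditions, with the proprioceptive terms accounting for the extra dimensions.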
