Analyzing Different Unstated Goal Constraints on Reinforcement Learning Algorithm for Reacher Task in the Robotic Scrub Nurse Application

机译：在机器人擦洗护士应用程序中，针对实现任务的强化学习算法分析不同的未阐明目标约束

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The main objective paper is to make an empirical analysis of the effect of various unstated spatial goal constraints on reinforcement learning policy for the “reacher” task in the Robotic Scrub Nurse (RSN) application. This “reacher” task is an essential part of the RSN manipulation task, such as the task of picking, grasping, or placing the surgical instruments. This paper provides our experimental results and the evaluation of the “reacher” task under different spatial goal constraints. We researched the effect of this unstated assumption on a reinforcement learning (RL) algorithm: Soft-Actor Critic with Hindsight Experience Replay (SAC+HER). We used the 7-DoF robotic arm to evaluate this state-of-the-art deep RL algorithm. We performed our experiments in a virtual environment while training the robotic arm to reach the random target points. The implementation of this RL algorithm showed a robust performance, which is measured by reward values and success rates. We observed, these reinforcement learning assumptions, particularly the unstated spatial goal constraints, can affect the performance of the RL agent. The important aspect of the “reacher” task and the development of reinforcement learning applications in medical robotics is one of the main motivations behind this research objective.

机译：主要目标文件是对机器人擦洗护士（RSN）应用程序中“到达者”任务的各种未阐明的空间目标约束对强化学习策略的影响进行实证分析。此“到达”任务是RSN操作任务的重要组成部分，例如拾取，抓握或放置手术器械的任务。本文提供了我们的实验结果以及在不同空间目标约束下对“到达者”任务的评估。我们研究了这种未阐明的假设对强化学习（RL）算法的影响：具有后视经验回放（SAC + HER）的软演员评论家。我们使用7自由度机械臂来评估这种最新的深度RL算法。我们在虚拟环境中进行实验，同时训练机械臂以达到随机目标点。此RL算法的实现显示出鲁棒的性能，该性能由奖励值和成功率来衡量。我们观察到，这些强化学习假设，特别是未说明的空间目标约束，可能会影响RL代理的性能。 “扩展”任务的重要方面以及医疗机器人中强化学习应用程序的开发是该研究目标的主要动机之一。

著录项

来源
《IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology》|2020年|42-47|共6页
会议地点
作者
Clinton Elian Gandana; Joel D. K. Disu; Hongzhi Xie; Lixu Gu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
“reacher” task; spatial constraints; Robotic Scrub Nurse; Reinforcement Learning; Soft-Actor Critic; Hindsight Experiment Replay;

机译：“到达者”任务;空间约束;机器人擦洗护士;强化学习;软演员评论家; Hindsight实验重播;

相似文献

外文文献
中文文献
专利

1. Path-Integral-Based Reinforcement Learning Algorithm for Goal-Directed Locomotion of Snake-Shaped Robot [J] . Qi Yongqiang, Yang Hailan, Rong Dan, Discrete dynamics in nature and society . 2021,第a期

机译：基于路径 - 积分的蛇形机器人目标机动的加强学习算法
2. SWIRL: A sequential windowed inverse reinforcement learning algorithm for robot tasks with delayed rewards [J] . Krishnan Sanjay, Garg Animesh, Liaw Richard, The International journal of robotics research . 2019,第2a3期

机译：SWIRL：顺序窗口逆强化学习算法，用于延迟奖励的机器人任务
3. Reinforcement Renaissance The power of deep neural networks has sparked renewed interest in reinforcement learning, with applications to games, robotics, and beyond [J] . Krakovsky Marina Communications of the ACM . 2016,第8期

机译：强化文艺复兴深度神经网络的力量激发了人们对强化学习及其在游戏，机器人技术及其他领域的应用的新兴趣。
4. Reinforcement Learning Experiments and Benchmark for Solving Robotic Reaching Tasks [C] . Pierre Aumjaud, David McAuliffe, Francisco Javier Rodríguez-Lera, International Workshop of Physical Agents . 2021

机译：解决机器人到达任务的加固学习实验和基准
5. Reinforcement Learning Algorithms for Representing and Managing Uncertainty in Robotics [D] . Martin, John D., Jr. 2021

机译：加强学习算法，用于在机器人学中代表和管理不确定性
6. Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms [O] . Kok-Lim Alvin Yau, Geong-Sen Poh, Su Fong Chien, -1

机译：强化学习在认知无线电网络中的应用：模型和算法
7. rl_reach: Reproducible reinforcement learning experiments for robotic reaching tasks [O] . Pierre Aumjaud, David McAuliffe, Francisco J. Rodríguez Lera, 2021

机译：RL_REACH：用于机器人到达任务的可重复加强学习实验

Analyzing Different Unstated Goal Constraints on Reinforcement Learning Algorithm for Reacher Task in the Robotic Scrub Nurse Application

摘要

著录项

相似文献

相关主题

期刊订阅