首页> 外文会议>IEEE International Conference on Robot and Human Interactive Communication >Collaborative Suturing: A Reinforcement Learning Approach to Automate Hand-off Task in Suturing for Surgical Robots
【24h】

Collaborative Suturing: A Reinforcement Learning Approach to Automate Hand-off Task in Suturing for Surgical Robots

机译:协作缝合:一种用于外科手术机器人缝合中自动交接任务的强化学习方法

获取原文

摘要

Over the past decade, Robot-Assisted Surgeries (RAS), have become more prevalent in facilitating successful operations. Of the various types of RAS, the domain of collaborative surgery has gained traction in medical research. Prominent examples include providing haptic feedback to sense tissue consistency, and automating sub-tasks during surgery such as cutting or needle hand-off - pulling and reorienting the needle after insertion during suturing. By fragmenting suturing into automated and manual tasks the surgeon could essentially control the process with one hand and also circumvent workspace restrictions imposed by the control interface present at the surgeon's side during the operation. This paper presents an exploration of a discrete reinforcement learning-based approach to automate the needle hand-off task. Users were asked to perform a simple running suture using the da Vinci Research Kit. The user trajectory was learnt by generating a sparse reward function and deriving an optimal policy using Q-learning. Trajectories obtained from three learnt policies were compared to the user defined trajectory. The results showed a root-mean-square error of [0.0044mm, 0.0027mm, 0.0020mm] in ℝ3. Additional trajectories from varying initial positions were produced from a single policy to simulate repeated passes of the hand-off task.
机译:在过去的十年中,机器人辅助手术(RAS)在促进成功手术方面变得越来越普遍。在各种类型的RAS中,协作外科领域在医学研究中得到了关注。突出的示例包括提供触觉反馈以感测组织的一致性,并在手术过程中自动执行子任务,例如切割或移交针头-在缝合过程中插入针头后将其拔出并重新定向。通过将缝合分割成自动和手动任务,外科医生基本上可以用一只手来控制过程,并且还可以规避手术期间外科医生一侧的控制界面所施加的工作空间限制。本文提出了一种基于离散强化学习的方法的方法,该方法可以自动完成针移交任务。要求用户使用da Vinci Research Kit进行简单的缝合线。通过生成稀疏奖励函数并使用Q学习获得最佳策略来学习用户轨迹。从三个学习到的策略获得的轨迹与用户定义的轨迹进行了比较。结果显示ℝ中的均方根误差为[0.0044mm,0.0027mm,0.0020mm] 3 。从单个策略产生了来自不同初始位置的其他轨迹,以模拟越区切换任务的重复通过。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号