首页> 外文会议>International Conference on Medical Image Computing and Computer-Assisted Intervention >Recognition of Instrument-Tissue Interactions in Endoscopic Videos via Action Triplets
【24h】

Recognition of Instrument-Tissue Interactions in Endoscopic Videos via Action Triplets

机译:通过动作三元组识别内窥镜视频中内窥镜视频中的仪器组织相互作用

获取原文

摘要

Recognition of surgical activity is an essential component to develop context-aware decision support for the operating room. In this work, we tackle the recognition of fine-grained activities, modeled as action triplets (instrument, verb, target) representing the tool activity. To this end, we introduce a new laparoscopic dataset, CholecT40, consisting of 40 videos from the public dataset Cholec80 in which all frames have been annotated using 128 triplet classes. Furthermore, we present an approach to recognize these triplets directly from the video data. It relies on a module called class activation guide, which uses the instrument activation maps to guide the verb and target recognition. To model the recognition of multiple triplets in the same frame, we also propose a trainable 3D interaction space, which captures the associations between the triplet components. Finally, we demonstrate the significance of these contributions via several ablation studies and comparisons to baselines on CholecT40.
机译:识别外科活动是为手术室制定背景感知决策支持的重要组成部分。在这项工作中,我们解决了识别细粒度的活动,建模为代表工具活动的动作三胞胎(仪器,动词,目标)。为此,我们介绍了一个新的腹腔镜数据集Cholect40,包括来自公共数据集CholeC80的40个视频,其中所有帧都使用了128个三联体类进行了注释。此外,我们提出了一种方法来直接从视频数据识别这些三联网。它依赖于名为Class激活指南的模块,它使用仪器激活映射来指导动词和目标识别。为了模拟同一帧中多个三胞胎的识别,我们还提出了可培训的3D交互空间,其捕获了三联组件之间的关联。最后,我们通过几种消融研究和对Cholect40的基线的比较来证明这些贡献的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号