首页> 美国卫生研究院文献>Healthcare Technology Letters >Learning the representation of instrument images in laparoscopy videos
【2h】

Learning the representation of instrument images in laparoscopy videos

机译:学习腹腔镜视频中仪器图像的表示

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Automatic recognition of instruments in laparoscopy videos poses many challenges that need to be addressed, like identifying multiple instruments appearing in various representations and in different lighting conditions, which in turn may be occluded by other instruments, tissue, blood, or smoke. Considering these challenges, it may be beneficial for recognition approaches that instrument frames are first detected in a sequence of video frames for further investigating only these frames. This pre-recognition step is also relevant for many other classification tasks in laparoscopy videos, such as action recognition or adverse event analysis. In this work, the authors address the task of binary classification to recognise video frames as either instrument or non-instrument images. They examine convolutional neural network models to learn the representation of instrument frames in videos and take a closer look at learned activation patterns. For this task, GoogLeNet together with batch normalisation is trained and validated using a publicly available dataset for instrument count classifications. They compared transfer learning with learning from scratch and evaluate on datasets from cholecystectomy and gynaecology. The evaluation shows that fine-tuning a pre-trained model on the instrument and non-instrument images is much faster and more stable in learning than training a model from scratch.
机译:腹腔镜视频中器械的自动识别提出了许多需要解决的挑战,例如识别以各种表示形式和在不同照明条件下出现的多种器械,而这些器械又可能被其他器械,组织,血液或烟雾所阻塞。考虑到这些挑战,对于识别方法而言可能有益的是,首先在视频帧序列中检测到仪器帧,以便仅进一步研究这些帧。此预识别步骤也与腹腔镜视频中的许多其他分类任务相关,例如动作识别或不良事件分析。在这项工作中,作者致力于二进制分类的任务,以将视频帧识别为仪器图像或非仪器图像。他们研究了卷积神经网络模型,以学习视频中仪器框架的表示,并仔细研究学习到的激活模式。对于此任务,使用可公开获得的用于仪器计数分类的数据集,对GoogLeNet和批次归一化进行了培训和验证。他们将迁移学习与从头开始学习进行了比较,并对胆囊切除术和妇科的数据集进行了评估。评估显示,与从头训练模型相比,在仪器和非仪表图像上微调预训练的模型在学习中要快得多且更稳定。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号