首页> 外文会议>Conference on advanced PACS-based imaging informatics and therapeutic applications >An Automatic System to Detect and Extract Text in Medical Images for De-identification
【24h】

An Automatic System to Detect and Extract Text in Medical Images for De-identification

机译:自动检测和提取医学图像中文本以进行去识别的系统

获取原文

摘要

Recently, there is an increasing need to share medical images for research purpose. In order to respect and preserve patient privacy, most of the medical images are de-identified with protected health information (PHI) before research sharing. Since manual de-identification is time-consuming and tedious, so an automatic de-identification system is necessary and helpful for the doctors to remove text from medical images. A lot of papers have been written about algorithms of text detection and extraction, however, little has been applied to de-identification of medical images. Since the de-identification system is designed for end-users, it should be effective, accurate and fast. This paper proposes an automatic system to detect and extract text from medical images for de-identification purposes, while keeping the anatomic structures intact. First, considering the text have a remarkable contrast with the background, a region variance based algorithm is used to detect the text regions. In post processing, geometric constraints are applied to the detected text regions to eliminate over-segmentation, e.g., lines and anatomic structures. After that, a region based level set method is used to extract text from the detected text regions. A GUI for the prototype application of the text detection and extraction system is implemented, which shows that our method can detect most of the text in the images. Experimental results validate that our method can detect and extract text in medical images with a 99% recall rate. Future research of this system includes algorithm improvement, performance evaluation, and computation optimization.
机译:近来,出于研究目的,越来越需要共享医学图像。为了尊重和保护患者的隐私,在共享研究成果之前,大多数医学图像都会用受保护的健康信息(PHI)进行标识。由于手动取消身份识别非常耗时且乏味,因此自动取消身份识别系统是必要的,对于医生从医学图像中删除文本非常有帮助。关于文本检测和提取算法的论文很多,但是很少用于医学图像的去识别。由于去识别系统是为最终用户设计的,因此它应该是有效,准确和快速的。本文提出了一种自动系统,用于从医学图像中检测和提取文本以进行识别,同时保持完整的解剖结构。首先,考虑到文本与背景的明显对比,使用基于区域差异的算法来检测文本区域。在后期处理中,将几何约束条件应用于检测到的文本区域,以消除过度分割的情况,例如线条和解剖结构。之后,使用基于区域的级别设置方法从检测到的文本区域中提取文本。实现了用于文本检测和提取系统的原型应用程序的GUI,这表明我们的方法可以检测图像中的大多数文本。实验结果验证了我们的方法能够以99%的查全率检测和提取医学图像中的文本。该系统的未来研究包括算法改进,性能评估和计算优化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号