DEDUCE: A pattern matching method for automatic de-identification of Dutch medical text

Menger Vincent; Scheepers Floor; van Wijk Lisette Maria; Spruit Marco

首页> 外文期刊>Telematics and Informatics >DEDUCE: A pattern matching method for automatic de-identification of Dutch medical text

【24h】

DEDUCE: A pattern matching method for automatic de-identification of Dutch medical text

机译：减少：一种模式匹配方法，用于自动取消识别荷兰医学文本

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In order to use medical text for research purposes, it is necessary to de-identify the text for legal and privacy reasons. We report on a pattern matching method to automatically de-identify medical text written in Dutch, which requires a low amount of effort to be hand tailored. First, a selection of Protected Health Information (PHI) categories is determined in cooperation with medical staff. Then, we devise a method for de-identifying all information in one of these PHI categories, that relies on lookup tables, decision rules and fuzzy string matching. Our de-identification method DEDUCE is validated on a test corpus of 200 nursing notes and 200 treatment plans obtained from the University Medical Center Utrecht (UMCU) in the Netherlands, achieving a total micro-averaged precision of 0.814, a recall of 0.916 and a F-1-score of 0.862. For person names, a recall of 0.964 was achieved, while no names of patients were missed.

机译：为了将医学文本用于研究目的，出于法律和隐私原因，有必要取消对文本的标识。我们报告了一种模式匹配方法，该方法可以自动识别以荷兰语编写的医学文本，这需要手工进行少量工作。首先，与医务人员合作确定受保护的健康信息（PHI）类别的选择。然后，我们设计了一种方法，该方法依赖于查找表，决策规则和模糊字符串匹配来对这些PHI类别之一中的所有信息进行去识别。我们从荷兰乌得勒支大学医学中心（UMCU）获得的200份护理笔记和200份治疗计划的测试语料库对我们的去识别方法DEDUCE进行了验证，总平均精确度达到0.814，召回率为0.916， F-1-得分为0.862。对于人名，召回率为0.964，而没有遗漏患者姓名。

著录项

来源
《Telematics and Informatics》 |2018年第4期|727-736|共10页
作者
Menger Vincent; Scheepers Floor; van Wijk Lisette Maria; Spruit Marco;
展开▼
作者单位

Univ Utrecht, Dept Informat & Comp Sci, POB 80089, NL-3508 TB Utrecht, Netherlands;

Univ Med Ctr Utrecht, Dept Psychiat, POB 85500, NL-3508 GA Utrecht, Netherlands;

Univ Utrecht, Dept Informat & Comp Sci, POB 80089, NL-3508 TB Utrecht, Netherlands;

Univ Utrecht, Dept Informat & Comp Sci, POB 80089, NL-3508 TB Utrecht, Netherlands;

展开▼
收录信息美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
De-identification; Dutch medical text; Pattern matching; Protected Health Information; Patient privacy;

机译：取消身份识别;荷兰医学文本;模式匹配;受保护的健康信息;患者隐私;

相似文献

外文文献
中文文献
专利

1. A Rule-Based System for Automatic De-identification of Medical Narrative Texts [J] . Jelena Ja?imovi?, Cvetana Krstev, Drago Jelovac Informatica: An International Journal of Computing and Informatics . 2015,第1期

机译：基于规则的医学叙事文本自动识别系统
2. A Rule-Based System for Automatic De-identification of Medical Narrative Texts [J] . Jelena Ja?imovi?, Cvetana Krstev, Drago Jelovac Informatica: An International Journal of Computing and Informatics . 2015,第1期

机译：基于规则的医学叙事文本自动识别系统
3. A Rule-Based System for Automatic De-identification of Medical Narrative Texts [J] . Jelena Ja?imovi?, Cvetana Krstev, Drago Jelovac Informatica: An International Journal of Computing and Informatics . 2015,第1期

机译：基于规则的医学叙事文本自动识别系统
4. An Automatic System to Detect and Extract Text in Medical Images for De-identification [C] . Yingxuan Zhul, PD Singh, Khan Siddiqui, Conference on advanced PACS-based imaging informatics and therapeutic applications . 2010

机译：自动检测和提取医学图像中文本以进行去识别的系统
5. NEW METHODS FOR MEASURING THE DYNAMIC SURFACE TENSION. (DUTCH TEXT) (COATING, ADSORPTION, DIFFUSION). [D] . VAN HAVENBERGH, JAN EMIEL. 1984

机译：测量表面张力的新方法。（荷兰文字）（涂层，吸附，扩散）。
6. Generalizability and Comparison of Automatic Clinical Text De-Identification Methods and Resources [O] . Óscar Ferrández, Brett R. South, Shuying Shen, 2012

机译：临床文本自动识别方法和资源的可推广性和比较
7. Other-Anaphora Resolution in Biomedical Texts with Automatically Mined Patterns [O] . Chen Bin, Yang Xiaofeng, Su Jian, 2013

机译：具有自动挖掘模式的生物医学文本中的其他 - 回指解析

DEDUCE: A pattern matching method for automatic de-identification of Dutch medical text

摘要

著录项

相似文献

相关主题

期刊订阅