首页> 外文会议>IEEE International Conference on Healthcare Informatics >Numerical Age Variations within Clinical Notes: The Potential Impact on De-Identification and Information Extraction
【24h】

Numerical Age Variations within Clinical Notes: The Potential Impact on De-Identification and Information Extraction

机译:临床节内的数值变化:对去识别和信息提取的潜在影响

获取原文

摘要

Many kinds of numbers and numerical concepts appear frequently in free text clinical notes from electronic health records, including patient ages. The variability in how ages are described may impact the success of information extraction strategies as well as the accuracy of de-identification systems. This brief paper describes an analysis of the variation in how numbers and numerical concepts are represented in clinical notes with respect to ages. We used an inverted index of approximately 100 million notes to obtain the frequency of various permutations of ages, including biologically implausible ages as well as age descriptions that might not be detected by many de-identification systems. Missing such rare, but nevertheless present, variations could result in missed information or even privacy violations.
机译:许多类型的数字和数值概念通常出现在电子健康记录中的自由文本临床笔记中,包括患者年龄。描述的变化可能会影响信息提取策略的成功以及去识别系统的准确性。本简要介绍描述了在临床记录中如何在相对于年龄在临床记录中表示的变化分析。我们使用了大约1亿笔记的倒指数,以获得各种年龄的各种排列的频率,包括生物学难以置信的年龄以及许多去识别系统可能无法检测到的年龄描述。缺少如此罕见的,但仍然存在,可能会导致错过的信息甚至隐私违规行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号